Title: Spectro-Temporal Modulations for Robust Speech Emotion Recognition
Authors: Yeh, Lan-Ying
Chi, Tai-Shih
Department of Electrical and Computer Engineering
Keywords: Emotion recognition; robust; spectro-temporal modulations
Issue date: 2010
Abstract: Speech emotion recognition has mostly been studied on clean speech. In this paper, joint spectro-temporal features (RS features) are extracted from an auditory model and applied to detect the emotional state of noisy speech. The noisy speech is derived from the Berlin Emotional Speech database by adding white and babble noise at various SNR levels. A clean-train/noisy-test scenario is investigated to simulate conditions with unknown noise sources. The sequential forward floating selection (SFFS) method is adopted to demonstrate the redundancy of the RS features, and further dimensionality reduction is then conducted. Compared with conventional MFCCs plus prosodic features, RS features yield higher recognition rates, especially under low-SNR conditions.
URI: http://hdl.handle.net/11536/25432
ISBN: 978-1-61782-123-3
Journal: 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-4
Start page: 789
End page: 792
Appears in collections: Conference Papers
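
Note on the abstract: the noisy test data are described as clean speech with noise added at various SNR levels. The following is a minimal sketch of how such mixing is commonly done, not the authors' procedure. The function names `add_noise_at_snr` and `measured_snr_db`, the 16 kHz sampling rate, the sine-wave stand-in for speech, and the 5 dB target are all assumptions made for illustration.

```python
import numpy as np


def add_noise_at_snr(clean, noise, snr_db):
    """Return clean + scaled noise so the mixture has the requested SNR (dB).

    Hypothetical helper: SNR is defined here as the ratio of mean signal
    power to mean noise power over the whole utterance.
    """
    clean = np.asarray(clean, dtype=float)
    noise = np.asarray(noise, dtype=float)
    if len(noise) < len(clean):
        raise ValueError("noise must be at least as long as the clean signal")
    noise = noise[: len(clean)]

    clean_power = np.mean(clean ** 2)
    noise_power = np.mean(noise ** 2)
    if noise_power == 0:
        raise ValueError("noise signal has zero power")

    # Noise power needed for the target SNR: P_noise = P_clean / 10^(SNR/10)
    target_noise_power = clean_power / (10 ** (snr_db / 10))
    scale = np.sqrt(target_noise_power / noise_power)
    return clean + scale * noise


def measured_snr_db(clean, noisy):
    """Recompute the SNR of a mixture from the clean reference."""
    residual = np.asarray(noisy, dtype=float) - np.asarray(clean, dtype=float)
    return 10 * np.log10(np.mean(np.square(clean)) / np.mean(residual ** 2))


# Illustrative input: a 1-second 220 Hz tone standing in for clean speech,
# mixed with white Gaussian noise at an assumed target of 5 dB SNR.
rng = np.random.default_rng(0)
sample_rate = 16000
clean = np.sin(2 * np.pi * 220 * np.arange(sample_rate) / sample_rate)
white = rng.standard_normal(sample_rate)
noisy = add_noise_at_snr(clean, white, snr_db=5.0)
```

Babble noise would be mixed the same way by passing a recorded babble segment as `noise`. In a clean-train/noisy-test setup, only the test utterances would be processed like this.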