Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Hsu, Chung-Chien | en_US |
dc.contributor.author | Chien, Jen-Tzung | en_US |
dc.contributor.author | Chi, Tai-Shih | en_US |
dc.date.accessioned | 2018-08-21T05:56:37Z | - |
dc.date.available | 2018-08-21T05:56:37Z | - |
dc.date.issued | 2014-01-01 | en_US |
dc.identifier.issn | 2308-457X | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/146419 | - |
dc.description.abstract | In this paper, a binary mask estimation algorithm is proposed based on modulations of speech. A multi-resolution spectro-temporal analytical auditory model is utilized to extract modulation features to estimate the binary mask, which is often used in speech segregation applications. The proposed method estimates noise from the beginning of each test sentence, a common approach seen in many conventional speech enhancement algorithms, to further enhance the modulation features. Experimental results demonstrate that the proposed method outperforms the AMS-GMM system in terms of the HIT-FA rate when estimating the binary mask. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | mask estimation | en_US |
dc.subject | spectro-temporal modulation | en_US |
dc.subject | frequency modulation | en_US |
dc.title | Binary Mask Estimation Based on Frequency Modulations | en_US |
dc.type | Proceedings Paper | en_US |
dc.identifier.journal | 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4 | en_US |
dc.citation.spage | 993 | en_US |
dc.citation.epage | 997 | en_US |
dc.contributor.department | 電機工程學系 | zh_TW |
dc.contributor.department | Department of Electrical and Computer Engineering | en_US |
dc.identifier.wosnumber | WOS:000395050100202 | en_US |
Appears in Collections: | Conference Papers |