Full metadata record
DC Field | Value | Language
dc.contributor.author | Hsu, Chung-Chien | en_US
dc.contributor.author | Chien, Jen-Tzung | en_US
dc.contributor.author | Chi, Tai-Shih | en_US
dc.date.accessioned | 2018-08-21T05:56:37Z | -
dc.date.available | 2018-08-21T05:56:37Z | -
dc.date.issued | 2014-01-01 | en_US
dc.identifier.issn | 2308-457X | en_US
dc.identifier.uri | http://hdl.handle.net/11536/146419 | -
dc.description.abstract | In this paper, a binary mask estimation algorithm is proposed based on modulations of speech. A multi-resolution spectro-temporal analytical auditory model is utilized to extract modulation features to estimate the binary mask, which is often used in speech segregation applications. The proposed method estimates noise from the beginning of each test sentence, a common approach seen in many conventional speech enhancement algorithms, to further enhance the modulation features. Experimental results demonstrate that the proposed method outperforms the AMS-GMM system in terms of the HIT-FA rate when estimating the binary mask. | en_US
dc.language.iso | en_US | en_US
dc.subject | mask estimation | en_US
dc.subject | spectro-temporal modulation | en_US
dc.subject | frequency modulation | en_US
dc.title | Binary Mask Estimation Based on Frequency Modulations | en_US
dc.type | Proceedings Paper | en_US
dc.identifier.journal | 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4 | en_US
dc.citation.spage | 993 | en_US
dc.citation.epage | 997 | en_US
dc.contributor.department | 電機工程學系 | zh_TW
dc.contributor.department | Department of Electrical and Computer Engineering | en_US
dc.identifier.wosnumber | WOS:000395050100202 | en_US
Appears in Collections: Conference Papers