Binary Mask Estimation Based on Frequency Modulations

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	Hsu, Chung-Chien	en_US
dc.contributor.author	Chien, Jen-Tzung	en_US
dc.contributor.author	Chi, Tai-Shih	en_US
dc.date.accessioned	2018-08-21T05:56:37Z	-
dc.date.available	2018-08-21T05:56:37Z	-
dc.date.issued	2014-01-01	en_US
dc.identifier.issn	2308-457X	en_US
dc.identifier.uri	http://hdl.handle.net/11536/146419	-
dc.description.abstract	In this paper, a binary mask estimation algorithm is proposed based on modulations of speech. A multi-resolution spectro-temporal analytical auditory model is utilized to extract modulation features to estimate the binary mask, which is often used in speech segregation applications. The proposed method estimates noise from the beginning of each test sentence, a common approach seen in many conventional speech enhancement algorithms, to further enhance the modulation features. Experimental results demonstrate that the proposed method outperforms the AMS-GMM system in terms of the HIT-FA rate when estimating the binary mask.	en_US
dc.language.iso	en_US	en_US
dc.subject	mask estimation	en_US
dc.subject	spectro-temporal modulation	en_US
dc.subject	frequency modulation	en_US
dc.title	Binary Mask Estimation Based on Frequency Modulations	en_US
dc.type	Proceedings Paper	en_US
dc.identifier.journal	15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4	en_US
dc.citation.spage	993	en_US
dc.citation.epage	997	en_US
dc.contributor.department	電機工程學系	zh_TW
dc.contributor.department	Department of Electrical and Computer Engineering	en_US
dc.identifier.wosnumber	WOS:000395050100202	en_US
顯示於類別：	會議論文