Full metadata record
DC Field | Value | Language
dc.contributor.author | Hwang, Hsin-Te | en_US
dc.contributor.author | Tsao, Yu | en_US
dc.contributor.author | Wang, Hsin-Min | en_US
dc.contributor.author | Wang, Yih-Ru | en_US
dc.contributor.author | Chen, Sin-Horng | en_US
dc.date.accessioned | 2014-12-08T15:30:53Z | -
dc.date.available | 2014-12-08T15:30:53Z | -
dc.date.issued | 2012 | en_US
dc.identifier.isbn | 978-1-62276-759-5 | en_US
dc.identifier.uri | http://hdl.handle.net/11536/22051 | -
dc.description.abstract | The Gaussian mixture model (GMM)-based method has dominated the field of voice conversion (VC) for the last decade. However, the converted spectra are excessively smoothed and thus produce a muffled converted sound. In this study, we improve speech quality by enhancing the dependency between the source feature vectors (natural sound) and the converted feature vectors (converted sound). It is believed that enhancing this dependency can make the converted sound closer to the natural sound. To this end, we propose an integrated maximum a posteriori and mutual information (MAPMI) criterion for parameter generation in spectral conversion. Experimental results demonstrate that the quality of speech converted by the proposed MAPMI method outperforms that of the conventional method in a formal listening test. | en_US
dc.language.iso | en_US | en_US
dc.subject | Voice conversion | en_US
dc.subject | mutual information | en_US
dc.subject | GMM | en_US
dc.title | A Study of Mutual Information for GMM-Based Spectral Conversion | en_US
dc.type | Proceedings Paper | en_US
dc.identifier.journal | 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | en_US
dc.citation.spage | 78 | en_US
dc.citation.epage | 81 | en_US
dc.contributor.department | 電機工程學系 | zh_TW
dc.contributor.department | Department of Electrical and Computer Engineering | en_US
dc.identifier.wosnumber | WOS:000320827200020 | -
Appears in Collections: Conference Papers
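Background for the abstract above: the "conventional method" it contrasts against is commonly the joint-density GMM mapping, in which each source spectral feature vector x is converted via the minimum mean-square-error (conditional-expectation) estimate of the target feature y. The display below is a standard formulation from the voice-conversion literature, given here only as context; it is not reproduced from this paper, and the symbols (mixture weights, per-component means and covariance blocks of the joint GMM) are the usual textbook notation rather than the paper's own.

% Standard joint-density GMM conversion function (literature formulation, not from this paper).
% The posterior-weighted sum over mixture components is the averaging that tends to
% over-smooth the converted spectra, which the abstract's MAPMI criterion is meant to address.
\[
  \hat{\mathbf{y}}
  = \mathrm{E}\!\left[\mathbf{y}\mid\mathbf{x}\right]
  = \sum_{m=1}^{M} P(m\mid\mathbf{x})
    \Bigl[
      \boldsymbol{\mu}_{m}^{(y)}
      + \boldsymbol{\Sigma}_{m}^{(yx)}\bigl(\boldsymbol{\Sigma}_{m}^{(xx)}\bigr)^{-1}
        \bigl(\mathbf{x} - \boldsymbol{\mu}_{m}^{(x)}\bigr)
    \Bigr]
\]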