Full metadata record
DC Field | Value | Language
dc.contributor.author | Hwang, Hsin-Te | en_US
dc.contributor.author | Tsao, Yu | en_US
dc.contributor.author | Wang, Hsin-Min | en_US
dc.contributor.author | Wang, Yih-Ru | en_US
dc.contributor.author | Chen, Sin-Horng | en_US
dc.date.accessioned | 2014-12-08T15:30:03Z | -
dc.date.available | 2014-12-08T15:30:03Z | -
dc.date.issued | 2012 | en_US
dc.identifier.isbn | 978-1-4673-2507-3 | en_US
dc.identifier.uri | http://hdl.handle.net/11536/21518 | -
dc.description.abstract | In this paper, we propose a maximum mutual information (MMI) training criterion to refine the parameters of the joint density GMM (JDGMM) set, in order to tackle the over-smoothing issue in voice conversion (VC). Conventionally, the maximum likelihood (ML) criterion is used to train a JDGMM set, which characterizes the joint distribution of the source and target feature vectors. The MMI training criterion, on the other hand, updates the parameters of the JDGMM set to increase its capability of modeling the dependency between the source and target feature vectors, and thus to make the converted sounds closer to natural ones. Subjective listening tests demonstrate that the quality and individuality of the speech converted by the proposed ML-followed-by-MMI (ML+MMI) training method are better than those obtained with ML training alone. | en_US
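The abstract refers to the standard JDGMM-based spectral conversion framework: a GMM is trained on stacked source/target vectors z = [x; y], and a source frame is mapped to a target frame by the posterior-weighted conditional means E[y | x, m]. Below is a minimal sketch of that conversion function (the ML baseline the paper refines; the MMI parameter update itself is not shown). All names and shapes are illustrative, not from the paper.

```python
import numpy as np

def jdgmm_convert(x, weights, means, covs):
    """Map a source spectral frame x (shape (D,)) to a target frame using a
    joint-density GMM over stacked vectors z = [x; y].

    weights: (M,)         mixture weights
    means:   (M, 2D)      joint means [mu_x; mu_y] per component
    covs:    (M, 2D, 2D)  joint covariances per component
    Returns the minimum mean-square-error estimate of y given x.
    """
    D = x.shape[0]
    M = weights.shape[0]
    resp = np.zeros(M)          # p(m | x), up to a common normalizer
    cond = np.zeros((M, D))     # E[y | x, m]
    for m in range(M):
        mu_x, mu_y = means[m, :D], means[m, D:]
        S_xx = covs[m, :D, :D]  # source-source covariance block
        S_yx = covs[m, D:, :D]  # target-source cross-covariance block
        diff = x - mu_x
        # marginal likelihood of x under component m (Gaussian density)
        _, logdet = np.linalg.slogdet(S_xx)
        maha = diff @ np.linalg.solve(S_xx, diff)
        resp[m] = weights[m] * np.exp(-0.5 * (maha + logdet + D * np.log(2 * np.pi)))
        # conditional mean of y given x for component m
        cond[m] = mu_y + S_yx @ np.linalg.solve(S_xx, diff)
    resp /= resp.sum()
    return resp @ cond          # posterior-weighted mixture of conditional means
```

Because the conversion averages conditional means over components, consecutive converted frames tend to be overly smooth; that is the over-smoothing issue the MMI refinement targets.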
dc.language.iso | en_US | en_US
dc.subject | Voice conversion | en_US
dc.subject | mutual information | en_US
dc.subject | GMM | en_US
dc.title | EXPLORING MUTUAL INFORMATION FOR GMM-BASED SPECTRAL CONVERSION | en_US
dc.type | Proceedings Paper | en_US
dc.identifier.journal | 2012 8TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING | en_US
dc.citation.spage | 50 | en_US
dc.citation.epage | 54 | en_US
dc.contributor.department | 電機資訊學士班 | zh_TW
dc.contributor.department | Undergraduate Honors Program of Electrical Engineering and Computer Science | en_US
dc.identifier.wosnumber | WOS:000316984700018 | -
Appears in Collections: Conference Paper