Latent prosody model of continuous mandarin speech

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	Chiang, Chen-Yu	en_US
dc.contributor.author	Wang, Xiao-Dong	en_US
dc.contributor.author	Liao, Yuan-Fu	en_US
dc.contributor.author	Wang, Yih-Ru	en_US
dc.contributor.author	Chen, Sin-Horng	en_US
dc.contributor.author	Hirose, Keikichi	en_US
dc.date.accessioned	2014-12-08T15:14:49Z	-
dc.date.available	2014-12-08T15:14:49Z	-
dc.date.issued	2007	en_US
dc.identifier.issn	1520-6149	en_US
dc.identifier.uri	http://hdl.handle.net/11536/11190	-
dc.description.abstract	The major difficulty of prosody modeling and automatic tone recognition of continuous Mandarin speech is the complex interaction of tones and prosody/intonation on F0 contours. In this study, we propose a latent prosody model (LPM) aiming to jointly model the affections of tone and prosody state on F0. The main purposes are twofold including (1) automatic prosody state labeling and (2) improving tone recognition accuracy. The basic idea is to introduce latent prosody state variables into an additive statistic model of F0 which already considers the affecting factors of tone and speaker. Experiments on the Tree-Bank corpus showed that LPM not only gave meaningful prosody state labeling results but also improved the average tone recognition rate from 80.86% of a multi-layer perceptron (MLP) baseline to 82.55%.	en_US
dc.language.iso	en_US	en_US
dc.subject	speech processing	en_US
dc.subject	speech recognition	en_US
dc.subject	tone recognition	en_US
dc.title	Latent prosody model of continuous mandarin speech	en_US
dc.type	Proceedings Paper	en_US
dc.identifier.journal	2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol IV, Pts 1-3	en_US
dc.citation.spage	625	en_US
dc.citation.epage	628	en_US
dc.contributor.department	電信工程研究所	zh_TW
dc.contributor.department	Institute of Communications Engineering	en_US
dc.identifier.wosnumber	WOS:000248909200157	-
顯示於類別：	會議論文