Full metadata record
DC Field | Value | Language
dc.contributor.author | Chiang, Chen-Yu | en_US
dc.contributor.author | Wang, Xiao-Dong | en_US
dc.contributor.author | Liao, Yuan-Fu | en_US
dc.contributor.author | Wang, Yih-Ru | en_US
dc.contributor.author | Chen, Sin-Horng | en_US
dc.contributor.author | Hirose, Keikichi | en_US
dc.date.accessioned | 2014-12-08T15:14:49Z | -
dc.date.available | 2014-12-08T15:14:49Z | -
dc.date.issued | 2007 | en_US
dc.identifier.issn | 1520-6149 | en_US
dc.identifier.uri | http://hdl.handle.net/11536/11190 | -
dc.description.abstract | The major difficulty in prosody modeling and automatic tone recognition of continuous Mandarin speech is the complex interaction of tones and prosody/intonation on F0 contours. In this study, we propose a latent prosody model (LPM) that jointly models the effects of tone and prosody state on F0. The purposes are twofold: (1) automatic prosody state labeling and (2) improving tone recognition accuracy. The basic idea is to introduce latent prosody state variables into an additive statistical model of F0 that already accounts for the effects of tone and speaker. Experiments on the Tree-Bank corpus showed that LPM not only gave meaningful prosody state labeling results but also improved the average tone recognition rate from 80.86% for a multi-layer perceptron (MLP) baseline to 82.55%. | en_US
dc.language.iso | en_US | en_US
dc.subject | speech processing | en_US
dc.subject | speech recognition | en_US
dc.subject | tone recognition | en_US
dc.title | Latent prosody model of continuous Mandarin speech | en_US
dc.type | Proceedings Paper | en_US
dc.identifier.journal | 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol IV, Pts 1-3 | en_US
dc.citation.spage | 625 | en_US
dc.citation.epage | 628 | en_US
dc.contributor.department | 電信工程研究所 | zh_TW
dc.contributor.department | Institute of Communications Engineering | en_US
dc.identifier.wosnumber | WOS:000248909200157 | -
Appears in Collections: Conference Papers