Full metadata record
DC Field | Value | Language
dc.contributor.author | Chiang, Chen-Yu | en_US
dc.contributor.author | Tang, Cheng-Chang | en_US
dc.contributor.author | Yu, Hsiu-Min | en_US
dc.contributor.author | Wang, Yih-Ru | en_US
dc.contributor.author | Chen, Sin-Horng | en_US
dc.date.accessioned | 2014-12-08T15:20:05Z | -
dc.date.available | 2014-12-08T15:20:05Z | -
dc.date.issued | 2009 | en_US
dc.identifier.isbn | 978-1-4244-4399-4 | en_US
dc.identifier.uri | http://hdl.handle.net/11536/14234 | -
dc.description.abstract | In this paper, the prosody of a parallel multi-speaking rate Mandarin read speech corpus is investigated. The corpus contains four parallel speech datasets uttered by a female professional announcer with various speech rates (SRs) of 4.40 (fast), 3.82 (normal), 2.97 (median) and 2.45 (slow) syllables/second. By using the unsupervised joint prosody labeling and modeling (PLM) method proposed previously, the relationship between SR and various prosodic features, including pause duration, patterns of three high-level prosodic constituents, and the break labels, are investigated. The analyses reported in this study could be very informative in developing prosody generation mechanism for text-to-speech and prosody modeling for automatic speech recognition in various SRs. | en_US
dc.language.iso | en_US | en_US
dc.title | An Investigation on the Mandarin Prosody of a Parallel Multi-Speaking Rate Speech Corpus | en_US
dc.type | Article | en_US
dc.identifier.journal | ORIENTAL COCOSDA 2009 - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS | en_US
dc.citation.spage | 148 | en_US
dc.citation.epage | 153 | en_US
dc.contributor.department | 電信工程研究所 | zh_TW
dc.contributor.department | Institute of Communications Engineering | en_US
dc.identifier.wosnumber | WOS:000278568800027 | -
Appears in collections: Conference Papers