Title: | An Investigation on the Mandarin Prosody of a Parallel Multi-Speaking Rate Speech Corpus |
Authors: | Chiang, Chen-Yu; Tang, Cheng-Chang; Yu, Hsiu-Min; Wang, Yih-Ru; Chen, Sin-Horng; Institute of Communications Engineering |
Issue Date: | 2009 |
Abstract: | In this paper, the prosody of a parallel multi-speaking rate Mandarin read speech corpus is investigated. The corpus contains four parallel speech datasets uttered by a female professional announcer at speech rates (SRs) of 4.40 (fast), 3.82 (normal), 2.97 (median) and 2.45 (slow) syllables/second. Using the previously proposed unsupervised joint prosody labeling and modeling (PLM) method, the relationships between SR and various prosodic features, including pause duration, patterns of three high-level prosodic constituents, and break labels, are investigated. The analyses reported in this study could be very informative in developing prosody generation mechanisms for text-to-speech and prosody modeling for automatic speech recognition at various SRs. |
URI: | http://hdl.handle.net/11536/14234 |
ISBN: | 978-1-4244-4399-4 |
Journal: | ORIENTAL COCOSDA 2009 - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS |
Start Page: | 148 |
End Page: | 153 |
Appears in Collections: | Conferences Paper |