| 標題: | An Investigation on the Mandarin Prosody of a Parallel Multi-Speaking Rate Speech Corpus |
| 作者: | Chiang, Chen-Yu Tang, Cheng-Chang Yu, Hsiu-Min Wang, Yih-Ru Chen, Sin-Horng 電信工程研究所 Institute of Communications Engineering |
| 公開日期: | 2009 |
| 摘要: | In this paper, the prosody of a parallel multi-speaking rate Mandarin read speech corpus is investigated. The corpus contains four parallel speech datasets uttered by a female professional announcer with various speech rates (SRs) of 4.40 (fast), 3.82 (normal), 2.97 (median) and 2.45 (slow) syllables/second. By using the unsupervised joint prosody labeling and modeling (PLM) method proposed previously, the relationship between SR and various prosodic features, including pause duration, patterns of three high-level prosodic constituents, and the break labels, are investigated. The analyses reported in this study could be very informative in developing prosody generation mechanism for text-to-speech and prosody modeling for automatic speech recognition in various SRs. |
| URI: | http://hdl.handle.net/11536/14234 |
| ISBN: | 978-1-4244-4399-4 |
| 期刊: | ORIENTAL COCOSDA 2009 - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS |
| 起始頁: | 148 |
| 結束頁: | 153 |
| Appears in Collections: | Conferences Paper |

