Title: An Investigation on the Mandarin Prosody of a Parallel Multi-Speaking Rate Speech Corpus
Authors: Chiang, Chen-Yu
Tang, Cheng-Chang
Yu, Hsiu-Min
Wang, Yih-Ru
Chen, Sin-Horng
Institute of Communications Engineering (電信工程研究所)
Publication date: 2009
Abstract: In this paper, the prosody of a parallel multi-speaking rate Mandarin read speech corpus is investigated. The corpus contains four parallel speech datasets uttered by a female professional announcer at speech rates (SRs) of 4.40 (fast), 3.82 (normal), 2.97 (median) and 2.45 (slow) syllables/second. Using the previously proposed unsupervised joint prosody labeling and modeling (PLM) method, we investigate the relationships between SR and various prosodic features, including pause duration, patterns of three high-level prosodic constituents, and break labels. The analyses reported in this study could be informative for developing prosody generation mechanisms for text-to-speech and prosody modeling for automatic speech recognition at various SRs.
URI: http://hdl.handle.net/11536/14234
ISBN: 978-1-4244-4399-4
Published in: ORIENTAL COCOSDA 2009 - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS
Start page: 148
End page: 153
Appears in collections: Conference papers