Title: | An Exploration of Local Speaking Rate Variations in Mandarin Read Speech |
Authors: | Liou, Guan-Tin; Chiang, Chen-Yu; Wang, Yih-Ru; Chen, Sin-Horng; Department of Electrical and Computer Engineering |
Keywords: | speaking rate;SR-HPM;speech rate;articulation rate;prosody;text-to-speech;Mandarin |
Issue Date: | 1-Jan-2018 |
Abstract: | This paper explores speaking rate variation in Mandarin read speech. Rather than assuming that each utterance is produced at a constant, global speaking rate, this study estimates a local speaking rate for each prosodic unit in an utterance. The exploration is based on the existing speaking rate-dependent hierarchical prosodic model (SR-HPM). The main idea is to first use the SR-HPM to explore the prosodic structures of utterances and extract the prosodic units. Then, a local speaking rate is estimated for each prosodic unit (the prosodic phrase in this study). Several major influence factors, including tone, base syllable type, prosodic structure, and the speaking rate of the higher prosodic units (utterance and BG/PG), are compensated for in the local SR estimation. A syntactic-local SR model is constructed and used in the prosody generation of Mandarin TTS. Experimental results on a large read speech corpus recorded by a professional female announcer showed that prosody generated with local speaking rate variations was more vivid than prosody generated with a constant speaking rate. |
URI: | http://hdl.handle.net/11536/152013 |
ISBN: | 978-1-5108-7221-9 |
ISSN: | 2308-457X |
Journal: | 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES |
Start page: | 42 |
End page: | 46 |
Appears in Collections: | Conference Papers |
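
The contrast the abstract draws between a global speaking rate and a local, per-phrase speaking rate can be illustrated with a minimal Python sketch. This is not the SR-HPM or the authors' estimator. It assumes, purely for illustration, that each syllable is given as a hypothetical `(phrase_id, tone, duration_seconds)` tuple with phrase boundaries already known, and that the only influence factor compensated is tone, removed as a simple mean-duration offset. The function names `global_rate`, `tone_offsets`, and `local_rates` are likewise invented here.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical input format (an assumption, not from the paper):
# each syllable is a tuple (phrase_id, tone, duration_seconds).

def global_rate(syllables):
    """Utterance-level speaking rate in syllables per second."""
    total = sum(d for _, _, d in syllables)
    return len(syllables) / total

def tone_offsets(syllables):
    """Mean duration of each tone minus the overall mean duration."""
    overall = mean(d for _, _, d in syllables)
    by_tone = defaultdict(list)
    for _, tone, d in syllables:
        by_tone[tone].append(d)
    return {t: mean(ds) - overall for t, ds in by_tone.items()}

def local_rates(syllables):
    """Per-phrase rate (syllables per second) after removing tone offsets.

    Simplification: only tone is compensated. The abstract also lists base
    syllable type, prosodic structure, and higher-level speaking rate.
    """
    offsets = tone_offsets(syllables)
    by_phrase = defaultdict(list)
    for phrase, tone, d in syllables:
        by_phrase[phrase].append(d - offsets[tone])
    return {p: len(ds) / sum(ds) for p, ds in by_phrase.items()}

if __name__ == "__main__":
    # Toy data: phrase 0 is spoken faster than phrase 1.
    utterance = [
        (0, 1, 0.20), (0, 4, 0.20),
        (1, 1, 0.40), (1, 4, 0.40),
    ]
    print("global:", global_rate(utterance))
    print("local:", local_rates(utterance))
```

On the toy data the two phrases receive different local rates even though the utterance has a single global rate, which is the kind of variation the paper sets out to model.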