標題: A high-performance Min-Nan/Taiwanese TTS system
作者: Kuo, WC
Zhong, XR
Wang, YR
Chen, SH
電信工程研究所
Institute of Communications Engineering
公開日期: 2003
摘要: In this paper, the implementation of a high-performance Min-Nan/Taiwanese TTS system is presented. The system can convert both Min-Nan/Taiwanese texts, represented in a hybrid Han-Lo written form, and Chinese texts into natural Taiwanese speeches. It is an improved version of the system developed previously [11]. Improvements include: the add of a "Chinese-to-Min-Nan/Taiwanese" lexicon to solve the OOV problem and to increase the ability of processing Chinese text; the use of explicit tone sandhi rules to ease the learning of prosody generation; a further processing of the training database to detect all breaks not associated with PMs; and the use of four RNNs to separately generate four types of prosodic parameters. The system is implemented by software and runs in real-time on PC. An informal subjective listening test confirmed that the system performed well. All synthetic speeches sounded natural for well-tokenized Min-Nan/Taiwanese texts and for automatic tokenized Chinese texts.
URI: http://hdl.handle.net/11536/18681
ISBN: 0-7803-7663-3
期刊: 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I
起始頁: 512
結束頁: 515
顯示於類別:會議論文