標題: A STATISTICAL-MODEL BASED FUNDAMENTAL-FREQUENCY SYNTHESIZER FOR MANDARINE SPEECH
作者: CHEN, SH
CHANG, S
LEE, SM
電信工程研究所
電信研究中心
Institute of Communications Engineering
Center for Telecommunications Research
公開日期: 1-七月-1992
摘要: A novel method based on a statistical model for the fundamental-frequency (F0) synthesis in Mandarin text-to-speech is proposed. Specifically, a statistical model is employed to determine the relationship between F0 contour patterns of syllables and linguistic features representing the context. Parameters of the model were empirically estimated from a large training set of sentential utterances. Phonologic rules are then automatically deduced through the training process and implicitly memorized in the model. In the synthesis process, contextual features are extracted from a given input text, and the best estimates of F0 contour patterns of syllable are then found by a Viterbi algorithm using the well-trained model. This method can be regarded as employing a stochastic grammar to reduce the number of candidates of F0 contour pattern at each decision point of synthesis. Although linguistic features on various levels of input text can be incorporated into the model, only some relevant contextual features extracted from neighboring syllables were used in this study. Performance of this method was examined by simulation using a database composed of nine repetitions of 112 declarative sentential utterances of the same text, all spoken by a single speaker. By closely examining the well-trained model, some evidence was found to show that the declination effect as well as several sandhi rules are implicitly contained in the model. Experimental results show that 77.56% of synthesized F0 contours coincide with the VQ-quantized counterpart of the original natural speech. Naturalness of the synthesized speech was confirmed by an informal listening test.
URI: http://hdl.handle.net/11536/3355
ISSN: 0001-4966
期刊: JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA
Volume: 92
Issue: 1
起始頁: 114
結束頁: 120
顯示於類別:期刊論文


文件中的檔案:

  1. A1992JD13400009.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。