标题: 汉语多种腔调语速相依韵律模型之建立与其在语音合成之应用
A Modeling of Accent-Dependent SR-HPM for Mandarin Speech and its Application to TTS
作者: 郭姿秀
陈信宏
林正中
Kuo, Tzu-Hsiu
Chen, Sin-Horng
Lin, Cheng-Chung
资讯科学与工程研究所
关键字: 韵律标记;语者调适;腔调;语音合成;语速;汉语;prosody tag;speaker adaptation;speaking rate;mandarin;accent;HTS
公开日期: 2017
摘要: 本论文应用现有的语速相依阶层式韵律模型(SR-HPM)来探讨汉语腔调的模式,首先由204位语者所产生的语料建立一个多语者SR-HPM,接着使用调适训练技术,将多语者SR-HPM当作样本模型来产生四个腔调的SR-HPMs,藉由分析此四个腔调的SR-HPMs我们可以观察到个别腔调的许多韵律发音的特性,这些实验结果与我们现有的语言学知识相符。最后实作完成个人化TTS系统,让合成语音的韵律符合个人所属的腔调,由主客观评测证实这些个人化TTS系统有很好的效能。
This thesis discusses the accent modeling of multi-speaker Mandarin speech based on the existing speaker-dependnet hierarchical prosodic model (SR-HPM). It first constructs a compact multi-speaker SR-HPM using a speech corpus produced by 204 speakers with different accents. It then adopts the adaptative training technique to construct four accent-dependent SR-HPMs with the multi-speaker SR-HPM as the reference model. Through analyzing these four models, many distinct prosody pronunciation features for each accent of Mandarin speech can be found. These observations conform to our prior linguistic knowledge. An application of using these accent-dependnet SR-HPMs to construct personalized TTS systems with their own accent is realized. Both objective and subjective tests confirmed the high performances of these TTS systems
URI: http://etd.lib.nctu.edu.tw/cdrfb3/record/nctu/#GT070456144
http://hdl.handle.net/11536/141827
显示于类别:Thesis