標題: STRUCTURAL MAXIMUM A POSTERIORI SPEAKER ADAPTATION OF SPEAKING RATE-DEPENDENT HIERARCHICAL PROSODIC MODEL FOR MANDARIN TTS
作者: Liao, I-Bin
Chiang, Chen-Yu
Chen, Sin-Horng
電機學院
College of Electrical and Computer Engineering
關鍵字: speaker adaptation;hierarchical prosodic model;prosodic-acoustic features;Mandarin TTS
公開日期: 2016
摘要: In this paper, a structural maximum a posterior speaker adaptation method to adjust the existing speaking rate (SR) dependent hierarchical prosodic model (SR-HPM) to a new speaker\'s data for realizing a new voice of any given SR is discussed. The adaptive SR-HPM is formulated based on MAP estimation with a reference SR-HPM serving as an informative prior. The prior information provided by the reference SR-HPM is hierarchically organized by decision trees. The results of objective and subjective evaluations showed that the proposed method not only performed slightly better than the maximum likelihood-based model in the observed SR range of the target speaker\'s data, but also was much better in the unseen SR range.
URI: http://hdl.handle.net/11536/136366
ISBN: 978-1-4799-9988-0
ISSN: 1520-6149
期刊: 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS
起始頁: 5625
結束頁: 5629
顯示於類別:會議論文