Title: Prosody-dependent Acoustic Modeling for Mandarin Speech Recognition
Authors: Chiu, Tzu-Hsuan
Chiang, Chen-Yu
Liao, Yuan-Fu
Yang, Jyh-Her
Wang, Yih-Ru
Chen, Sin-Horng
Department of Electrical and Computer Engineering
Keywords: acoustic modeling; speech recognition; prosody-dependent acoustic model; prosodic break
Issue date: 2012
Abstract: This paper reports a study on introducing prosodic information into acoustic modeling (AM) for speech recognition. It extends the conventional context-dependent (CD) triphone HMM modeling approach to also consider the dependency of each phone model on the break type of the nearby inter-syllable boundary. Four break types are considered: major break, minor break, normal non-break, and tightly-coupled non-break. In the training phase, break labeling is accomplished automatically by a previously proposed Prosody Labeling and Modeling algorithm. Then, prosody- and phonetic-dependent phone models are constructed by standard decision tree-based context clustering of HMMs. The effectiveness of the new AM was examined on a Mandarin syllable recognition task. Experimental results showed that the new approach outperformed the conventional CD-AM, achieving a better syllable recognition rate and a more efficient syllable lattice with a better trade-off between complexity and syllable coverage rate.
URI: http://hdl.handle.net/11536/23150
ISBN: 978-7-5608-4869-3
Journal: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II
Start page: 139
End page: 142
Appears in Collections: Conferences Paper