| 標題: | Prosody-dependent Acoustic Modeling for Mandarin Speech Recognition |
| 作者: | Chiu, Tzu-Hsuan Chiang, Chen-Yu Liao, Yuan-Fu Yang, Jyh-Her Wang, Yih-Ru Chen, Sin-Horng 電機工程學系 Department of Electrical and Computer Engineering |
| 關鍵字: | acoustic modeling;speech recognition;prosody-dependent acoustic model;prosodic break |
| 公開日期: | 2012 |
| 摘要: | A study on introducing prosodic information to acoustic modeling (AM) for speech recognition is reported in this paper. It extends the conventional context-dependent (CD) triphone HMM modeling approach to further consider the dependency of phone model on the break type of nearby inter-syllable boundary. Four break types are considered, including major break, minor break, normal non-break, and tightly-coupled non-break. In the training phase, break labeling is automatically accomplished by a Prosody Labeling and Modeling algorithm proposed previously. Then, prosody-and phonetic-dependent phone models are constructed by a standard decision tree-based context clustering of HMMs. The effectiveness of the new AM was examined on a Mandarin syllable recognition task. Experimental results showed that the new approach outperformed the conventional CD-AM on achieving better syllable recognition rate as well as on obtaining a more efficient syllable lattice with better compromise on complexity verse syllable coverage rate. |
| URI: | http://hdl.handle.net/11536/23150 |
| ISBN: | 978-7-5608-4869-3 |
| 期刊: | PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II |
| 起始頁: | 139 |
| 結束頁: | 142 |
| Appears in Collections: | Conferences Paper |

