標題: | 自發性對話語音辨識之初步研究 Preliminary Study on Spontaneous Speech Recognition |
作者: | 徐文翰 Wen-Han Hsu 王逸如 Dr. Yih-Ru Wang 電信工程研究所 |
關鍵字: | 自發性中文對話語音辨識;感嘆語;非語音聲音;聲學模型;語言模型;Spontaneous Mandarin speech recognition;Particles;Uncertain sounds;Paralinguistic phenomena;Acoustic modeling;Language model;MCDC |
公開日期: | 2003 |
摘要: | 在本論文中,我們建立一個自發性中文對話語音辨識基本系統架構,探討中文語音及自發性語料的特殊語音現象,如感嘆語(particles)、語音發音變異(uncertain sounds)、非語音聲音 (paralinguistic sounds)等,之聲學模型建立方法,使用中研院提供的八個雙人對話語料做實驗,獲得之音節辨識率為43.33%。為使辨識系統更為完善,我們加入語言模型,並以語言調適的技術,使之更為優化,最後音節辨識率達到53.93%,較基本系統提升了10.6%。 In the thesis, a basic spontaneous Mandarin speech recognition system is established. The study focuses on the acoustic modeling for 411 Mandarin base-syllables as well as some special phenomena of spontaneous speech such as particles, uncertain sounds, and paralinguistic phenomena. Performance of the proposed system was examined by simulations using a Mandarin dialogue speech database called MCDC (Mandarin Conversational Dialogue Corpus). A syllable accuracy rate of 43.33% was obtained. By adding a bi-gram language model with proper adaptation, the syllable accuracy rate increased to 53.93% which was 10.6% better than the baseline system. |
URI: | http://140.113.39.130/cdrfb3/record/nctu/#GT009113621 http://hdl.handle.net/11536/47101 |
Appears in Collections: | Thesis |
Files in This Item:
If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.