標題: 自發性對話語音辨識之初步研究
Preliminary Study on Spontaneous Speech Recognition
作者: 徐文翰
Wen-Han Hsu
王逸如
Dr. Yih-Ru Wang
電信工程研究所
關鍵字: 自發性中文對話語音辨識;感嘆語;非語音聲音;聲學模型;語言模型;Spontaneous Mandarin speech recognition;Particles;Uncertain sounds;Paralinguistic phenomena;Acoustic modeling;Language model;MCDC
公開日期: 2003
摘要: 在本論文中,我們建立一個自發性中文對話語音辨識基本系統架構,探討中文語音及自發性語料的特殊語音現象,如感嘆語(particles)、語音發音變異(uncertain sounds)、非語音聲音 (paralinguistic sounds)等,之聲學模型建立方法,使用中研院提供的八個雙人對話語料做實驗,獲得之音節辨識率為43.33%。為使辨識系統更為完善,我們加入語言模型,並以語言調適的技術,使之更為優化,最後音節辨識率達到53.93%,較基本系統提升了10.6%。
In the thesis, a basic spontaneous Mandarin speech recognition system is established. The study focuses on the acoustic modeling for 411 Mandarin base-syllables as well as some special phenomena of spontaneous speech such as particles, uncertain sounds, and paralinguistic phenomena. Performance of the proposed system was examined by simulations using a Mandarin dialogue speech database called MCDC (Mandarin Conversational Dialogue Corpus). A syllable accuracy rate of 43.33% was obtained. By adding a bi-gram language model with proper adaptation, the syllable accuracy rate increased to 53.93% which was 10.6% better than the baseline system.
URI: http://140.113.39.130/cdrfb3/record/nctu/#GT009113621
http://hdl.handle.net/11536/47101
顯示於類別:畢業論文


文件中的檔案:

  1. 362101.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。