Title: 以特徵參數正規化為基礎之強健性語音辨認
Robust Speech Recognition Based on Feature Normalization
Authors: 高世哲
Shyh-Jer Kao
陳信宏
Sin-Horng Chean
Institute of Communications Engineering (電信工程研究所)
Keywords: cepstral normalization; ARMA filter; histogram equalization (HEQ); mean and variance normalization (MVN); AURORA
Issue date: 2006
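Abstract: This thesis examines robust speech feature parameters in depth and makes modest improvements to existing cepstral normalization and histogram equalization methods. Adding an ARMA filter to histogram equalization raised the recognition rate from 80.08% to 82.03% in Mandarin digit-string recognition experiments. In addition, the proposed two-class MVA system, after refinement, raised the recognition rate from 81.31% for the conventional MVA system to 82.26%. An ideal-classification MVA experiment further showed that, when frames are classified accurately, the recognition rate can rise to 83.63%. Finally, the correct pitch (fundamental frequency) was used to split the speech frames into one more class; the ideal three-class MVA experiment reached a recognition rate of 86.25%.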
In this thesis, several robust speech feature processing algorithms are proposed to improve speech recognition performance in noisy environments. First, well-known robust feature processing algorithms such as mean and variance normalization (MVN) and histogram equalization (HEQ) were implemented on a Mandarin AURORA-like database as the baseline system. Then, a class-based MVA was proposed to further improve recognition performance. The class-based MVA algorithm first categorizes the signal into speech and non-speech parts and then applies MVA to each class separately. It achieves a recognition rate of 82.26%, compared with 81.31% for traditional MVA. Finally, a three-class (voiced, unvoiced, and non-speech) MVA was investigated. It achieves a recognition rate of 86.25% when the voiced/unvoiced/non-speech classification is ideal.
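
The class-based MVA idea described above can be illustrated with a minimal NumPy sketch. It is not the thesis implementation. It assumes that MVA means per-utterance mean and variance normalization followed by ARMA smoothing along time, that frame class labels (for example 0 = non-speech, 1 = speech) are already available, and that the ARMA filter is applied once over the whole utterance after per-class normalization. The function names, the `order` parameter, and the label encoding are hypothetical.

```python
import numpy as np


def mvn(features):
    """Mean and variance normalization over time (axis 0) for one utterance."""
    features = np.asarray(features, dtype=float)
    mean = features.mean(axis=0)
    std = features.std(axis=0)
    # Guard against division by zero for constant feature dimensions.
    return (features - mean) / np.maximum(std, 1e-8)


def arma_filter(features, order=2):
    """ARMA smoothing along time.

    Each interior output frame is the average of the previous `order`
    output frames and the current plus next `order` input frames.
    The first and last `order` frames are left unfiltered.
    """
    features = np.asarray(features, dtype=float)
    num_frames = features.shape[0]
    out = features.copy()
    for t in range(order, num_frames - order):
        past_outputs = out[t - order:t].sum(axis=0)
        current_and_future_inputs = features[t:t + order + 1].sum(axis=0)
        out[t] = (past_outputs + current_and_future_inputs) / (2 * order + 1)
    return out


def mva(features, order=2):
    """Conventional MVA: MVN over the whole utterance, then ARMA smoothing."""
    return arma_filter(mvn(features), order)


def class_based_mva(features, labels, order=2):
    """Class-based MVA sketch (assumed design, see lead-in).

    Frames are normalized with the statistics of their own class, then a
    single ARMA filter is run over the full utterance.
    """
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    normalized = np.empty_like(features)
    for c in np.unique(labels):
        mask = labels == c
        normalized[mask] = mvn(features[mask])
    return arma_filter(normalized, order)
```

For a feature matrix `x` of shape (frames, dimensions) and a label vector of the same length, `class_based_mva(x, labels)` returns a matrix of the same shape. A three-class variant (voiced, unvoiced, non-speech) only needs a third label value; how the labels are obtained is outside this sketch.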
URI: http://140.113.39.130/cdrfb3/record/nctu/#GT009313542
http://hdl.handle.net/11536/78357
Appears in collections: Theses


Files in this item:

  1. 354201.pdf

If the file is a zip archive, download and extract it, then open index.html in the extracted folder with a browser to view the full text.