以特徵參數正規化為基礎之強健性語音辨認

標題:	以特徵參數正規化為基礎之強健性語音辨認 Robust Speech Recognition Based on Feature Normalization
作者:	高世哲 Shyh-Jer Kao 陳信宏 Sin-Horng Chean 電信工程研究所
關鍵字:	倒頻譜正規化法;ARMA濾波器;分佈等化法;mean and variance normalization(MVN);ARMA filter;Histogram(HEQ);AURORA
公開日期:	2006
摘要:	在本論文中，主要是針對強健性語音特徵參數作深入的探討，將現有的倒頻譜正規化法及分佈等化法做些許的改進。我們將分佈等化法加上ARMA濾波器後，經由國語數字串辨認實驗，辨識率從80.08%提升到82.03%。另外，我們提出的分兩群式MVA系統，也在經過改良後，辨識率由傳統MVA系統的81.31%提升到82.26%，同時，我們也做了理想分群MVA系統實驗，得知若準確分群，辨識率可提升至83.63%。最後，我們利用正確的基頻將語音再多分一群，理想三群式MVA系統實驗結果顯示，辨識率可達86.25%。 In this thesis. Some robust speech feature processing algorithms were proposed, in order to improve the speech recognition performance under the noisy environments . First, the well-known robust speech feature processing algorithms such as mean variance normalization(MVN) and histogram equalization(HEQ) was implemented in a Mandarin AURORA-like system database as the base-line system. Then, the class-based MVA was proposed to further implement the speech recognition performance. The class-based MVA algorithm was first categorized the signal into speech and non-speech parts and applied MVAs to each class separately. A 82.26% recognition rate can be achieved comparing to 81.31% in traditional MVA. Final, a Three-class voiced, unvoiced and non-speech MVA was investigated. A 86.25% recognition rate can be achieved under the ideal category of voiced/unvoiced/non-speech case.
URI:	http://140.113.39.130/cdrfb3/record/nctu/#GT009313542 http://hdl.handle.net/11536/78357
顯示於類別：	畢業論文

文件中的檔案：

354201.pdf

若為 zip 檔案，請下載檔案解壓縮後，用瀏覽器開啟資料夾中的 index.html 瀏覽全文。