基於聽覺語言學與模糊類神經網路之英文母音辨識技術

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	洪英士	en_US
dc.contributor.author	Ying-Shih Hung	en_US
dc.contributor.author	周志成	en_US
dc.contributor.author	林進燈	en_US
dc.contributor.author	Chi-Cheng Jou	en_US
dc.contributor.author	Chin-Teng Lin	en_US
dc.date.accessioned	2014-12-12T01:39:31Z	-
dc.date.available	2014-12-12T01:39:31Z	-
dc.date.issued	2003	en_US
dc.identifier.uri	http://140.113.39.130/cdrfb3/record/nctu/#GT009112533	en_US
dc.identifier.uri	http://hdl.handle.net/11536/44868	-
dc.description.abstract	在本論文中，我們提出一新的語者不相關的英文母音辨識技術。首先，我們提出一組名為「聽學增強型-離散餘弦序列係數(AE-DCSC)」的新特徵。此特徵的想法是將許多聽學語言學上有關英文母音的研究成果實現在頻譜的強化上，讓其更具有代表性與差異化。其中，頻譜正規化(Spectrum-Level-Normalization)用以平衡不同共振峰的高度差異。根據語言學的研究，共振峰的位置比其高度來的重要。諧音的強化(Enhancement of Spectral Peaks)則能有效的壓抑介於諧音間頻譜微小的變化，使其更具強健性。為了能在有限的特徵維度裡有效地保留母音頻譜隨時間的變化情形，我們採用了離散餘弦序列係數這項技術。此技術具有可改變的頻率與時間的彎曲比例，這讓我們能根據訊號的特性，找出最具有代表性的特徵。而在本系統中，我們採用一前向式自我建構類神經模糊推理網路(SONFIN)做為核心辨識器。利用其可自我建構並調整的架構與參數學習功能，與優異的模糊類神經推論過程，來達到較佳之辨識效果。最後，我們提出一基於語言學特徵的確認程序。針對較為混淆的辨識結果，擷取其在聽學語言學上的特徵，並與我們事先建立的知識庫理的模型比對。以找出最可信的辨識結果。實驗證明，在TIMIT的資料庫下，此系統的辨識率可達74.75%，優於其他在文獻上所見的結果。這說明了我們在此所提出的辨識系統所具有的潛力與優越性。	zh_TW
dc.description.abstract	In this thesis, we proposed a novel speaker-independent English vowel recognition technique based on acoustic-phonetics and fuzzy neural networks. At first, we proposed a new feature set called as “AE-DCSC”. It was derived from the researches of acoustic-phonetics and implemented here to enhance the spectrum so that the features became more representative and discriminative. The technique spectrum-level-normalization was used to balance the amplitude difference between formants. Moreover, the enhancement of spectral peaks was used to suppress the variation of valley between harmonics. These processes let the spectrum more robust and noise-free. In order to preserve the temporal cues of vowels, the technique DCSC was used. The flexible time/frequency warping scales were adjusted according to properties of signals. An on-line self-constructing neural fuzzy inference network (SONFIN) was adopted as the main classifier in this system. SONFIN found its optimal structure and parameters automatically and achieved the better classification result via superior inference process. Finally an acoustic-checking procedure was proposed. We applied it to the ambiguous case in which the acoustic characteristics was evaluated and compared with the model in our knowledge-base database. The proposed approach resulted in an accuracy rate of 74.75% in TIMIT database, which higher than other published result for the same task. The potential and effectiveness of the proposed system was verified.	en_US
dc.language.iso	en_US	en_US
dc.subject	語音辨識	zh_TW
dc.subject	頻譜分析	zh_TW
dc.subject	語者不相關	zh_TW
dc.subject	聽覺語言學	zh_TW
dc.subject	模糊類神經網路	zh_TW
dc.subject	speech recognition	en_US
dc.subject	spectrum analysis	en_US
dc.subject	speaker-independent	en_US
dc.subject	acoustic-phonetic	en_US
dc.subject	fuzzy neural network	en_US
dc.title	基於聽覺語言學與模糊類神經網路之英文母音辨識技術	zh_TW
dc.title	Speaker-Independent English Vowel Recognition Technique Based on Acoustic-Phonetics and Fuzzy Neural Networks	en_US
dc.type	Thesis	en_US
dc.contributor.department	電控工程研究所	zh_TW
顯示於類別：	畢業論文

文件中的檔案：

253301.pdf

若為 zip 檔案，請下載檔案解壓縮後，用瀏覽器開啟資料夾中的 index.html 瀏覽全文。