以類神經網路做國語語音辨認之研究

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	陳文源	en_US
dc.contributor.author	Wen-Yuan Chen	en_US
dc.contributor.author	陳信宏	en_US
dc.contributor.author	Sin-Horng Chen	en_US
dc.date.accessioned	2014-12-12T02:13:38Z	-
dc.date.available	2014-12-12T02:13:38Z	-
dc.date.issued	1994	en_US
dc.identifier.uri	http://140.113.39.130/cdrfb3/record/nctu/#NT830430016	en_US
dc.identifier.uri	http://hdl.handle.net/11536/59199	-
dc.description.abstract	這篇論文研究有關類神經網路於國語語音辨認上的應用。首先提出一種爆裂音參數的抽取方式，能自動切出輸入語音的爆裂音段落，並利用直交的多項式展開法，計算出一組固定數目的參數，以描述該段爆裂音在頻譜和時間軸上的特性，該參數可直接輸入類神經網路做爆裂音的辨認。利用直交的多項式展開法，我們也提出廣泛式最小失真度語音切割法，該方法能找出一組段落界限值，使得切割後的語音參數能在最小失真度的情況下，表示原來語音資料。切割後的參數不須再經過任何轉換程序，即可直接送入階層式類神經網路做辨認。實驗結果證明，在參數量相同的情況下，本論文所提出的廣泛式切割法，不論在失真度或辨認率上均比傳統的切割法好。在改善類神經網路的使用效率方面，我們提出具有時序加權特性的階層式類神經網路，其連接鍵值會隨時間變化，用以學習語音中的動態訊息，毋須另外執行動態時間校準程序，該網路就能有效的解決輸入語音與類神經網路之間的時間對準問題。在大字彙辨認方面，利用國語語音的特性，我們提出串連的階層式類神經網路和層次架構的遞迴類神經網路。這兩種網路架構以聲母、韻母或音素為基本辨認單位，一個類神經網路對應於一待辨認單位。串連的階層式類神經網路是以動態時間對準法，將輸入語音的時框映對至串連的階層式類神經網路。層次架構的遞迴類神經網路則另外使用一個遞迴類神經網路做聲母、韻母的切割和加權，因此毋需使用耗時的動態時間對準法，計算量可大量節省。 In this dissertation, several novel ANN-based speech recognition methods for discriminating isolated Mandarin speech are discussed. First, a new method to recognize six plosives in isolated Mandarin syllables is proposed. Next, an MLP-based method is proposed for isolated word recognition. Speech signal is first pre-processed by a generalized minimal distortion segmentation (GMDS) algorithm to find a set of boundaries that minimize the accumulated distortion of orthonormal polynomial expansions of all segments. Experimental results showed that dynamics of speech signal can be more accurately captured by the GMDS algorithm so as to improve the performance of the following MLP recognizer. Another approach based on a generalized MLP, referred to as time weighting MLP (TWMLP) is then proposed for isolated word recognition. In the TWMLP, weights which connect hidden nodes and output nodes are generalized to be varied with time in order to memorize the temporal information of training utterances. Last, two new methods are proposed for large vocabulary isolated Mandarin speech recognition. One is a sequential MLP based method. The other is a hierarchical recurrent neural networks based method.	zh_TW
dc.language.iso	en_US	en_US
dc.subject	語音辨認，類神經網路，爆裂音，最小失真度切割法，階層式類神經網路，遞迴類神經網路。	zh_TW
dc.subject	Speech Recognition, Mandarin, Artificial Neural Networks,	en_US
dc.title	以類神經網路做國語語音辨認之研究	zh_TW
dc.title	A Study on Mandarin Speech Recognition Using Connectionist Networks	en_US
dc.type	Thesis	en_US
dc.contributor.department	電子研究所	zh_TW
顯示於類別：	畢業論文