Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 林威成 | en_US |
dc.contributor.author | Wei-Cheng Lin | en_US |
dc.contributor.author | 王逸如 | en_US |
dc.contributor.author | Yih-Ru Wang | en_US |
dc.date.accessioned | 2014-12-12T02:28:31Z | - |
dc.date.available | 2014-12-12T02:28:31Z | - |
dc.date.issued | 2001 | en_US |
dc.identifier.uri | http://140.113.39.130/cdrfb3/record/nctu/#NT900435060 | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/68937 | - |
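dc.description.abstract | This thesis studies and analyzes a pre-segmentation system for continuous speech. It proposes a basic architecture that combines a recurrent neural network with a finite state machine to perform coarse and fine classification of continuous speech, for use by downstream processors with different purposes. In coarse classification, continuous speech is divided into silence and speech; the experimental results show that correct silence/speech boundaries can be obtained. In fine classification, speech is divided into initials, finals, syllable-final nasals, silence, and initial-final transition states. During implementation, we found that pre-segmentation cannot effectively handle the points where adjacent syllables are coupled. We therefore collected statistics on and analyzed the cases in which liaison occurs, and built a liaison model so that the downstream syllable recognition system can use this information to improve its recognition rate. Finally, for detecting prosodic phrase boundaries, we propose two methods, a Gaussian mixture model and a multilayer neural network, both of which also give good recognition results. | zh_TW |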
dc.description.abstract | In this thesis, a recurrent neural network (RNN) and finite state machines (FSMs) were used to construct a pre-segmentation unit for a speech processing system. An RNN pre-segmentation network was used to classify the input speech into silence, initial, final, and nasal. Two speech databases, MAT-2000 and TCC-300, were used to examine the effectiveness of the RNN pre-segmentation network. FSMs were then used in a second stage to constrain the segmentation result according to the phonetic structure of Mandarin speech. First, an FSM was used to classify the input signal into silence and speech. Another FSM was then used to segment the signal into silence, initial, initial/final transition, final, nasal, and silence. The performance of these two RNN-FSM segmentation schemes was carefully examined by experiments. Finally, besides the sentence and syllable boundaries, the prosodic boundaries of speech were also detected using a statistical method and an MLP neural network. | en_US |
dc.language.iso | zh_TW | en_US |
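dc.subject | recurrent neural network (遞迴式類神經網路) | zh_TW |
dc.subject | segmentation (切割) | zh_TW |
dc.subject | multilayer neural network (多層式類神經網路) | zh_TW |
dc.subject | finite state machine (有限狀態機) | zh_TW |
dc.subject | hidden Markov model (隱藏式馬可夫模型) | zh_TW |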
dc.subject | recurrent neural network | en_US |
dc.subject | segmentation | en_US |
dc.subject | MLP | en_US |
dc.subject | FSM | en_US |
dc.subject | HMM | en_US |
dc.title | 使用遞迴式類神經網路之語音段切割 | zh_TW |
dc.title | RNN-based Segmentation for Speech Recognition | en_US |
dc.type | Thesis | en_US |
dc.contributor.department | Institute of Communications Engineering (電信工程研究所) | zh_TW |
Appears in Collections: | Thesis |
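
The second stage described in the abstract, in which an FSM constrains the frame-level output of the RNN, can be illustrated roughly as follows. This is only a minimal sketch and not the thesis's implementation: the record does not give the FSM's transition table or the form of the RNN outputs, so the `ALLOWED` table, the per-frame score dictionaries, and the function name `fsm_decode` are all assumptions made for illustration. The sketch uses a plain Viterbi search to pick the highest-scoring state sequence that the FSM permits.

```python
import math

# State names follow the abstract; everything else here is hypothetical.
STATES = ["silence", "initial", "transition", "final", "nasal"]

# Assumed transition table (not taken from the thesis): each state maps to
# the set of states it may move to on the next frame.
ALLOWED = {
    "silence": {"silence", "initial", "final"},
    "initial": {"initial", "transition"},
    "transition": {"transition", "final"},
    "final": {"final", "nasal", "silence", "initial"},
    "nasal": {"nasal", "silence", "initial"},
}


def _log(p):
    """Logarithm that maps non-positive scores to -inf instead of raising."""
    return math.log(p) if p > 0 else float("-inf")


def fsm_decode(frame_scores, start="silence"):
    """Return the best state path permitted by ALLOWED (Viterbi search).

    frame_scores: list of dicts mapping a state name to a probability-like
    score for that frame; missing states are treated as score 0.
    """
    if not frame_scores:
        return []
    # The path is forced to begin in `start`.
    best = {
        s: _log(frame_scores[0].get(s, 0.0)) if s == start else float("-inf")
        for s in STATES
    }
    back = []
    for scores in frame_scores[1:]:
        new_best, pointers = {}, {}
        for s in STATES:
            # Best predecessor among the states allowed to move into s.
            prev = max((p for p in STATES if s in ALLOWED[p]),
                       key=lambda p: best[p])
            new_best[s] = best[prev] + _log(scores.get(s, 0.0))
            pointers[s] = prev
        best = new_best
        back.append(pointers)
    # Trace back from the best final state.
    state = max(STATES, key=lambda s: best[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


# Made-up scores: the frame-wise argmax would be
# silence, initial, nasal, final, which the assumed FSM forbids.
frames = [
    {"silence": 0.9, "initial": 0.1},
    {"silence": 0.1, "initial": 0.8, "final": 0.1},
    {"transition": 0.4, "nasal": 0.5, "final": 0.1},
    {"final": 0.9, "nasal": 0.1},
]
print(fsm_decode(frames))
# ['silence', 'initial', 'transition', 'final']
```

In this toy input the third frame scores highest for nasal, but the assumed table does not allow a nasal directly after an initial, so the search selects the initial/final transition state instead.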