使用韻律訊息於建立聲學模型之中文語音辨認

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	邱子軒	en_US
dc.contributor.author	陳信宏	en_US
dc.date.accessioned	2014-12-12T01:55:49Z	-
dc.date.available	2014-12-12T01:55:49Z	-
dc.date.issued	2012	en_US
dc.identifier.uri	http://140.113.39.130/cdrfb3/record/nctu/#GT079913501	en_US
dc.identifier.uri	http://hdl.handle.net/11536/49287	-
dc.description.abstract	本研究探討如何使用韻律訊息於聲學模型(acoustic model, AM)之建立，用於中文語音辨認。本研究在訓練聲學模型時，將傳統前後文相關(context dependent) 的tri-phone HMM拓展至在音節邊界時，同時考慮韻律停頓(prosodic break)的影響。其中韻律停頓分為四種強度，用以表示音節間不同的緊密接合程度，並採用分類回歸決策樹(Classification and Regression Trees, CART)建立一個與前後文及韻律停頓相關的聲學模型。在辨認時分為兩個階段，在第一階段只利用聲學模型進行音節的辨認產生音節圖(syllable lattice)，且含有韻律停頓的資訊。在第二階段，針對音節圖配合詞典並輔以韻律停頓的資訊進行構詞，將其轉為詞圖(word lattice)，最後再結合語言模型(language model, LM)重新計分(rescoring)，實現詞的辨認。使用TCC300語料庫之實驗結果顯示本方法較傳統之tri-phone HMM有較好的辨認率。	zh_TW
dc.description.abstract	The thesis presents a study on introducing prosody information to acoustic modeling for Mandarin speech recognition. Its idea is to extend the conventional context-dependent (CD) tri-phone HMM modeling approach to further consider the dependency of phone model on the break type of nearby inter-syllable boundary. Four break types are considered, including major break, minor break, normal non-break, and tightly-coupled non-break. In the training phase, prosody- and phonetic-dependent phone models are constructed by using Classification and Regression Trees (CART) Algorithm. In the test phase, a two-stage recognition approach is adopted. In the first stage, we use the acoustic models to generate a syllable lattice which contains prosodic break information. In the second stage, we first construct a word lattice from the syllable lattice by constructing all possible words using a lexicon with the help of prosodic information, and then find the best output word sequence by rescoring using a trigram language model. Experimental results on the TCC300 database showed that the proposed method slightly outperformed the conventional method using tri-phone acoustic models.	en_US
dc.language.iso	zh_TW	en_US
dc.subject	語音辨認	zh_TW
dc.subject	聲學模型	zh_TW
dc.subject	韻律	zh_TW
dc.subject	Speech Recognition	en_US
dc.subject	Acoustic Model	en_US
dc.subject	Prosody	en_US
dc.title	使用韻律訊息於建立聲學模型之中文語音辨認	zh_TW
dc.title	Incorporating Prosody Information in Acoustic Modeling for Mandarin Speech Recognition	en_US
dc.type	Thesis	en_US
dc.contributor.department	電信工程研究所	zh_TW
顯示於類別：	畢業論文

文件中的檔案：

350101.pdf

若為 zip 檔案，請下載檔案解壓縮後，用瀏覽器開啟資料夾中的 index.html 瀏覽全文。