標題: | 自發性中文語音基本辨認系統之建立 An Implementation of Spontaneous Mandarin Speech Recognition Baseline System |
作者: | 羅應順 Ying-Shuen Lo 陳信宏 Sin-Horng Chen 電信工程研究所 |
關鍵字: | 自發性中文對話語音辨識;發音變異;Spontaneous Mandarin speech recognition;pronunciation variation |
公開日期: | 2004 |
摘要: | 在本論文中,我們建立一個自發性中文對話語音辨識基本系統架構,探討中文語音及自發性語料的特殊語音現象,如感嘆詞(particles)、不確定語音發音(uncertain sounds)、非語音聲音(paralinguistic sounds)等。我們使用中研院提供的八段雙人對話語料庫做實驗,最後,獲得的音節辨識率約為56.4% (引入語音模型)。除此之外,在我們的系統裡,我們使用KPCA(kernel principal components analysis)方法,去進行基本音節HMM模型分裂,來模擬發音變異現象。 In the thesis,a basic spontaneous Mandarin speech recognition is established. The study focuses on the acoustic modeling for 411 Mandarin base-syllables as well as some special phenomena of spontaneous speech such as particles, uncertain sounds, and paralinguistic phenomena. Performance of the database called MCDC (Mandarin Conversational Dialogue Corpus). Finally, A syllable accuracy rate of 56.4% with adapted language model. In addition,the kernel principal components analysis (KPCA) method is used to split the base-syllable HMM models in order to model the pronunciation variation in our system. |
URI: | http://140.113.39.130/cdrfb3/record/nctu/#GT009213621 http://hdl.handle.net/11536/70623 |
Appears in Collections: | Thesis |
Files in This Item:
If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.