Title: 中文自發性語音之聲學模式及韻律模式的改進
Improved Acoustic Modeling and Prosody Modeling for Mandarin Spontaneous-Speech Recognition
Authors: 游俊龍
You, Chung-Long
王逸如
Wang, Yih-Ru
Institute of Communications Engineering
Keywords: spontaneous speech; acoustic model; model adaptation; Hierarchical Prosodic Model
Issue Date: 2015
Abstract: Spontaneous speech is the form of speech closest to everyday conversation, which makes it an important target for speech recognition. This thesis has two parts: improving the acoustic model (AM) and improving the prosodic model (PM) for spontaneous Mandarin speech. For acoustic modeling, we use read-speech data to assist training: model adaptation is applied to adapt an acoustic model trained on read speech to spontaneous speech, and skip-state HMMs are then used to reduce the excessive number of deletion errors. For prosodic modeling, we build on the previously proposed Hierarchical Prosodic Model (HPM) to construct a prosodic model suited to spontaneous speech; the syllable prosodic model is modified to take additional possible affecting factors (AFs) into account. Finally, after the corpus is labeled automatically, we examine the prosodic variation of phenomena specific to spontaneous speech, such as disfluencies. We expect these findings and improvements to benefit future research on spontaneous speech.
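The abstract does not give the HMM topology used in the thesis, so the following is only a minimal sketch of what a skip-state HMM generally looks like, under stated assumptions. It builds the transition matrix of a left-to-right HMM in which each emitting state may also jump two states ahead, and shows that the skip lowers the minimum number of frames a model can consume. That is one plausible reason such a topology helps with shortened or reduced syllables that would otherwise be deleted. The function names, the three-state layout, and the probabilities are hypothetical and are not taken from the thesis.

```python
from collections import deque


def skip_state_transitions(n_states, p_stay=0.5, p_next=0.4, p_skip=0.1):
    """Build a row-stochastic transition matrix for a left-to-right HMM.

    Indices 0..n_states-1 are emitting states; index n_states is a
    non-emitting exit state. All values are illustrative assumptions.
    """
    size = n_states + 1
    exit_idx = n_states
    matrix = []
    for i in range(n_states):
        row = [0.0] * size
        row[i] = p_stay                          # self-loop
        row[min(i + 1, exit_idx)] += p_next      # move to the next state
        row[min(i + 2, exit_idx)] += p_skip      # skip over one state
        matrix.append(row)
    matrix.append([0.0] * n_states + [1.0])      # exit state is absorbing
    return matrix


def min_frames(matrix):
    """Fewest emitting states visited on any path from state 0 to exit."""
    exit_idx = len(matrix) - 1
    dist = {0: 0}
    queue = deque([0])
    while queue:
        i = queue.popleft()
        if i == exit_idx:
            return dist[i]
        for j, p in enumerate(matrix[i]):
            if p > 0 and j != i and j not in dist:
                dist[j] = dist[i] + 1
                queue.append(j)
    return None


with_skip = skip_state_transitions(3)
without_skip = skip_state_transitions(3, p_stay=0.5, p_next=0.5, p_skip=0.0)
print(min_frames(with_skip), min_frames(without_skip))
```

In this sketch the three-state model with skips can be traversed in two frames instead of three, so a very short segment can still be aligned to the model rather than dropped.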
URI: http://140.113.39.130/cdrfb3/record/nctu/#GT070260265
http://hdl.handle.net/11536/127315
Appears in Collections: Thesis