Speaker Adaptation of Speaking Rate-Dependent Hierarchical Prosodic Model For Mandarin TTS

Full metadata record

DC Field	Value	Language
dc.contributor.author	王柏鈞	en_US
dc.contributor.author	Wang, Po-Chun	en_US
dc.contributor.author	王逸如	en_US
dc.contributor.author	Wang, Yih-Ru	en_US
dc.date.accessioned	2014-12-12T02:43:38Z	-
dc.date.available	2014-12-12T02:43:38Z	-
dc.date.issued	2014	en_US
dc.identifier.uri	http://140.113.39.130/cdrfb3/record/nctu/#GT070160269	en_US
dc.identifier.uri	http://hdl.handle.net/11536/75600	-
dc.description.abstract	本論文提出一個語者韻律調適方法，來將現有的可調語速漢語文字轉語音系統的語速相依階層式韻律模型調適至新語者的資料，以製作此新語者的合成語音。本研究主要探討兩個問題：資料稀少及模型參數外插，問題的起因是調適語料不多且只存在一部分的語速範圍內。本研究使用類似原先訓練語速相依階層式韻律模型的概念，先使用調適語料訓練出一個新語者的階層式韻律模型，再將此模型修改調整成為語速相依的模型，在其中我們使用了最大事後機率(Maximum a posteriori, MAP)調適同時考慮模型參數外插的作法，以解決上述兩問題。由一位男性新語者的實驗結果顯示，調適後產生的語速相依階層式韻律模型可以涵蓋整個語速範圍(0.15-0.3 seconds/syllable)，因此可以使用它來產生此新語者的任何語速合成語音的韻律參數。	zh_TW
dc.description.abstract	In this thesis, a speaker adaptation method is proposed that adapts an existing speaking rate-dependent hierarchical prosodic model (SR-HPM) of an SR-controlled Mandarin TTS system to a new speaker’s data in order to realize a new voice. Two main problems are addressed: data sparseness, since the adaptation utterances are few and cover only a small range of normal speaking rates, and model parameter extrapolation, since no adaptation data exist in the fast and slow speaking-rate ranges. The proposed method follows the idea of SR-HPM training: it first normalizes the prosodic-acoustic features of the new speaker’s speech data, then trains an HPM with the PLM algorithm, and finally refines the HPM into a speaking rate-dependent model. Maximum a posteriori (MAP) adaptation with model parameter extrapolation is applied to cope with these two problems. Experimental results on a male speaker’s adaptation data confirm that the resulting adapted SR-HPM has reasonable parameters covering a wide range of speaking rates (0.15-0.3 seconds/syllable) and hence can be used in the TTS system to generate prosodic-acoustic features for synthesizing the new speaker’s voice at any given speaking rate.	en_US
dc.language.iso	zh_TW	en_US
dc.subject	文字轉語音合成	zh_TW
dc.subject	調適	zh_TW
dc.subject	語速韻律模型	zh_TW
dc.subject	外插法	zh_TW
dc.subject	最大事後機率	zh_TW
dc.subject	線性迴歸	zh_TW
dc.subject	TTS	en_US
dc.subject	Adaptation	en_US
dc.subject	Speaking Rate-Dependent Hierarchical Prosodic Model	en_US
dc.subject	Extrapolation	en_US
dc.subject	MAP	en_US
dc.subject	Linear Regression	en_US
dc.title	漢語語速相依韻律模型之語者調適及其在語音合成之應用	zh_TW
dc.title	Speaker Adaptation of Speaking Rate-Dependent Hierarchical Prosodic Model For Mandarin TTS	en_US
dc.type	Thesis	en_US
dc.contributor.department	電信工程研究所	zh_TW
Appears in Collections:	Thesis