Full metadata record
DC FieldValueLanguage
dc.contributor.author王柏鈞en_US
dc.contributor.authorWang, Po-Chunen_US
dc.contributor.author王逸如en_US
dc.contributor.authorWang, Yih-Ruen_US
dc.date.accessioned2014-12-12T02:43:38Z-
dc.date.available2014-12-12T02:43:38Z-
dc.date.issued2014en_US
dc.identifier.urihttp://140.113.39.130/cdrfb3/record/nctu/#GT070160269en_US
dc.identifier.urihttp://hdl.handle.net/11536/75600-
dc.description.abstract本論文提出一個語者韻律調適方法,來將現有的可調語速漢語文字轉語音系統的語速相依階層式韻律模型調適至新語者的資料,以製做此新語者的合成語音,本研究主要探討兩個問題:資料稀少及模型參數外插,問題的起因是調適語料不多且只存在一部分的語速範圍內。本研究使用類似原先訓練語速相依階層式韻律模型的概念,先使用調適語料訓練出一個新語者的階層式韻律模型,再將此模型修改調整成為語速相依的模型,在其中我們使用了最大事後機率(Maximum a posterior, MAP)調適同時考慮模型參數外差的作法,以解決上述兩問題。由一位男性新語者的實驗結果顯示,調適後產生的語速相依階層式韻律模型可以涵蓋整個語速範圍(0.15-0.3 seconds/syllable),因此可以使用它來產生此新語者的任何語速合成語音的韻律參數。zh_TW
dc.description.abstractIn this thesis, a speaker adaptation methodto adapt an existing speaking rate-dependent hierarchical prosodic model (SR-HPM) of an SR-controlled Mandarin TTS system to new speaker’s data for realizing a new voice is proposed.Two main problems are solved: data sparseness for adaptation utterances existed only in a small range of normal speaking rate and no adaptation data in both ranges of fast and slow speaking rates. The proposed method follows the idea of SR-HPM training to firstly normalize the prosodic-acoustic features of the new speaker’s speech data, to then train an HPM by the PLM algorithm, and to lastly refine the HPM to a speaking rate-dependent model. The MAP adaptation method with model parameter extrapolation is applied to cope with the above two problems. Experimental results on a male speaker’s adaptation data confirmed that the resulting adaptive SR-HPM has reasonable parameters covering a wide range of speaking rates (0.15-0.3 seconds/syllable) and hence can be used in the TTS system to generate prosodic-acoustic features for synthesizing the new speaker’s voice of any given speaking rateen_US
dc.language.isozh_TWen_US
dc.subject文字轉語音合成zh_TW
dc.subject調適zh_TW
dc.subject語速韻律模型zh_TW
dc.subject外插法zh_TW
dc.subject最大事後機率zh_TW
dc.subject線性迴歸zh_TW
dc.subjectTTSen_US
dc.subjectAdaptationen_US
dc.subjectSpeaking Rate-Dependent Hierarchical Prosodic Modelen_US
dc.subjectExtrapolationen_US
dc.subjectMAPen_US
dc.subjectLinear Regressionen_US
dc.title漢語語速相依韻律模型之語者調適及其在語音合成之應用zh_TW
dc.titleSpeaker Adaptation of Speaking Rate-Dependent Hierarchical Prosodic Model For Mandarin TTSen_US
dc.typeThesisen_US
dc.contributor.department電信工程研究所zh_TW
Appears in Collections:Thesis