多語者漢語韻律模型之建立與其在語者韻律轉換之應用

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	劉子睿	en_US
dc.contributor.author	Liu, Tzu-Jui	en_US
dc.contributor.author	陳信宏	en_US
dc.contributor.author	Chen, Sin-Horng	en_US
dc.date.accessioned	2014-12-12T02:35:33Z	-
dc.date.available	2014-12-12T02:35:33Z	-
dc.date.issued	2013	en_US
dc.identifier.uri	http://140.113.39.130/cdrfb3/record/nctu/#GT070060209	en_US
dc.identifier.uri	http://hdl.handle.net/11536/72642	-
dc.description.abstract	本研究提出以語者韻律模式為基礎的語者韻律轉換方法，其系統架構包含語者韻律模式訓練及聲音韻律轉換兩階段。語者韻律模式訓練階段可再分成語者獨立韻律模型訓練及語者相依韻律模型調適兩個部分，它首先以PLM演算法訓練一個語者獨立韻律模型並對訓練語料產生韻律及停頓標記；接著以最大事後機率調適法則將語者獨立韻律模型調適成語者相關韻律模型，並以遞迴方式反覆疊代更新兩類模型直到收斂；聲音韻律轉換階段則包含來源語者韻律分析及目標語者韻律合成，它使用來源語者之語者相關韻律模型來分析輸入語音之韻律信息，以產生韻律標記，然後以目標語者之語者相關韻律模型來合成輸出語音之韻律參數，包括音節基頻軌跡、音節長度、音節能量、及音節間停頓長度。本研究實驗使用自行錄製的部分平行語料庫，包含9男6女的朗讀語音，實驗結果顯示我們所提出的方法轉換效果略優於傳統的高斯正規化法，並且在部分Source語者及Target語者韻律狀態的影響數值相差劇烈之處，可以產生補償的效果。	zh_TW
dc.description.abstract	In this thesis, a speech prosody conversion method based on speaker’s prosody modeling is proposed. The method comprises a prosody modeling phase and a prosody conversion phase. In the prosody modeling phase, the PLM algorithm proposed previously is firstly employed to train an SI prosodic model from a multi-speaker training dataset and label all training utterances with prosodic states for all syllables as well as break types for all syllable junctures. Then, the maximum a posterior probability (MAP) method is applied to adapt the SI prosodic model to generate a speaker dependent (SD) prosodic model for each speaker. In the prosody conversion phase, the SD prosodic model of the source speaker is firstly used to analyze the input speech to generate prosodic tags. Then, the prosody of the output speech is generated using these prosodic tags by the SD prosodic model of the target speaker. The prosodic information generated includes syllable pitch contour, syllable duration, syllable energy level, and syllable-juncture pause duration. A corpus containing read speeches of six female and nine male speakers was used to examine the validity of the proposed method. Experimental results confirmed that the proposed method performed slightly better than the conventional Z-score normalization method.	en_US
dc.language.iso	zh_TW	en_US
dc.subject	韻律模型	zh_TW
dc.subject	語者調適	zh_TW
dc.subject	韻律轉換	zh_TW
dc.subject	prosody model	en_US
dc.subject	speaker adapation	en_US
dc.subject	prosody conversion	en_US
dc.title	多語者漢語韻律模型之建立與其在語者韻律轉換之應用	zh_TW
dc.title	Multi-Speaker Mandarin Speech Prosody Modeling and its Application to Speaker Prosody Conversion	en_US
dc.type	Thesis	en_US
dc.contributor.department	電信工程研究所	zh_TW
顯示於類別：	畢業論文

文件中的檔案：

020901.pdf

若為 zip 檔案，請下載檔案解壓縮後，用瀏覽器開啟資料夾中的 index.html 瀏覽全文。