A Study on Mandarin Speech and Song Signal Processing (國語聲訊處理)

Title:	國語聲訊處理 (A Study on Mandarin Speech and Song Signal Processing)
Authors:	Wang, Horng-Bin (王鴻彬); Chen, Sin-Horng (陳信宏); Institute of Communication Engineering
Keywords:	Audio Processing; Residual Error Signal; Pitch Mark; PSOLA (Pitch-Synchronous Overlap-Add); Resampling; HMM (Hidden Markov Model)
Issue Date:	1995
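Abstract:	(Translated from the Chinese abstract) In this thesis we process speech and singing signals according to their characteristics. The work has two parts. In the first part, the speaker of a Mandarin text-to-speech system is replaced by changing the spectral and prosodic parameters. The spectral parameters come from a corpus recorded by the new speaker, from which waveform templates of the 411 Mandarin syllables are extracted and pitch marks are detected. The prosodic parameters are statistics gathered from the new corpus: the durations of initials and finals, the durations of pauses, and the pitch-contour parameters that represent tone. These statistics are used to adjust the generation of prosodic information. The second part is audio processing to produce special effects: raising or lowering the pitch of speech and changing its playback speed by adjusting pitch marks, changing the key and tempo of singing by resampling, producing echo by delaying the sound, and designing a spectral transformer for voice conversion. (Original English abstract) In this thesis, several topics in signal processing for Mandarin speech and singing signals are studied. First, a technique for processing utterances of a female speaker in order to add her voice to a Mandarin TTS system is studied. In this work, we first automatically segment all utterances into syllable segments, and then manually extract waveform templates for the 411 base-syllable synthesis units. Prosodic parameters are then extracted, and their first- and second-order statistics are computed in order to adapt the synthesis of prosodic information to the style of the new speaker. Second, an LP model-based speech processing technique is proposed. The input speech signal is processed to shift the key, to change the speed, to simulate the echo effect, and to apply a spectral transform. Last, a time-domain processing scheme for singing signals is discussed. The input singing signal is processed to shift the key, to change the speed, and to simulate the echo effect. Informal listening tests confirmed that all of the proposed methods function well.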
URI:	http://140.113.39.130/cdrfb3/record/nctu/#NT840435010 http://hdl.handle.net/11536/60760
Appears in Collections:	Thesis
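
The abstract mentions two time-domain operations on singing signals: resampling to change key and speed, and delaying the sound to produce an echo. The following is a minimal Python sketch of both ideas, not the thesis's implementation. The function names `resample_linear` and `add_echo`, the use of linear interpolation, and the single-tap echo are assumptions chosen for illustration; the record does not say which interpolation or echo structure the thesis uses. Note that plain resampling changes pitch and duration together when the result is played at the original sample rate.

```python
def resample_linear(x, factor):
    """Read the signal x at `factor` times its original rate (assumed helper).

    Uses linear interpolation between neighbouring samples. A factor > 1
    gives a shorter output, which sounds faster and higher when played at
    the original sample rate; a factor < 1 gives a longer, lower output.
    """
    if factor <= 0:
        raise ValueError("factor must be positive")
    n_out = int(len(x) / factor)
    out = []
    for i in range(n_out):
        pos = i * factor            # fractional read position in x
        k = int(pos)                # index of the sample at or before pos
        frac = pos - k              # distance past that sample, in [0, 1)
        nxt = x[k + 1] if k + 1 < len(x) else x[k]  # hold last sample at the end
        out.append((1.0 - frac) * x[k] + frac * nxt)
    return out


def add_echo(x, delay_samples, gain):
    """Single-tap echo (assumed structure): y[n] = x[n] + gain * x[n - delay].

    The output is extended by `delay_samples` so the echo tail is kept.
    """
    y = [float(s) for s in x] + [0.0] * delay_samples
    for n, s in enumerate(x):
        y[n + delay_samples] += gain * s
    return y
```

For example, `resample_linear(signal, 2 ** (2 / 12))` would raise the key by about two semitones while shortening the signal by the same ratio, and `add_echo(signal, delay_samples=4000, gain=0.5)` would add one echo 0.5 s later for a signal sampled at 8 kHz (the sample rate here is an assumed value).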