標題: | Rhythm Speech Lyrics Input for MIDI-Based Singing Voice Synthesis |
作者: | Lee, Hong-Ru Huang, Chih-Fang Hsu, Chih-Hao Wang, Wen-Nan 機械工程學系 Department of Mechanical Engineering |
關鍵字: | Singing voice synthesis;rhythm speech lyrics input;phonetic segmentation |
公開日期: | 2009 |
摘要: | This paper presents useful techniques and considerations in implementing underlying mandarin singing voice synthesis system using the RSLI unit The system can receive the continuous speech of the lyrics of a song, and can synthesize the intended song based on the MIDI-based music database This system is designed based on 3 units The first one is the input unit which allows the user specifies a musical score and phonetically-spelled lyrics to system The second one is the modified unit and it is employed to implement the pitch-shilling function using the PSOLA method The thud one is the mixed unit which has some undesirable artificial-sounding buzzy-effects. Including echo and vibrato effects Moreover. the energy, duration. and spectrum modifications ale also implemented in the mixed unit The synthesized singing voice sounds reasonably good From the subjective listening test, the MOS (mean opinion set le) of 3 3 and 3 2 ale obtained for the synthesized singing voices and the similarity of smiler 's voice, respectively |
URI: | http://hdl.handle.net/11536/13111 |
ISBN: | 978-3-642-10466-4 |
ISSN: | 0302-9743 |
期刊: | ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2009 |
Volume: | 5879 |
起始頁: | 459 |
結束頁: | 468 |
Appears in Collections: | Conferences Paper |