完整後設資料紀錄
DC 欄位語言
dc.contributor.authorChiang, Chen-Yuen_US
dc.contributor.authorHung, Yu-Pingen_US
dc.contributor.authorChen, Sin-Horngen_US
dc.contributor.authorWang, Yih-Ruen_US
dc.date.accessioned2017-04-21T06:50:19Z-
dc.date.available2017-04-21T06:50:19Z-
dc.date.issued2013en_US
dc.identifier.isbn978-0-7695-5120-3en_US
dc.identifier.urihttp://dx.doi.org/10.1109/IIH-MSP.2013.24en_US
dc.identifier.urihttp://hdl.handle.net/11536/135358-
dc.description.abstractIn this paper, a novel parametric prosody coding approach for Mandarin speech is proposed. It employs a hierarchical prosodic model (HPM) as a prosody generating model in the encoder to analyze the speech prosody of the input utterance to obtain a parametric representation of four prosodic-acoustic features of syllable pitch contour, syllable duration, syllable energy level, and syllable-juncture pause duration for encoding. In the decoder, the four prosodic-acoustic features are reconstructed by a synthesis operation using the decoded HPM parameters. The reconstructed prosodic features are lastly used in an HMM-based speech synthesizer to help to generate the reconstructed speech. Experimental results show that the reconstructed speech has good quality at low data rates of 114.9 bits/s for a speaker-dependent task. An informal listening test confirmed decoded speeches sounded very fluently.en_US
dc.language.isoen_USen_US
dc.subjectProsody codingen_US
dc.subjectProsodic modelen_US
dc.titleA New Model-based Prosody Coder for Mandarin Speechen_US
dc.typeProceedings Paperen_US
dc.identifier.doi10.1109/IIH-MSP.2013.24en_US
dc.identifier.journal2013 NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2013)en_US
dc.citation.spage60en_US
dc.citation.epage63en_US
dc.contributor.department電機工程學系zh_TW
dc.contributor.departmentDepartment of Electrical and Computer Engineeringen_US
dc.identifier.wosnumberWOS:000343596900016en_US
dc.citation.woscount0en_US
顯示於類別:會議論文