完整後設資料紀錄
DC 欄位語言
dc.contributor.authorChiang, Chen-Yuen_US
dc.contributor.authorYang, Jyh-Heren_US
dc.contributor.authorLiu, Ming-Chiehen_US
dc.contributor.authorWang, Yih-Ruen_US
dc.contributor.authorLiao, Yuan-Fuen_US
dc.contributor.authorChen, Sin-Hornen_US
dc.date.accessioned2019-04-02T06:04:18Z-
dc.date.available2019-04-02T06:04:18Z-
dc.date.issued2011-01-01en_US
dc.identifier.urihttp://hdl.handle.net/11536/150571-
dc.description.abstractIn this paper, a new model-based Mandarin-speech coding system is proposed. It employs a prosody-enriched ASR with a hierarchical prosodic model (HPM) to generate from the input speech enriched transcriptions, including linguistic features, prosodic tags and spectral parameters in the encoder. By sending these features to the decoder, we can first reconstruct the prosodic-acoustic features of syllable pitch contour, syllable duration, syllable energy level, and inter-syllable pause duration by HPM using the linguistic features and prosodic tags; and then combined with spectral parameters to reconstruct the input speech signal by an HMM-based speech synthesizer. Experimental results show that the reconstructed speech has good quality at a low data rate of 543 bits/s.en_US
dc.language.isoen_USen_US
dc.subjectmodel-based speech codingen_US
dc.subjectprosody-enriched ASRen_US
dc.subjectenriched transcriptionsen_US
dc.subjecthierarchical prosodic modelen_US
dc.titleA New Model-based Mandarin-speech Coding Systemen_US
dc.typeProceedings Paperen_US
dc.identifier.journal12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5en_US
dc.citation.spage2572en_US
dc.citation.epage2575en_US
dc.contributor.department電信工程研究所zh_TW
dc.contributor.departmentInstitute of Communications Engineeringen_US
dc.identifier.wosnumberWOS:000316502201132en_US
dc.citation.woscount0en_US
顯示於類別:會議論文