A New Model-based Mandarin-speech Coding System

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	Chiang, Chen-Yu	en_US
dc.contributor.author	Yang, Jyh-Her	en_US
dc.contributor.author	Liu, Ming-Chieh	en_US
dc.contributor.author	Wang, Yih-Ru	en_US
dc.contributor.author	Liao, Yuan-Fu	en_US
dc.contributor.author	Chen, Sin-Horn	en_US
dc.date.accessioned	2019-04-02T06:04:18Z	-
dc.date.available	2019-04-02T06:04:18Z	-
dc.date.issued	2011-01-01	en_US
dc.identifier.uri	http://hdl.handle.net/11536/150571	-
dc.description.abstract	In this paper, a new model-based Mandarin-speech coding system is proposed. It employs a prosody-enriched ASR with a hierarchical prosodic model (HPM) to generate from the input speech enriched transcriptions, including linguistic features, prosodic tags and spectral parameters in the encoder. By sending these features to the decoder, we can first reconstruct the prosodic-acoustic features of syllable pitch contour, syllable duration, syllable energy level, and inter-syllable pause duration by HPM using the linguistic features and prosodic tags; and then combined with spectral parameters to reconstruct the input speech signal by an HMM-based speech synthesizer. Experimental results show that the reconstructed speech has good quality at a low data rate of 543 bits/s.	en_US
dc.language.iso	en_US	en_US
dc.subject	model-based speech coding	en_US
dc.subject	prosody-enriched ASR	en_US
dc.subject	enriched transcriptions	en_US
dc.subject	hierarchical prosodic model	en_US
dc.title	A New Model-based Mandarin-speech Coding System	en_US
dc.type	Proceedings Paper	en_US
dc.identifier.journal	12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5	en_US
dc.citation.spage	2572	en_US
dc.citation.epage	2575	en_US
dc.contributor.department	電信工程研究所	zh_TW
dc.contributor.department	Institute of Communications Engineering	en_US
dc.identifier.wosnumber	WOS:000316502201132	en_US
dc.citation.woscount	0	en_US
顯示於類別：	會議論文