完整後設資料紀錄
DC 欄位語言
dc.contributor.authorYang, Jyh-Heren_US
dc.contributor.authorLiu, Ming-Chiehen_US
dc.contributor.authorChang, Hao-Hsiangen_US
dc.contributor.authorChiang, Chen-Yuen_US
dc.contributor.authorWang, Yih-Ruen_US
dc.contributor.authorChen, Sin-Horngen_US
dc.date.accessioned2014-12-08T15:20:53Z-
dc.date.available2014-12-08T15:20:53Z-
dc.date.issued2011en_US
dc.identifier.isbn978-1-4577-0539-7en_US
dc.identifier.issn1520-6149en_US
dc.identifier.urihttp://hdl.handle.net/11536/14862-
dc.description.abstractThis paper presents a new probabilistic framework of Mandarin speech recognition by incorporating a sophisticated hierarchical prosody model into the conventional HMM-based system. The prosody model describes the relations of linguistic cues of various levels, break types and prosodic states which represent the prosody hierarchical structure, and prosody-related acoustic features. Aside from producing the recognized word sequences, the system also decodes other information including word's part-of-speech, punctuation marks, inter-syllable break types, and prosodic states of syllables. Experimental results on the TCC300 corpus, which consists of paragraphic utterances, showed that the proposed system significantly outperformed the baseline system. The word and character error rates decreased from 24.4% and 18.1% to 20.7% and 14.4% (or 15.2% and 20.4% relative improvements), respectively.en_US
dc.language.isoen_USen_US
dc.subjectHierarchical prosody modelen_US
dc.subjectMandarin speech recognitionen_US
dc.titleENRICHING MANDARIN SPEECH RECOGNITION BY INCORPORATING A HIERARCHICAL PROSODY MODELen_US
dc.typeProceedings Paperen_US
dc.identifier.journal2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSINGen_US
dc.citation.spage5052en_US
dc.citation.epage5055en_US
dc.contributor.department電信工程研究所zh_TW
dc.contributor.departmentInstitute of Communications Engineeringen_US
dc.identifier.wosnumberWOS:000296062405165-
顯示於類別:會議論文