ENRICHING MANDARIN SPEECH RECOGNITION BY INCORPORATING A HIERARCHICAL PROSODY MODEL

Full metadata record

DC Field	Value	Language
dc.contributor.author	Yang, Jyh-Her	en_US
dc.contributor.author	Liu, Ming-Chieh	en_US
dc.contributor.author	Chang, Hao-Hsiang	en_US
dc.contributor.author	Chiang, Chen-Yu	en_US
dc.contributor.author	Wang, Yih-Ru	en_US
dc.contributor.author	Chen, Sin-Horng	en_US
dc.date.accessioned	2014-12-08T15:20:53Z	-
dc.date.available	2014-12-08T15:20:53Z	-
dc.date.issued	2011	en_US
dc.identifier.isbn	978-1-4577-0539-7	en_US
dc.identifier.issn	1520-6149	en_US
dc.identifier.uri	http://hdl.handle.net/11536/14862	-
dc.description.abstract	This paper presents a new probabilistic framework of Mandarin speech recognition by incorporating a sophisticated hierarchical prosody model into the conventional HMM-based system. The prosody model describes the relations of linguistic cues of various levels, break types and prosodic states which represent the prosody hierarchical structure, and prosody-related acoustic features. Aside from producing the recognized word sequences, the system also decodes other information including word's part-of-speech, punctuation marks, inter-syllable break types, and prosodic states of syllables. Experimental results on the TCC300 corpus, which consists of paragraphic utterances, showed that the proposed system significantly outperformed the baseline system. The word and character error rates decreased from 24.4% and 18.1% to 20.7% and 14.4% (or 15.2% and 20.4% relative improvements), respectively.	en_US
dc.language.iso	en_US	en_US
dc.subject	Hierarchical prosody model	en_US
dc.subject	Mandarin speech recognition	en_US
dc.title	ENRICHING MANDARIN SPEECH RECOGNITION BY INCORPORATING A HIERARCHICAL PROSODY MODEL	en_US
dc.type	Proceedings Paper	en_US
dc.identifier.journal	2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING	en_US
dc.citation.spage	5052	en_US
dc.citation.epage	5055	en_US
dc.contributor.department	電信工程研究所	zh_TW
dc.contributor.department	Institute of Communications Engineering	en_US
dc.identifier.wosnumber	WOS:000296062405165	-
Appears in Collections:	Conferences Paper