Knowledge Integration for Improving Performance in LVCSR

Full metadata record

DC Field	Value	Language
dc.contributor.author	Chiang, Chen-Yu	en_US
dc.contributor.author	Siniscalchi, Sabato Marco	en_US
dc.contributor.author	Chen, Sin-Horng	en_US
dc.contributor.author	Lee, Chin-Hui	en_US
dc.date.accessioned	2018-08-21T05:56:37Z	-
dc.date.available	2018-08-21T05:56:37Z	-
dc.date.issued	2013-01-01	en_US
dc.identifier.issn	2308-457X	en_US
dc.identifier.uri	http://hdl.handle.net/11536/146414	-
dc.description.abstract	This paper presents a knowledge integration framework to improve performance in large vocabulary continuous speech recognition. Two types of knowledge sources, manner attribute and prosodic structure, are incorporated. For manner of articulation, six attribute detectors trained with an American English corpus (WSJ0) are utilized to rescore hypothesized phones in word lattices obtained by a baseline ASR system. For the prosodic structure, models trained with an unsupervised joint prosody labeling and modeling (PLM) technique using WSJ0 are used in lattice rescoring. Experimental results on the American English WSJ word recognition task of the Nov92 test set show that the proposed approach significantly outperforms the baseline system that does not use articulatory and prosodic information. The results also demonstrate the effectiveness and usefulness of the PLM technique in constructing prosodic models for American English ASR.	en_US
dc.language.iso	en_US	en_US
dc.subject	LVCSR	en_US
dc.subject	knowledge-based system	en_US
dc.subject	prosody labeling/modeling	en_US
dc.subject	attribute detector	en_US
dc.title	Knowledge Integration for Improving Performance in LVCSR	en_US
dc.type	Proceedings Paper	en_US
dc.identifier.journal	14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5	en_US
dc.citation.spage	1785	en_US
dc.citation.epage	1789	en_US
dc.contributor.department	電機工程學系	zh_TW
dc.contributor.department	Department of Electrical and Computer Engineering	en_US
dc.identifier.wosnumber	WOS:000395050000373	en_US
Appears in Collections:	Conference Papers