Title: Knowledge Integration for Improving Performance in LVCSR
Authors: Chiang, Chen-Yu
Siniscalchi, Sabato Marco
Chen, Sin-Horng
Lee, Chin-Hui
Department of Electrical and Computer Engineering
Keywords: LVCSR; knowledge-based system; prosody labeling/modeling; attribute detector
Publication date: 1-Jan-2013
Abstract: This paper presents a knowledge integration framework to improve performance in large vocabulary continuous speech recognition. Two types of knowledge sources, manner attribute and prosodic structure, are incorporated. For manner of articulation, six attribute detectors trained with an American English corpus (WSJ0) are utilized to rescore hypothesized phones in word lattices obtained by a baseline ASR system. For the prosodic structure, models trained with an unsupervised joint prosody labeling and modeling (PLM) technique using WSJ0 are used in lattice rescoring. Experimental results on the American English WSJ word recognition task of the Nov92 test set show that the proposed approach significantly outperforms the baseline system that does not use articulatory and prosodic information. The results also demonstrate the effectiveness and usefulness of the PLM technique in constructing prosodic models for American English ASR.
URI: http://hdl.handle.net/11536/146414
ISSN: 2308-457X
Journal: 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5
Start page: 1785
End page: 1789
Appears in collections: Conference papers