標題: Traditional Chinese Parser and Language Modeling for Mandadin ASR
作者: Lin, Ang-Hsing
Wang, Yih-Ru
Chen, Sin-Horng
傳播研究所
Institute of Communication Studies
關鍵字: Chinese word segmentation;Conditional random field;Language model;weighted finite state transducer;automatic speech recognition
公開日期: 1-一月-2013
摘要: A new approach of traditional Chinese parser to improving the language modeling of Mandarin speech recognition is proposed in this paper. The parser first uses a preprocessing to correct some word segmentation inconsistencies of the text corpus. It then employs a CRF-based word segmentation method and a CRF-based POS tagger to resegment the texts so as to generate better word strings for training an n-gram language model (LM) for ASR. Experimental results on the TCC-300 corpus showed that a word error rate (WER) of 13.4% was achieved by the proposed method. It is about 45% improvement on the relative WER reduction as compared with the previous system.
URI: http://hdl.handle.net/11536/125033
ISSN: 
期刊: 2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE)
顯示於類別:會議論文