完整後設資料紀錄
DC 欄位語言
dc.contributor.authorChen, Sin-Horngen_US
dc.contributor.authorYang, Jyh-Heren_US
dc.contributor.authorChiang, Chen-Yuen_US
dc.contributor.authorLiu, Ming-Chiehen_US
dc.contributor.authorWang, Yih-Ruen_US
dc.date.accessioned2014-12-08T15:22:32Z-
dc.date.available2014-12-08T15:22:32Z-
dc.date.issued2012-08-01en_US
dc.identifier.issn1558-7916en_US
dc.identifier.urihttp://hdl.handle.net/11536/15935-
dc.description.abstractThis paper presents a new prosody-assisted automatic speech recognition (ASR) system for Mandarin speech. It differs from the conventional approach of using simple prosodic cues on employing a sophisticated prosody modeling approach based on a four-layer prosody-hierarchy structure to automatically generate 12 prosodic models from a large unlabeled speech database by the joint prosody labeling and modeling (PLM) algorithm proposed previously. By incorporating these 12 prosodic models into a two-stage ASR system to rescore the word lattice generated in the first stage by the conventional hidden Markov model (HMM) recognizer, we can obtain a better recognized word string. Besides, some other information can also be decoded, including part of speech (POS), punctuation mark (PM), and two types of prosodic tags which can be used to construct the prosody-hierarchy structure of the testing speech. Experimental results on the TCC300 database, which consists of long paragraphic utterances, showed that the proposed system significantly outperformed the baseline scheme using an HMM recognizer with a factored language model which models word, POS, and PM. Performances of 20.7%, 14.4%, and 9.6% in word, character, and base-syllable error rates were obtained. They corresponded to 3.7%, 3.7%, and 2.4% absolute (or 15.2%, 20.4%, and 20% relative) error reductions. By an error analysis, we found that many word segmentation errors and tone recognition errors were corrected.en_US
dc.language.isoen_USen_US
dc.subjectProsody modelingen_US
dc.subjectprosody-assisted automatic speech recognition (ASR)en_US
dc.subjectprosody-hierarchy structureen_US
dc.titleA New Prosody-Assisted Mandarin ASR Systemen_US
dc.typeArticleen_US
dc.identifier.journalIEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSINGen_US
dc.citation.volume20en_US
dc.citation.issue6en_US
dc.citation.epage1669en_US
dc.contributor.department電機工程學系zh_TW
dc.contributor.departmentDepartment of Electrical and Computer Engineeringen_US
dc.identifier.wosnumberWOS:000302532000001-
dc.citation.woscount2-
顯示於類別:期刊論文


文件中的檔案:

  1. 000302532000001.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。