標題: | Modular recurrent neural networks for Mandarin syllable recognition |
作者: | Chen, SH Liao, YF 電信工程研究所 Institute of Communications Engineering |
關鍵字: | Mandarin speech recognition;MCE/GPD algorithms;modular recurrent neural networks |
公開日期: | 1-十一月-1998 |
摘要: | A new modular recurrent neural network (MRNN)-based speech-recognition method that can recognize the entire vocabulary of 1280 highly confusable Mandarin syllables is proposed in this paper, The basic idea is to first split the complicated task, in both feature and temporal domains, into several much simpler subtasks involving subsyllable and tone discrimination, and then to use two weighting RNN's to generate several dynamic weighting functions to integrate the subsolutions into a complete solution. The novelty of the proposed method lies mainly in the use of appropriate a priori linguistic knowledge of simple initial-final structures of Mandarin syllables in the architecture design of the MRNN, The resulting MRNN is therefore effective and efficient in discriminating among highly confusable Mandarin syllables. Thus both the time-alignment and scaling problems of the ANN-based approach for large-vocabulary speech-recognition can be addressed. Experimental results show that the proposed method and its extensions, the reverse-time MRNN (Rev-MRNN) and bidirection MRNN (Bi-MRNN), all outperform an advanced HMM method trained with the MCE/GPD algorithm in both recognition-rate and system complexity. |
URI: | http://dx.doi.org/10.1109/72.728393 http://hdl.handle.net/11536/31766 |
ISSN: | 1045-9227 |
DOI: | 10.1109/72.728393 |
期刊: | IEEE TRANSACTIONS ON NEURAL NETWORKS |
Volume: | 9 |
Issue: | 6 |
起始頁: | 1430 |
結束頁: | 1441 |
顯示於類別: | 期刊論文 |