Full metadata record
DC Field: Value [Language]
dc.contributor.author: Liao, YF [en_US]
dc.contributor.author: Chen, SH [en_US]
dc.date.accessioned: 2014-12-08T15:44:06Z
dc.date.available: 2014-12-08T15:44:06Z
dc.date.issued: 2001-03-01 [en_US]
dc.identifier.issn: 1063-6676 [en_US]
dc.identifier.uri: http://dx.doi.org/10.1109/89.905999 [en_US]
dc.identifier.uri: http://hdl.handle.net/11536/29782
dc.description.abstract: A new modular recurrent neural network (MRNN)-based method for continuous Mandarin speech recognition (CMSR) is proposed. The MRNN recognizer is composed of four main modules. The first is a sub-MRNN module whose function is to generate discriminant functions for all 412 base-syllables; it accomplishes this task using four recurrent neural network (RNN) submodules. The second is an RNN module designed to detect syllable boundaries, providing timing cues that help solve the time-alignment problem. The third is also an RNN module, whose function is to generate discriminant functions for 143 intersyllable diphone-like units to compensate for the intersyllable coarticulation effect. The fourth is a dynamic programming (DP)-based recognition search module; its function is to integrate the other three modules and solve the time-alignment problem, generating the recognized base-syllable sequence. A new multilevel pruning scheme designed to speed up the recognition process is also proposed. The whole MRNN can be trained by a sophisticated three-stage minimum classification error/generalized probabilistic descent (MCE/GPD) algorithm. Experimental results showed that the proposed method performed better than the maximum likelihood (ML)-trained hidden Markov model (HMM) method and was comparable to the MCE/GPD-trained HMM method. The multilevel pruning scheme was also found to be very efficient. [en_US]
dc.language.iso: en_US [en_US]
dc.subject: Mandarin speech recognition [en_US]
dc.subject: MCE/GPD algorithms [en_US]
dc.subject: modular recurrent neural networks [en_US]
dc.title: A modular RNN-based method for continuous Mandarin speech recognition [en_US]
dc.type: Article [en_US]
dc.identifier.doi: 10.1109/89.905999 [en_US]
dc.identifier.journal: IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING [en_US]
dc.citation.volume: 9 [en_US]
dc.citation.issue: 3 [en_US]
dc.citation.spage: 252 [en_US]
dc.citation.epage: 263 [en_US]
dc.contributor.department: 電信工程研究所 [zh_TW]
dc.contributor.department: Institute of Communications Engineering [en_US]
dc.identifier.wosnumber: WOS:000167288600007
dc.citation.woscount: 3
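
The abstract above describes how the DP-based recognition search module integrates frame-level discriminant scores from the base-syllable and diphone-like-unit RNN modules with boundary cues from the boundary-detection RNN. As a rough illustration of that kind of DP segmentation search (not the paper's actual algorithm), the Python sketch below uses invented inputs and parameters: the function dp_search, the per-frame score layouts, the duration limits, and the boundary weighting are all assumptions made for the example.

# Illustrative sketch only (not from the paper): a simplified DP segmentation
# search that combines per-frame syllable-class scores with per-frame boundary
# scores.  All names, score layouts, and parameters here are assumptions.
from typing import List, Optional, Tuple

def dp_search(
    syllable_scores: List[List[float]],   # [T][C]: frame-level discriminant score per class
    boundary_scores: List[float],         # [T]: frame-level syllable-boundary evidence
    min_dur: int = 3,                     # assumed minimum syllable duration (frames)
    max_dur: int = 30,                    # assumed maximum syllable duration (frames)
    boundary_weight: float = 1.0,         # assumed weight for the boundary cue
) -> Tuple[float, List[Tuple[int, int, int]]]:
    """Return (best score, segments), each segment being (start_frame, end_frame, class)."""
    T, C = len(syllable_scores), len(syllable_scores[0])
    NEG = float("-inf")
    best = [NEG] * (T + 1)                # best[t]: best score for frames 0..t-1 fully segmented
    best[0] = 0.0
    back: List[Optional[Tuple[int, int]]] = [None] * (T + 1)  # (segment start, class) of last segment

    for t in range(1, T + 1):
        for dur in range(min_dur, min(max_dur, t) + 1):
            s = t - dur
            if best[s] == NEG:
                continue
            for c in range(C):
                # score of labelling frames s..t-1 as class c, plus boundary evidence at the end
                seg = sum(syllable_scores[f][c] for f in range(s, t))
                cand = best[s] + seg + boundary_weight * boundary_scores[t - 1]
                if cand > best[t]:
                    best[t], back[t] = cand, (s, c)

    # trace back the best segmentation into (start, end, class) triples
    segments, t = [], T
    while t > 0 and back[t] is not None:
        s, c = back[t]
        segments.append((s, t, c))
        t = s
    segments.reverse()
    return best[T], segments

if __name__ == "__main__":
    # Toy run with random numbers standing in for the RNN module outputs.
    import random
    random.seed(0)
    T, C = 40, 5                          # 5 toy classes stand in for the 412 base-syllables
    syl = [[random.random() for _ in range(C)] for _ in range(T)]
    bnd = [random.random() for _ in range(T)]
    score, segs = dp_search(syl, bnd)
    print(round(score, 3), segs)

A real system would also apply something like the multilevel pruning scheme the abstract mentions to limit which segment candidates are scored; the brute-force loops here are kept only for clarity.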
Appears in Collections: Articles


Files in This Item:

  1. 000167288600007.pdf
