Title: BAYESIAN RECURRENT NEURAL NETWORK LANGUAGE MODEL
Authors: Chien, Jen-Tzung
Ku, Yuan-Chu
College of Electrical and Computer Engineering
Keywords: Recurrent neural network; language model; Bayesian learning; Hessian matrix
Publication date: 2014
Abstract: This paper presents a Bayesian approach to constructing the recurrent neural network language model (RNN-LM) for speech recognition. The idea is to regularize the RNN-LM by compensating for the uncertainty of the estimated model parameters, which is represented by a Gaussian prior. The objective function of the Bayesian RNN (BRNN) is formed as the regularized cross-entropy error function. The regularized model is constructed not only by training the regularized parameters according to the maximum a posteriori criterion but also by estimating the Gaussian hyperparameter through maximization of the marginal likelihood. A rapid approximation to the Hessian matrix is developed by selecting a small set of salient outer products and is shown to be effective for BRNN-LM. BRNN-LM yields a sparser model than RNN-LM. Experiments on different corpora show promising improvements from applying BRNN-LM with different amounts of training data.
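Note: The regularized cross-entropy objective mentioned in the abstract can be illustrated with a minimal sketch. It assumes a zero-mean isotropic Gaussian prior with a single precision hyperparameter, so that the negative log-prior reduces to a scaled squared norm of the weights; the record itself does not give the exact form. The function name `regularized_cross_entropy` and the arguments `logits`, `targets`, `weights`, and `alpha` are hypothetical and are not taken from the paper.

```python
import numpy as np


def regularized_cross_entropy(logits, targets, weights, alpha):
    """Cross-entropy error plus (alpha / 2) * ||weights||^2.

    Assumption: the penalty is the negative log of a zero-mean isotropic
    Gaussian prior with precision alpha, with constant terms dropped.
    """
    # Numerically stable log-softmax over the vocabulary axis.
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))

    # Sum of negative log-probabilities assigned to the target words.
    cross_entropy = -log_probs[np.arange(len(targets)), targets].sum()

    # Gaussian-prior penalty on the model parameters.
    penalty = 0.5 * alpha * np.sum(weights ** 2)
    return cross_entropy + penalty
```

Minimizing this quantity over the weights corresponds to maximum a posteriori estimation under the assumed prior; choosing `alpha` by maximizing the marginal likelihood, as the abstract describes, is not shown here.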
URI: http://hdl.handle.net/11536/135882
ISBN: 978-1-4799-7129-9
Journal: 2014 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY SLT 2014
Start page: 206
End page: 211
Appears in collections: Conference Papers