標題: Group Sparse Hidden Markov Models for Speech Recognition
作者: Chien, Jen-Tzung
Chiang, Cheng-Chun
電機資訊學士班
Undergraduate Honors Program of Electrical Engineering and Computer Science
關鍵字: Bayesian learning;group sparsity;hidden Markov model;speech recognition
公開日期: 2012
摘要: This paper presents the group sparse hidden Markov models (GS-HMMs) where a sequence of acoustic features is driven by Markov chain and each feature vector is represented by two groups of basis vectors. The group of common bases represents the features across states within a HMM. The group of individual bases compensates the intra-state residual information. Importantly, the sparse prior for sensing weights is controlled by the Laplacian scale mixture (LSM) distribution which is obtained by multiplying Laplacian variable with an inverse Gamma variable. The scale mixture parameter in LSM makes the distribution even sparser. This parameter serves as an automatic relevance determination for selecting the relevant bases from two groups. The weights and two sets of bases in GS-HMMs are estimated via Bayesian learning. We apply this framework for acoustic modeling and show the robustness of GS-HMMs for speech recognition in presence of different noises types and SNRs.
URI: http://hdl.handle.net/11536/22056
ISBN: 978-1-62276-759-5
期刊: 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3
起始頁: 2645
結束頁: 2648
顯示於類別:會議論文