標題: | Group Sparse Hidden Markov Models for Speech Recognition |
作者: | Chien, Jen-Tzung Chiang, Cheng-Chun 電機資訊學士班 Undergraduate Honors Program of Electrical Engineering and Computer Science |
關鍵字: | Bayesian learning;group sparsity;hidden Markov model;speech recognition |
公開日期: | 2012 |
摘要: | This paper presents the group sparse hidden Markov models (GS-HMMs) where a sequence of acoustic features is driven by Markov chain and each feature vector is represented by two groups of basis vectors. The group of common bases represents the features across states within a HMM. The group of individual bases compensates the intra-state residual information. Importantly, the sparse prior for sensing weights is controlled by the Laplacian scale mixture (LSM) distribution which is obtained by multiplying Laplacian variable with an inverse Gamma variable. The scale mixture parameter in LSM makes the distribution even sparser. This parameter serves as an automatic relevance determination for selecting the relevant bases from two groups. The weights and two sets of bases in GS-HMMs are estimated via Bayesian learning. We apply this framework for acoustic modeling and show the robustness of GS-HMMs for speech recognition in presence of different noises types and SNRs. |
URI: | http://hdl.handle.net/11536/22056 |
ISBN: | 978-1-62276-759-5 |
期刊: | 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 |
起始頁: | 2645 |
結束頁: | 2648 |
顯示於類別: | 會議論文 |