标题: | Laplace Group Sensing for Acoustic Models |
作者: | Chien, Jen-Tzung 電機資訊學士班 Undergraduate Honors Program of Electrical Engineering and Computer Science |
关键字: | Acoustic model;basis representation;group sparsity;Laplace distribution;speech recognition |
公开日期: | 1-五月-2015 |
摘要: | This paper presents the group sparse learning for acoustic models where a sequence of acoustic features is driven by Markov chain and each feature vector is represented by groups of basis vectors. The group of common bases represents the features across Markov states within a regression class. The group of individual basis compensates the intra-state residual information. Laplace distribution is used as the sparse prior of sensing weights for group basis representation. Laplace parameter serves as regularization parameter or automatic relevance determination which controls the selection of relevant bases for acoustic modeling. The groups of regularization parameters and basis vectors are estimated from training data by maximizing the marginal likelihood over sensing weights which is implemented by Laplace approximation using the Hessian matrix and the maximum a posteriori parameters. Model uncertainty is compensated through full Bayesian treatment. The connection of Laplace group sensing to lasso regularization is illustrated. Experiments on noisy speech recognition show the robustness of group sparse acoustic models in presence of different noise types and SNRs. |
URI: | http://dx.doi.org/10.1109/TASLP.2015.2412466 http://hdl.handle.net/11536/124443 |
ISSN: | 2329-9290 |
DOI: | 10.1109/TASLP.2015.2412466 |
期刊: | IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING |
Volume: | 23 |
起始页: | 909 |
结束页: | 922 |
显示于类别: | Articles |