Title: Laplace Group Sensing for Acoustic Models
Author: Chien, Jen-Tzung
Undergraduate Honors Program of Electrical Engineering and Computer Science
Keywords: Acoustic model; basis representation; group sparsity; Laplace distribution; speech recognition
Publication date: 1-May-2015
Abstract: This paper presents group sparse learning for acoustic models, in which a sequence of acoustic features is driven by a Markov chain and each feature vector is represented by groups of basis vectors. The group of common bases represents the features across Markov states within a regression class. The group of individual bases compensates for the intra-state residual information. A Laplace distribution is used as the sparse prior on the sensing weights for group basis representation. The Laplace parameter serves as a regularization parameter, or automatic relevance determination parameter, that controls the selection of relevant bases for acoustic modeling. The groups of regularization parameters and basis vectors are estimated from training data by maximizing the marginal likelihood over the sensing weights, which is implemented by a Laplace approximation using the Hessian matrix and the maximum a posteriori parameters. Model uncertainty is compensated through a full Bayesian treatment. The connection of Laplace group sensing to lasso regularization is illustrated. Experiments on noisy speech recognition show the robustness of group sparse acoustic models in the presence of different noise types and SNRs.
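Note: the link between a Laplace prior and lasso regularization mentioned in the abstract can be illustrated with a minimal sketch. It assumes a single feature vector x modeled as a weighted sum of basis vectors with unit-variance Gaussian noise and an independent Laplace prior with parameter lam on each weight. Under those assumptions the maximum a posteriori weights minimize 0.5 * ||x - basis w||^2 + lam * ||w||_1, which is a lasso problem. The function names, the toy data, and the use of iterative soft thresholding (ISTA) are illustrative assumptions. This is not the paper's estimator, which works with grouped bases and a marginal likelihood.

```python
# Sketch only: MAP estimation of sensing weights under a Laplace prior,
# which reduces to a lasso problem. Not the paper's group-sparse estimator.

def soft_threshold(v, tau):
    """Proximal operator of tau * |v| (shrinks v toward zero)."""
    if v > tau:
        return v - tau
    if v < -tau:
        return v + tau
    return 0.0


def map_weights(x, basis, lam, n_iter=500):
    """Minimize 0.5 * ||x - basis @ w||^2 + lam * ||w||_1 with ISTA.

    x     : list of D floats (one feature vector)
    basis : D x K nested list whose columns are basis vectors
    lam   : Laplace parameter (hypothetical value chosen by the caller)
    """
    dim, n_bases = len(basis), len(basis[0])
    # 1 / ||basis||_F^2 is a safe step size: the squared Frobenius norm
    # upper-bounds the Lipschitz constant of the squared-error gradient.
    step = 1.0 / sum(b * b for row in basis for b in row)
    w = [0.0] * n_bases
    for _ in range(n_iter):
        # Residual r = basis @ w - x
        r = [sum(basis[d][k] * w[k] for k in range(n_bases)) - x[d]
             for d in range(dim)]
        # Gradient of the squared-error term: basis^T @ r
        g = [sum(basis[d][k] * r[d] for d in range(dim))
             for k in range(n_bases)]
        # Gradient step followed by shrinkage from the Laplace (L1) term
        w = [soft_threshold(w[k] - step * g[k], step * lam)
             for k in range(n_bases)]
    return w


# Toy example with an orthonormal (identity) basis.
basis = [[1.0, 0.0, 0.0],
         [0.0, 1.0, 0.0],
         [0.0, 0.0, 1.0]]
x = [2.0, 0.3, -1.0]
w = map_weights(x, basis, lam=0.5)
```

With this toy input the weights converge to approximately [1.5, 0.0, -0.5]: the coefficient 0.3 falls below lam and is set exactly to zero, so a larger Laplace parameter prunes more bases.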
URI: http://dx.doi.org/10.1109/TASLP.2015.2412466
http://hdl.handle.net/11536/124443
ISSN: 2329-9290
DOI: 10.1109/TASLP.2015.2412466
Journal: IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Volume: 23
Start page: 909
End page: 922
Appears in collections: Journal Articles