標題: Bayesian Factorization and Learning for Monaural Source Separation
作者: Chien, Jen-Tzung
Yang, Po-Kai
電機資訊學士班
Undergraduate Honors Program of Electrical Engineering and Computer Science
關鍵字: Bayesian learning;model complexity;monaural source separation;nonnegative matrix factorization
公開日期: 1-一月-2016
摘要: This paper presents a new Bayesian nonnegative matrix factorization (NMF) for monaural source separation. Using this approach, the reconstruction error based on NMF is represented by a Poisson distribution, and the NMF parameters, consisting of the basis and weight matrices, are characterized by the exponential priors. A variational Bayesian inference procedure is developed to learn variational parameters and model parameters. The randomness in separation process is faithfully represented so that the system robustness to model variations in heterogeneous environments could be achieved. Importantly, the exponential prior parameters are used to impose sparseness in basis representation. The variational lower bound of log marginal likelihood is adopted as the objective to control model complexity. The dependencies of variational objective on model parameters are fully characterized in the derived closed-form solution. A clustering algorithm is performed to find the groups of bases for unsupervised source separation. The experiments on speech/music separation and singing voice separation show that the proposed Bayesian NMF (BNMF) with adaptive basis representation outperforms the NMF with fixed number of bases and the other BNMFs in terms of signal-to-distortion ratio and the global normalized source to distortion ratio.
URI: http://dx.doi.org/10.1109/TASLP.2015.2502141
http://hdl.handle.net/11536/129513
ISSN: 2329-9290
DOI: 10.1109/TASLP.2015.2502141
期刊: IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Volume: 24
起始頁: 185
結束頁: 195
顯示於類別:期刊論文