標題: | Bayesian Factorization and Learning for Monaural Source Separation |
作者: | Chien, Jen-Tzung Yang, Po-Kai 電機資訊學士班 Undergraduate Honors Program of Electrical Engineering and Computer Science |
關鍵字: | Bayesian learning;model complexity;monaural source separation;nonnegative matrix factorization |
公開日期: | 1-一月-2016 |
摘要: | This paper presents a new Bayesian nonnegative matrix factorization (NMF) for monaural source separation. Using this approach, the reconstruction error based on NMF is represented by a Poisson distribution, and the NMF parameters, consisting of the basis and weight matrices, are characterized by the exponential priors. A variational Bayesian inference procedure is developed to learn variational parameters and model parameters. The randomness in separation process is faithfully represented so that the system robustness to model variations in heterogeneous environments could be achieved. Importantly, the exponential prior parameters are used to impose sparseness in basis representation. The variational lower bound of log marginal likelihood is adopted as the objective to control model complexity. The dependencies of variational objective on model parameters are fully characterized in the derived closed-form solution. A clustering algorithm is performed to find the groups of bases for unsupervised source separation. The experiments on speech/music separation and singing voice separation show that the proposed Bayesian NMF (BNMF) with adaptive basis representation outperforms the NMF with fixed number of bases and the other BNMFs in terms of signal-to-distortion ratio and the global normalized source to distortion ratio. |
URI: | http://dx.doi.org/10.1109/TASLP.2015.2502141 http://hdl.handle.net/11536/129513 |
ISSN: | 2329-9290 |
DOI: | 10.1109/TASLP.2015.2502141 |
期刊: | IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING |
Volume: | 24 |
起始頁: | 185 |
結束頁: | 195 |
顯示於類別: | 期刊論文 |