貝氏非負矩陣分解於單通道音訊分離之研究

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	楊博凱	en_US
dc.contributor.author	Yang, Po-Kai	en_US
dc.contributor.author	簡仁宗	en_US
dc.contributor.author	Chien, Jen-Tzung	en_US
dc.date.accessioned	2014-12-12T02:42:26Z	-
dc.date.available	2014-12-12T02:42:26Z	-
dc.date.issued	2013	en_US
dc.identifier.uri	http://140.113.39.130/cdrfb3/record/nctu/#GT070160265	en_US
dc.identifier.uri	http://hdl.handle.net/11536/75116	-
dc.description.abstract	本論文提出貝氏(Bayesian)非負矩陣分解(Nonnegative Matrix Factorization)演算法並實現於單通道(Single-Channel)語音與音樂訊號分離以及歌聲(Singing Voice)與背景伴奏(Accompaniment)訊號分離。我們使用卜瓦松(Poisson)機率分佈作為相似度(Likelihood)函數，並且以指數(Exponential)機率分佈表示基底矩陣與權重矩陣之事前(Prior)資訊來建立起非負矩陣分解模型，透過變異性貝氏(Variational Bayesian) 期望最大化(Expectation-Maximization, EM)演算法有效地找出貝氏非負矩陣分解模型中變異性參數與模型參數的封閉解。此模型的特色在於指數事前機率分佈的參數可以用來控制基底矩陣的稀疏程度。在變異性貝氏推論過程中，變異性下界(Lower Bound)是以邊際化相似度函數(Marginal Likelihood)為評比基準來處理模型正規化(Model Regularization)的議題，並針對不同的混合訊號自動找出合適的基底個數。本論文評估兩套單通道音訊分離實驗效能，監督式(Supervised)語音與音樂分離以及非監督式(Unsupervised)歌聲與背景伴奏分離。在監督式學習實驗裡，我們驗證了透過貝氏非負矩陣分解自動選取基底個數比傳統非負矩陣分解固定基底個數有較佳的分離效果。另外，非監督式學習必須額外將分解出的基底向量作分群，我們提出的非負矩陣分解分群法及貝氏訊號分離，已證實優於現今文獻中所有其他方法的分離效果。	zh_TW
dc.description.abstract	This paper proposes a new Bayesian nonnegative matrix factorization (NMF) approach for speech and music separation as well as for singing voice separation from background music accompaniment. Using this approach, the reconstruction error based on NMF is represented by a Poisson distribution and the NMF parameters, consisting of basis and weight matrices, are characterized by the exponential priors. A variational Bayesian (VB) expectation-maximization (EM) algorithm is developed to implement an efficient closed-form solution to variational parameters and model parameters for monaural audio source separation. Importantly, the exponential prior parameter is used to control the sparseness in basis representation. The variational lower bound in VB-EM procedure is derived as an objective to conduct adaptive basis selection for different mixed signals with variations from different speakers, singers, instruments and background accompaniments. Model regularization is tackled through the uncertainty modeling via variational inference based on the maximization of marginal likelihood. The experiments on supervised single-channel speech/music separation show that the adaptive basis representation in Bayesian NMF performs better than the NMF with the fixed number of bases in terms of signal-to-distortion ratio. In addition, we implement the proposed Bayesian NMF for unsupervised monaural singing voice separation where an additional grouping of the factorized basis vectors is performed. The two groups of basis vectors are obtained to reconstruct the source signals of singing voice and background accompaniment. The experimental results on MIR-1K database demonstrate that the Bayesian NMF performs better than other unsupervised separation algorithms in terms of the global normalized source to distortion ratio.	en_US
dc.language.iso	en_US	en_US
dc.subject	非負矩陣分解	zh_TW
dc.subject	模型選擇	zh_TW
dc.subject	貝氏學習	zh_TW
dc.subject	單通道訊號分離	zh_TW
dc.subject	歌聲分離	zh_TW
dc.subject	nonnegative matrix factorization	en_US
dc.subject	model selection	en_US
dc.subject	Bayesian learning	en_US
dc.subject	monaural source separation	en_US
dc.subject	singing voice separation	en_US
dc.title	貝氏非負矩陣分解於單通道音訊分離之研究	zh_TW
dc.title	Bayesian nonnegative matrix factorization for monaural audio source separation	en_US
dc.type	Thesis	en_US
dc.contributor.department	電信工程研究所	zh_TW
顯示於類別：	畢業論文