以多解析度聽覺模型嵌入之神經網路模擬聽覺專注現象之語音強化演算法

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	賴貞延	zh_TW
dc.contributor.author	冀泰石	zh_TW
dc.contributor.author	Lai, Chen-Yen	en_US
dc.date.accessioned	2018-01-24T07:39:01Z	-
dc.date.available	2018-01-24T07:39:01Z	-
dc.date.issued	2017	en_US
dc.identifier.uri	http://etd.lib.nctu.edu.tw/cdrfb3/record/nctu/#GT070450723	en_US
dc.identifier.uri	http://hdl.handle.net/11536/140223	-
dc.description.abstract	於本論文中，我們根據神經生物學研究發現的專注聽覺現象和生物聽覺實驗發現的大腦聽覺皮質上神經作用的模式，結合現今正當紅的類神經網路學習，發想出一種獨特的類神經網路模型，並針對語音增強這個議題做討論，期望能藉由神經生理學的知識，有效的解決工程上的問題。而我們所設計的這個類神經網路模型，是以基本的卷積神經網路模型作為基底，再作微調整，特別的是，我們嵌入了由 NSL 提出的聽覺模型，把其用於模擬大腦皮質 A1 區，設計可同時解析時頻域資訊的濾波器，放置於卷積神經網路的卷積層當成初始值；之後模型經過訓練，根據設定目標的需要，會自動微調整其中參數，使輸入資料映射至目標的型態，而在我們的語音增強議題上，目標即是乾淨的語音參數。訓練完後的模型，之前嵌入卷積層的濾波器初始值也會被調整至可映射到乾淨語音參數的形式，即自動噪音消除，而這個模型參數微調整的動作，我們認為非常相似於神經生物學上的專注聽覺反應，即當有特定目的要達成時，大腦皮質產生的濾波器與在安靜環境中使用濾波器並不相同。我們設計了幾種不同的比較模型，並且也與傳統的神經網路模型進行比較，進而發現在訓練資料相當不足的情況下，我們所設計的模型表現都優於其他種模型，即可以快速地達到收斂的狀態。	zh_TW
dc.description.abstract	In this thesis, we propose a neural network to emulate auditory attention on speech enhancement. The proposed system integrates a spectro-temporal analytical auditory model with a multi-layer fully-connected network to form a quasi-CNN structure. The initial kernels of the convolutional layer are derived from the neuro-physiological auditory model. To simulate the plasticity of cortical neurons, the kernels are allowed to adjust themselves pertaining to the task at hand. For the application of speech enhancement, the Fourier spectrogram instead of the auditory spectrogram is used as input to the proposed system such that the speech signal can be well reconstructed. The proposed system performs comparably with standard DNN and CNN systems when plenty resources are available. On the other hand, under the limited-resource condition, the proposed system outperforms standard systems in all test settings.	en_US
dc.language.iso	en_US	en_US
dc.subject	語音增強	zh_TW
dc.subject	聽覺模型	zh_TW
dc.subject	專注聽覺現象	zh_TW
dc.subject	speech enhancement	en_US
dc.subject	auditory model	en_US
dc.subject	attentional hearing	en_US
dc.title	以多解析度聽覺模型嵌入之神經網路模擬聽覺專注現象之語音強化演算法	zh_TW
dc.title	Multi-resolution auditory model embedded neural network for attentional hearing on speech enhancement	en_US
dc.type	Thesis	en_US
dc.contributor.department	電機工程學系	zh_TW
顯示於類別：	畢業論文