Title: A frequency bin-wise nonlinear masking algorithm in convolutive mixtures for speech segregation
Authors: Chi, Tai-Shih
Huang, Ching-Wen
Chou, Wen-Sheng
Department of Electrical and Computer Engineering
Publication date: 1 May 2012
Abstract: A frequency bin-wise nonlinear masking algorithm is proposed in the spectrogram domain for speech segregation in convolutive mixtures. The contributive weight from each speech source to a time-frequency unit of the mixture spectrogram is estimated by a nonlinear function based on location cues. For each sound source, a non-binary mask is formed from the estimated weights and is multiplied with the mixture spectrogram to extract the sound. Head-related transfer functions (HRTFs) are used to simulate convolutive sound mixtures perceived by listeners. Simulation results show that the proposed method outperforms convolutive independent component analysis and degenerate unmixing and estimation technique methods in almost all test conditions. © 2012 Acoustical Society of America
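Illustrative note: the masking step described in the abstract can be sketched roughly as follows. This is a minimal sketch, not the authors' implementation. The abstract does not specify the nonlinear weighting function, so a Gaussian kernel on a single location cue is assumed here purely for illustration, and the names `soft_masks`, `apply_masks`, `observed_cue`, `source_cues`, and `sigma` are hypothetical.

```python
import numpy as np

def soft_masks(observed_cue, source_cues, sigma=1.0):
    """Form non-binary masks from per-source weights.

    observed_cue: (F, T) location cue measured at each time-frequency unit.
    source_cues:  (S, F) cue expected from each source in each frequency bin.
    Returns an (S, F, T) array of masks that sum to 1 across sources.
    ASSUMPTION: the Gaussian weighting below is a stand-in for the paper's
    unspecified nonlinear function.
    """
    # Deviation of the observed cue from each source's expected cue, per bin.
    diff = observed_cue[None, :, :] - source_cues[:, :, None]
    weights = np.exp(-0.5 * (diff / sigma) ** 2)
    # Normalize so each unit's weights are shares of the mixture energy.
    total = weights.sum(axis=0, keepdims=True)
    return weights / np.maximum(total, 1e-12)

def apply_masks(mixture_spec, masks):
    """Multiply each source mask with the (F, T) complex mixture spectrogram."""
    return masks * mixture_spec[None, :, :]
```

Because the weights are computed independently for every frequency bin, the expected cue may differ from bin to bin, which is one way to read "frequency bin-wise" in the title. Each row of the result is an estimate of one source's spectrogram.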
URI: http://hdl.handle.net/11536/16328
ISSN: 0001-4966
Journal: JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA
Volume: 131
Issue: 5
End page: EL361
Appears in collections: Journal articles


Files in this item:

  1. 000303601600002.pdf

If the file is a zip archive, download and extract it, then open index.html in the extracted folder with a browser to view the full text.