標題: Location Classification of Nonstationary Sound Sources Using Binaural Room Distribution Patterns
作者: Hu, Jwu-Sheng
Liu, Wei-Han
電控工程研究所
Institute of Electrical and Control Engineering
關鍵字: Head-related transfer function (HRTF);interaural level difference (ILD);interaural phase difference (IPD);sound source localization
公開日期: 1-五月-2009
摘要: This paper discusses the relationships between the nonstationarity of sound sources and the distribution patterns of interaural phase differences (IPDs) and interaural level differences (ILDs) based on short-term frequency analysis. The amplitude variation of nonstationary sound sources is modeled by the exponent of polynomials from the concept of moving pole model. According to the model, the sufficient condition for utilizing the distribution patterns of IPDs and ILDs to localize a nonstationary sound source is suggested and the phenomena of multiple peaks in the distribution pattern can be explained. Simulation is performed to interpret the relation between the distribution patterns of IPD and ILD and the nonstationary sound source. Furthermore, a Gaussian-mixture binaural room distribution model (GMBRDM) is proposed to model distribution patterns of IPDs and ILDs for nonstationary sound source location classification. The effectiveness and performance of the proposed GMBRDM are demonstrated by experimental results.
URI: http://dx.doi.org/10.1109/TASL.2008.2011528
http://hdl.handle.net/11536/7331
ISSN: 1558-7916
DOI: 10.1109/TASL.2008.2011528
期刊: IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Volume: 17
Issue: 4
起始頁: 682
結束頁: 692
顯示於類別:期刊論文


文件中的檔案:

  1. 000274223300013.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。