標題: | Location Classification of Nonstationary Sound Sources Using Binaural Room Distribution Patterns |
作者: | Hu, Jwu-Sheng Liu, Wei-Han 電控工程研究所 Institute of Electrical and Control Engineering |
關鍵字: | Head-related transfer function (HRTF);interaural level difference (ILD);interaural phase difference (IPD);sound source localization |
公開日期: | 1-五月-2009 |
摘要: | This paper discusses the relationships between the nonstationarity of sound sources and the distribution patterns of interaural phase differences (IPDs) and interaural level differences (ILDs) based on short-term frequency analysis. The amplitude variation of nonstationary sound sources is modeled by the exponent of polynomials from the concept of moving pole model. According to the model, the sufficient condition for utilizing the distribution patterns of IPDs and ILDs to localize a nonstationary sound source is suggested and the phenomena of multiple peaks in the distribution pattern can be explained. Simulation is performed to interpret the relation between the distribution patterns of IPD and ILD and the nonstationary sound source. Furthermore, a Gaussian-mixture binaural room distribution model (GMBRDM) is proposed to model distribution patterns of IPDs and ILDs for nonstationary sound source location classification. The effectiveness and performance of the proposed GMBRDM are demonstrated by experimental results. |
URI: | http://dx.doi.org/10.1109/TASL.2008.2011528 http://hdl.handle.net/11536/7331 |
ISSN: | 1558-7916 |
DOI: | 10.1109/TASL.2008.2011528 |
期刊: | IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING |
Volume: | 17 |
Issue: | 4 |
起始頁: | 682 |
結束頁: | 692 |
顯示於類別: | 期刊論文 |