Robust speaker's location detection in a vehicle environment using GMM models

doi:10.1109/TSMCB.2005.859084

Full metadata record

DC Field	Value	Language
dc.contributor.author	Hu, Jwu-Sheng	en_US
dc.contributor.author	Cheng, Chieh-Cheng	en_US
dc.contributor.author	Liu, Wei-Han	en_US
dc.date.accessioned	2014-12-08T15:16:57Z	-
dc.date.available	2014-12-08T15:16:57Z	-
dc.date.issued	2006-04-01	en_US
dc.identifier.issn	1083-4419	en_US
dc.identifier.uri	http://dx.doi.org/10.1109/TSMCB.2005.859084	en_US
dc.identifier.uri	http://hdl.handle.net/11536/12420	-
dc.description.abstract	Human-computer interaction (HCI) using speech communication is becoming increasingly important, especially in driving, where safety is the primary concern. Knowing the speaker's location (i.e., speaker localization) not only improves the enhancement results of a corrupted signal, but also provides assistance to speaker identification. Since conventional speech localization algorithms suffer from the uncertainties of environmental complexity and noise, as well as from the microphone mismatch problem, they are frequently not robust in practice. Without high reliability, the acceptance of speech-based HCI would never be realized. This work presents a novel speaker's location detection method and demonstrates high accuracy within a vehicle cabin using a single linear microphone array. The proposed approach utilizes Gaussian mixture models (GMM) to model the distributions of the phase differences among the microphones caused by the complex characteristics of room acoustics and microphone mismatch. The model can be applied in both near-field and far-field situations in a noisy environment. Each Gaussian component of a GMM represents a general location-dependent but content- and speaker-independent phase difference distribution. Moreover, the scheme performs well not only in non-line-of-sight cases, but also when the speakers are aligned toward the microphone array but at different distances from it. This strong performance can be achieved by exploiting the fact that the phase difference distributions at different locations are distinguishable in the environment of a car. The experimental results also show that the proposed method outperforms the conventional multiple signal classification (MUSIC) technique at various SNRs.	en_US
dc.language.iso	en_US	en_US
dc.subject	Gaussian mixture models (GMM)	en_US
dc.subject	human-computer interaction (HCI)	en_US
dc.subject	microphone array	en_US
dc.subject	sound localization	en_US
dc.title	Robust speaker's location detection in a vehicle environment using GMM models	en_US
dc.type	Article	en_US
dc.identifier.doi	10.1109/TSMCB.2005.859084	en_US
dc.identifier.journal	IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS	en_US
dc.citation.volume	36	en_US
dc.citation.issue	2	en_US
dc.citation.spage	403	en_US
dc.citation.epage	412	en_US
dc.contributor.department	電控工程研究所	zh_TW
dc.contributor.department	Institute of Electrical and Control Engineering	en_US
dc.identifier.wosnumber	WOS:000252227000013	-
dc.citation.woscount	10	-
Appears in Collections:	Articles

Files in This Item:

000252227000013.pdf

If the file is a zip archive, download and extract it, then open index.html in the extracted folder with a browser to view the full text.