Full metadata record
DC FieldValueLanguage
dc.contributor.authorHu, Jwu-Shengen_US
dc.contributor.authorCheng, Chieh-Chengen_US
dc.contributor.authorLiu, Wei-Hanen_US
dc.date.accessioned2014-12-08T15:16:57Z-
dc.date.available2014-12-08T15:16:57Z-
dc.date.issued2006-04-01en_US
dc.identifier.issn1083-4419en_US
dc.identifier.urihttp://dx.doi.org/10.1109/TSMCB.2005.859084en_US
dc.identifier.urihttp://hdl.handle.net/11536/12420-
dc.description.abstractHuman-computer interaction (HCI) using speech communication is becoming increasingly important, especially in driving where safety is the primary concern. Knowing the speaker's location (i.e., speaker localization) not only improves the enhancement results of a corrupted signal, but also provides assistance to speaker identification. Since conventional speech localization algorithms suffer from the uncertainties of environmental complexity and noise, as well as from the microphone mismatch problem, they are frequently not robust in practice. Without a high reliability, the acceptance of speech-based HCI would never be realized. This work presents a novel speaker's location detection method and demonstrates high accuracy within a vehicle cabinet using a single linear microphone array. The proposed approach utilize Gaussian mixture models (GMM) to model the distributions of the phase differences among the microphones caused by the complex characteristic of room acoustic and microphone mismatch. The model can be applied both in near-field and far-field situations in a noisy environment. The individual Gaussian component of a GMM represents some general location-dependent but content and speaker-independent phase difference distributions. Moreover, the scheme performs well not only in nonline-of-sight cases, but also when the speakers are aligned toward the microphone array but at difference distances from it. This strong performance can be achieved by exploiting the fact that the phase difference distributions at different locations are distinguishable in the environment of a. car. The experimental results also show that the proposed method outperforms the conventional multiple signal classification method (MUSIC) technique at various SNRs.en_US
dc.language.isoen_USen_US
dc.subjectGaussian mixture models (GMM)en_US
dc.subjecthuman-computer interaction (HCI)en_US
dc.subjectmicrophone arrayen_US
dc.subjectsound localizationen_US
dc.titleRobust speaker's location detection in a vehicle environment using GMM modelsen_US
dc.typeArticleen_US
dc.identifier.doi10.1109/TSMCB.2005.859084en_US
dc.identifier.journalIEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICSen_US
dc.citation.volume36en_US
dc.citation.issue2en_US
dc.citation.spage403en_US
dc.citation.epage412en_US
dc.contributor.department電控工程研究所zh_TW
dc.contributor.departmentInstitute of Electrical and Control Engineeringen_US
dc.identifier.wosnumberWOS:000252227000013-
dc.citation.woscount10-
Appears in Collections:Articles


Files in This Item:

  1. 000252227000013.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.