Full metadata record
DC FieldValueLanguage
dc.contributor.authorHsu, Chung-Chienen_US
dc.contributor.authorLin, Tse-Enen_US
dc.contributor.authorChen, Jian-Huengen_US
dc.contributor.authorChi, Tai-Shihen_US
dc.date.accessioned2014-12-08T15:34:22Z-
dc.date.available2014-12-08T15:34:22Z-
dc.date.issued2013en_US
dc.identifier.isbn978-1-4799-0356-6en_US
dc.identifier.issn1520-6149en_US
dc.identifier.urihttp://hdl.handle.net/11536/23535-
dc.description.abstractIn this paper, we propose a voice activity detection (VAD) algorithm based on spectro-temporal modulation structures of input sounds. A multi-resolution spectro-temporal analysis framework is used to inspect prominent speech structures. By comparing with an adaptive threshold, the proposed VAD distinguishes speech from non-speech based on the energy of the frequency modulation of harmonics. Compared with three standard VADs, ITU-T G. 729B, ETSI AMR1 and AMR2, our proposed VAD significantly outperforms them in non-stationary noises in terms of the receiver operating characteristic (ROC) curves and the recognition rates from a practical distributed speech recognition (DSR) system.en_US
dc.language.isoen_USen_US
dc.subjectvoice activity detectionen_US
dc.subjectfrequency modulationen_US
dc.subjectspectro-temporal analysisen_US
dc.titleVOICE ACTIVITY DETECTION BASED ON FREQUENCY MODULATION OF HARMONICSen_US
dc.typeProceedings Paperen_US
dc.identifier.journal2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)en_US
dc.citation.spage6679en_US
dc.citation.epage6683en_US
dc.contributor.department電機資訊學士班zh_TW
dc.contributor.departmentUndergraduate Honors Program of Electrical Engineering and Computer Scienceen_US
dc.identifier.wosnumberWOS:000329611506169-
Appears in Collections:Conferences Paper