Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Hsu, Chung-Chien | en_US |
dc.contributor.author | Lin, Tse-En | en_US |
dc.contributor.author | Chen, Jian-Hueng | en_US |
dc.contributor.author | Chi, Tai-Shih | en_US |
dc.date.accessioned | 2014-12-08T15:34:22Z | - |
dc.date.available | 2014-12-08T15:34:22Z | - |
dc.date.issued | 2013 | en_US |
dc.identifier.isbn | 978-1-4799-0356-6 | en_US |
dc.identifier.issn | 1520-6149 | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/23535 | - |
dc.description.abstract | In this paper, we propose a voice activity detection (VAD) algorithm based on spectro-temporal modulation structures of input sounds. A multi-resolution spectro-temporal analysis framework is used to inspect prominent speech structures. By comparing with an adaptive threshold, the proposed VAD distinguishes speech from non-speech based on the energy of the frequency modulation of harmonics. Compared with three standard VADs, ITU-T G.729B, ETSI AMR1 and AMR2, our proposed VAD significantly outperforms them in non-stationary noises in terms of the receiver operating characteristic (ROC) curves and the recognition rates from a practical distributed speech recognition (DSR) system. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | voice activity detection | en_US |
dc.subject | frequency modulation | en_US |
dc.subject | spectro-temporal analysis | en_US |
dc.title | VOICE ACTIVITY DETECTION BASED ON FREQUENCY MODULATION OF HARMONICS | en_US |
dc.type | Proceedings Paper | en_US |
dc.identifier.journal | 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | en_US |
dc.citation.spage | 6679 | en_US |
dc.citation.epage | 6683 | en_US |
dc.contributor.department | 電機資訊學士班 | zh_TW |
dc.contributor.department | Undergraduate Honors Program of Electrical Engineering and Computer Science | en_US |
dc.identifier.wosnumber | WOS:000329611506169 | - |
Appears in Collections: | Conferences Paper |