Title: | VOICE ACTIVITY DETECTION BASED ON FREQUENCY MODULATION OF HARMONICS |
Authors: | Hsu, Chung-Chien Lin, Tse-En Chen, Jian-Hueng Chi, Tai-Shih 電機資訊學士班 Undergraduate Honors Program of Electrical Engineering and Computer Science |
Keywords: | voice activity detection;frequency modulation;spectro-temporal analysis |
Issue Date: | 2013 |
Abstract: | In this paper, we propose a voice activity detection (VAD) algorithm based on spectro-temporal modulation structures of input sounds. A multi-resolution spectro-temporal analysis framework is used to inspect prominent speech structures. By comparing with an adaptive threshold, the proposed VAD distinguishes speech from non-speech based on the energy of the frequency modulation of harmonics. Compared with three standard VADs, ITU-T G. 729B, ETSI AMR1 and AMR2, our proposed VAD significantly outperforms them in non-stationary noises in terms of the receiver operating characteristic (ROC) curves and the recognition rates from a practical distributed speech recognition (DSR) system. |
URI: | http://hdl.handle.net/11536/23535 |
ISBN: | 978-1-4799-0356-6 |
ISSN: | 1520-6149 |
Journal: | 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) |
Begin Page: | 6679 |
End Page: | 6683 |
Appears in Collections: | Conferences Paper |