Title: | VOICE ACTIVITY DETECTION BASED ON FREQUENCY MODULATION OF HARMONICS |
Authors: | Hsu, Chung-Chien; Lin, Tse-En; Chen, Jian-Hueng; Chi, Tai-Shih; Undergraduate Honors Program of Electrical Engineering and Computer Science |
Keywords: | voice activity detection;frequency modulation;spectro-temporal analysis |
Issue Date: | 2013 |
Abstract: | In this paper, we propose a voice activity detection (VAD) algorithm based on the spectro-temporal modulation structures of input sounds. A multi-resolution spectro-temporal analysis framework is used to inspect prominent speech structures. The proposed VAD distinguishes speech from non-speech by comparing the energy of the frequency modulation of harmonics with an adaptive threshold. Compared with three standard VADs (ITU-T G.729B, ETSI AMR1, and ETSI AMR2), the proposed VAD performs significantly better in non-stationary noise, in terms of both receiver operating characteristic (ROC) curves and recognition rates from a practical distributed speech recognition (DSR) system. |
URI: | http://hdl.handle.net/11536/23535 |
ISBN: | 978-1-4799-0356-6 |
ISSN: | 1520-6149 |
Journal: | 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) |
Start Page: | 6679 |
End Page: | 6683 |
Appears in Collections: | Conference Papers |