Title: VOICE ACTIVITY DETECTION BASED ON FREQUENCY MODULATION OF HARMONICS
Authors: Hsu, Chung-Chien
Lin, Tse-En
Chen, Jian-Hueng
Chi, Tai-Shih
Undergraduate Honors Program of Electrical Engineering and Computer Science
Keywords: voice activity detection; frequency modulation; spectro-temporal analysis
Issue Date: 2013
Abstract: In this paper, we propose a voice activity detection (VAD) algorithm based on the spectro-temporal modulation structures of input sounds. A multi-resolution spectro-temporal analysis framework is used to inspect prominent speech structures. The proposed VAD distinguishes speech from non-speech by comparing the energy of the frequency modulation of harmonics against an adaptive threshold. Compared with three standard VADs (ITU-T G.729B, ETSI AMR1, and ETSI AMR2), the proposed VAD performs significantly better in non-stationary noise, both in terms of receiver operating characteristic (ROC) curves and in terms of recognition rates from a practical distributed speech recognition (DSR) system.
URI: http://hdl.handle.net/11536/23535
ISBN: 978-1-4799-0356-6
ISSN: 1520-6149
Journal: 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)
Start Page: 6679
End Page: 6683
Appears in Collections: Conference Papers
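
The abstract describes a decision rule that compares a per-frame energy against an adaptive threshold. The record gives no details of how the threshold adapts, so the following is only a minimal sketch of one common approach (a noise floor tracked by exponential averaging over frames judged to be non-speech). The function name `adaptive_threshold_vad` and the parameters `init_frames`, `alpha`, and `margin` are hypothetical, and the input is assumed to be an already computed per-frame feature energy; the multi-resolution spectro-temporal analysis that would produce the harmonic frequency-modulation energy is not implemented here.

```python
def adaptive_threshold_vad(feature_energy, init_frames=10, alpha=0.95, margin=3.0):
    """Label each frame as speech (True) or non-speech (False).

    feature_energy: sequence of non-negative per-frame energies. In the paper
    this would be the energy of the frequency modulation of harmonics; here it
    is simply taken as given.
    init_frames, alpha, margin: hypothetical tuning parameters, not values
    taken from the paper.
    """
    energy = [float(e) for e in feature_energy]
    if not energy:
        return []

    # Assumption: the first few frames contain noise only, so their mean
    # serves as the initial noise-floor estimate.
    n_init = max(1, min(init_frames, len(energy)))
    noise_floor = sum(energy[:n_init]) / n_init

    eps = 1e-12  # keeps the threshold positive when the floor is zero
    decisions = []
    for e in energy:
        threshold = margin * noise_floor + eps
        is_speech = e > threshold
        decisions.append(is_speech)
        if not is_speech:
            # Update the noise floor only on frames judged to be non-speech,
            # so that speech energy does not inflate the threshold.
            noise_floor = alpha * noise_floor + (1.0 - alpha) * e
    return decisions


if __name__ == "__main__":
    # Synthetic example: a flat noise level with a short high-energy burst.
    demo = [1.0] * 20 + [10.0] * 5 + [1.0] * 20
    labels = adaptive_threshold_vad(demo)
    print(sum(labels), "of", len(labels), "frames labelled as speech")
```

Updating the floor only on non-speech frames lets the threshold follow slowly varying noise; how well such a simple tracker copes with the non-stationary noises mentioned in the abstract would depend on the feature and on the chosen parameters.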