Title: Robust Voice Activity Detection Algorithm Based on Feature of Frequency Modulation of Harmonics and Its DSP Implementation
Authors: Hsu, Chung-Chien
Cheong, Kah-Meng
Chi, Tai-Shih
Tsao, Yu
Department of Electrical and Computer Engineering
Keywords: digital signal processor;frequency modulation;spectrotemporal analysis;voice activity detection
Issue Date: 1-Oct-2015
Abstract: This paper proposes a voice activity detection (VAD) algorithm based on an energy-related feature of the frequency modulation of harmonics. A multi-resolution spectro-temporal analysis framework, which was developed to extract texture features of the audio signal from its Fourier spectrogram, is used to extract frequency modulation features of the speech signal. The proposed algorithm labels the voice-active segments of the speech signal by comparing the energy-related feature of the frequency modulation of harmonics with a threshold. The proposed VAD is then implemented on a Texas Instruments (TI) digital signal processor (DSP) platform for real-time operation. Simulations conducted on the DSP platform demonstrate that the proposed VAD performs significantly better than three standard VADs (ITU-T G.729B, ETSI AMR1, and ETSI AMR2) in non-stationary noise, in terms of both the receiver operating characteristic (ROC) curves and the recognition rates of a practical distributed speech recognition (DSR) system.
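The thresholding step described in the abstract, and the way a ROC curve is traced by sweeping that threshold, can be illustrated with a minimal sketch. This is not the authors' implementation: the per-frame feature below is plain log energy, used only as a stand-in for the paper's energy-related feature of the frequency modulation of harmonics, and the function names, frame length, hop size, and threshold values are all assumptions chosen for illustration.

```python
import numpy as np

def frame_feature(signal, frame_len=256, hop=128):
    """Placeholder per-frame feature: log energy in dB.

    ASSUMPTION: this stands in for the paper's harmonic
    frequency-modulation feature, which is not reproduced here.
    """
    n_frames = 1 + (len(signal) - frame_len) // hop
    feats = np.empty(n_frames)
    for i in range(n_frames):
        frame = signal[i * hop : i * hop + frame_len]
        feats[i] = 10.0 * np.log10(np.mean(frame ** 2) + 1e-12)
    return feats

def vad_decisions(feats, threshold):
    """Label a frame as voice-active when its feature exceeds the threshold."""
    return feats > threshold

def roc_points(feats, labels, thresholds):
    """Sweep the threshold and return (false-positive rate, true-positive rate) pairs."""
    labels = np.asarray(labels, dtype=bool)
    points = []
    for t in thresholds:
        d = vad_decisions(feats, t)
        tpr = np.sum(d & labels) / max(np.sum(labels), 1)
        fpr = np.sum(d & ~labels) / max(np.sum(~labels), 1)
        points.append((float(fpr), float(tpr)))
    return points

# Synthetic example: 1024 samples of low-level noise followed by
# 1024 samples of a 200 Hz tone (hypothetical 8 kHz sampling rate).
rng = np.random.default_rng(0)
sig = 0.001 * rng.standard_normal(2048)
sig[1024:] += np.sin(2 * np.pi * 200 * np.arange(1024) / 8000)

feats = frame_feature(sig)
decisions = vad_decisions(feats, threshold=-30.0)
```

Each threshold yields one operating point, so sweeping it from very low to very high moves the detector from "everything is speech" to "nothing is speech"; the ROC comparison mentioned in the abstract is built from such points.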
URI: http://dx.doi.org/10.1587/transinf.2015EDP7138
http://hdl.handle.net/11536/129426
ISSN: 1745-1361
DOI: 10.1587/transinf.2015EDP7138
Journal: IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS
Volume: E98D
Issue: 10
Start page: 1808
End page: 1817
Appears in Collections: Journal Articles


Files in This Item:

  1. a68cbaa0802dfebfa4e94a77e3339be5.pdf
