| Title: | VOICE ACTIVITY DETECTION BASED ON FREQUENCY MODULATION OF HARMONICS |
| Authors: | Hsu, Chung-Chien; Lin, Tse-En; Chen, Jian-Hueng; Chi, Tai-Shih; Undergraduate Honors Program of Electrical Engineering and Computer Science (電機資訊學士班) |
| Keywords: | voice activity detection;frequency modulation;spectro-temporal analysis |
| Issue Date: | 2013 |
| Abstract: | In this paper, we propose a voice activity detection (VAD) algorithm based on the spectro-temporal modulation structures of input sounds. A multi-resolution spectro-temporal analysis framework is used to inspect prominent speech structures. The proposed VAD distinguishes speech from non-speech by comparing the energy of the frequency modulation of harmonics with an adaptive threshold. Compared with three standard VADs (ITU-T G.729B, ETSI AMR1, and AMR2), the proposed VAD performs significantly better in non-stationary noise, in terms of both the receiver operating characteristic (ROC) curves and the recognition rates from a practical distributed speech recognition (DSR) system. |
| URI: | http://hdl.handle.net/11536/23535 |
| ISBN: | 978-1-4799-0356-6 |
| ISSN: | 1520-6149 |
| Journal: | 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) |
| Start Page: | 6679 |
| End Page: | 6683 |
| Appears in Collections: | Conferences Paper |

