标题: | DISCRIMINATIVE DEEP RECURRENT NEURAL NETWORKS FOR MONAURAL SPEECH SEPARATION |
作者: | Wang, Guan-Xiang Hsu, Chung-Chien Chien, Jen-Tzung 电机工程学系 Department of Electrical and Computer Engineering |
关键字: | deep learning;discriminative learning;neural network;monaural speech separation |
公开日期: | 2016 |
摘要: | Deep neural network is now a new trend towards solving different problems in speech processing. In this paper, we propose a discriminative deep recurrent neural network (DRNN) model for monaural speech separation. Our idea is to construct DRNN as a regression model to discover the deep structure and regularity for signal reconstruction from a mixture of two source spectra. To reinforce the discrimination capability between two separated spectra, we estimate DRNN separation parameters by minimizing an integrated objective function which consists of two measurements. One is the within source reconstruction errors due to the individual source spectra while the other conveys the discrimination information which preserves the mutual difference between two source spectra during the supervised training procedure. This discrimination information acts as a kind of regularization so as to maintain between-source separation in monaural source separation. In the experiments, we demonstrate the effectiveness of the proposed method for speech separation compared with the other methods. |
URI: | http://hdl.handle.net/11536/136363 |
ISBN: | 978-1-4799-9988-0 |
ISSN: | 1520-6149 |
期刊: | 2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS |
起始页: | 2544 |
结束页: | 2548 |
显示于类别: | Conferences Paper |