标题: RECALL NEURAL NETWORK FOR SOURCE SEPARATION
作者: Chien, Jen-Tzung
Tsou, Kai-Wei
电机工程学系
Department of Electrical and Computer Engineering
关键字: long short-term memory;sequence-to-sequence learning;recall neural network;source separation
公开日期: 1-一月-2018
摘要: This paper presents a novel memory-augmented neural network for single-channel source separation. We propose a recall neural network (RCNN) where a couple of external memories are realized for sequence-to-sequence learning based on an encoder and a decoder. These memories are learned in a two-pass sensing procedure where the mixed signal is encoded and then decoded (or recalled) as context vectors by using a bidirectional long short-term memory (LSTM) and a LSTM, respectively. These context vectors are integrated in a gating layer. A set of attention weights are calculated to attend the hidden state of decoder to implement a recurrent neural network for source separation. A gated attention mechanism is carried out to fulfill a specialized memory network. The regression errors due to two passes of sensing procedure and one pass of gated attention are jointly minimized to estimate the weight parameters of different components in different layers. The experiments on multi-speaker speech enhancement show that the proposed RCNN consistently outperforms LSTM and neural Turing machine in different settings.
URI: http://hdl.handle.net/11536/150763
期刊: 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)
起始页: 2956
结束页: 2960
显示于类别:Conferences Paper