Convolutional Neural Turing Machine for Speech Separation

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	Chien, Jen-Tzung	en_US
dc.contributor.author	Tsou, Kai-Wei	en_US
dc.date.accessioned	2019-08-02T02:24:19Z	-
dc.date.available	2019-08-02T02:24:19Z	-
dc.date.issued	2018-01-01	en_US
dc.identifier.isbn	978-1-5386-5627-3	en_US
dc.identifier.uri	http://hdl.handle.net/11536/152459	-
dc.description.abstract	Long short-term memory (LSTM) has been successfully developed for monaural speech separation. Temporal information is learned by using dynamic states which are evolved through time and stored as an internal memory. The spectro-temporal data matrix of mixed signal is flattened as input vectors. There are twofold limitations. First, the internal memory in LSTM could not sufficiently characterize long-term information from different sources. Second, the temporal correlation and frequency neighboring in the flattened vectors were smeared. To deal with these limitations, this paper presents a convolutional neural Turing machine (ConvNTM) where the feature maps of spectro-temporal data are extracted and embedded in an external memory at each time step. ConvNTM aims to preserve the spectro-temporal structure in long sequential signals which is exploited to estimate the separated spectral signals. An addressing mechanism is introduced to continuously calculate the read and write heads to retrieve and update memory slots, respectively. The memory augmented source separation is implemented for single-channel speech enhancement. Experimental results illustrate the superiority of ConvNTM to LSTM, NTM and convolutional LSTM for speech enhancement in terms of short-term objective intelligibility measure.	en_US
dc.language.iso	en_US	en_US
dc.subject	Recurrent neural network	en_US
dc.subject	convolutional neural network	en_US
dc.subject	neural Turing machine	en_US
dc.subject	monaural speech separation	en_US
dc.title	Convolutional Neural Turing Machine for Speech Separation	en_US
dc.type	Proceedings Paper	en_US
dc.identifier.journal	2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP)	en_US
dc.citation.spage	81	en_US
dc.citation.epage	85	en_US
dc.contributor.department	電機工程學系	zh_TW
dc.contributor.department	Department of Electrical and Computer Engineering	en_US
dc.identifier.wosnumber	WOS:000469313700017	en_US
dc.citation.woscount	0	en_US
顯示於類別：	會議論文