Full metadata record
DC Field | Value | Language
dc.contributor.author | Chen, Ming-Tso | en_US
dc.contributor.author | Li, Bo-Jun | en_US
dc.contributor.author | Chi, Tai-Shih | en_US
dc.date.accessioned | 2019-10-05T00:09:44Z | -
dc.date.available | 2019-10-05T00:09:44Z | -
dc.date.issued | 2019-01-01 | en_US
dc.identifier.isbn | 978-1-4799-8131-1 | en_US
dc.identifier.issn | 1520-6149 | en_US
dc.identifier.uri | http://hdl.handle.net/11536/152923 | -
dc.description.abstract | Inspired by human auditory perception, in this paper we propose a two-stage multi-resolution end-to-end model for singing melody extraction. A convolutional neural network (CNN) forms the core of the proposed model and generates multi-resolution representations. 1-D and 2-D multi-resolution analyses are carried out successively on the waveform and on spectrogram-like representations, using 1-D and 2-D CNN kernels of different lengths and sizes. The 1-D CNNs with kernels of different lengths produce multi-resolution spectrogram-like representations without suffering from the trade-off between spectral and temporal resolution. The 2-D CNNs with kernels of different sizes extract features from spectro-temporal envelopes at different scales. Experimental results show that the proposed model outperforms the three compared systems on three out of five public databases. | en_US
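The abstract's first stage — 1-D convolutions with kernels of several lengths applied directly to the waveform, each yielding a spectrogram-like map at its own time/frequency trade-off — can be illustrated with a minimal numpy sketch. This is not the authors' model: the filter bank here is random, and the function name, filter count, and hop size are assumptions chosen only to show how short kernels retain more time frames while long kernels trade them for finer spectral selectivity.

```python
import numpy as np

def multires_conv1d(waveform, kernel_lengths, n_filters=4, hop=64, seed=0):
    """Toy multi-resolution 1-D front-end (illustrative only): convolve a
    waveform with random filter banks of several kernel lengths, then
    rectify and downsample each output into a spectrogram-like map."""
    rng = np.random.default_rng(seed)
    maps = []
    for length in kernel_lengths:
        kernels = rng.standard_normal((n_filters, length))
        # valid-mode convolution of each kernel with the waveform
        out = np.stack([np.convolve(waveform, k, mode="valid") for k in kernels])
        # rectify and downsample in time, like a crude magnitude spectrogram
        maps.append(np.abs(out)[:, ::hop])
    return maps

# A short test signal; each map has shape (n_filters, time_frames), and
# shorter kernels leave more time frames than longer ones.
x = np.sin(2 * np.pi * 5 * np.linspace(0, 1, 4096))
maps = multires_conv1d(x, kernel_lengths=[32, 256, 1024])
print([m.shape for m in maps])
```

Because each kernel length is analyzed in its own branch, no single branch has to commit to one spectral/temporal resolution — which is the trade-off the abstract says the model avoids.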
dc.language.iso | en_US | en_US
dc.subject | Melody extraction | en_US
dc.subject | multi-resolution | en_US
dc.subject | convolution neural network | en_US
dc.subject | end-to-end learning | en_US
dc.subject | music information retrieval | en_US
dc.title | CNN BASED TWO-STAGE MULTI-RESOLUTION END-TO-END MODEL FOR SINGING MELODY EXTRACTION | en_US
dc.type | Proceedings Paper | en_US
dc.identifier.journal | 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | en_US
dc.citation.spage | 1005 | en_US
dc.citation.epage | 1009 | en_US
dc.contributor.department | 電機工程學系 | zh_TW
dc.contributor.department | Department of Electrical and Computer Engineering | en_US
dc.identifier.wosnumber | WOS:000482554001047 | en_US
dc.citation.woscount | 0 | en_US
Appears in Collections: Conference Papers