Title: Spectro-temporal Modulation Based Singing Detection Combined with Pitch based Grouping for Singing Voice Separation
Authors: Lin, Tse-En
Hsu, Chung-Chien
Chen, Yi-Cheng
Chen, Jian-Hueng
Chi, Tai-Shih
Department of Electrical and Computer Engineering
Keywords: singing voice detection; singing voice separation; spectro-temporal modulation; pitch tracking
Issue Date: 1-Jan-2013
Abstract: This paper proposes a spectro-temporal modulation based singing voice detector cascaded with a Viterbi based pitch tracking algorithm for singing voice separation from monaural recordings. To detect the singing voice, the spectro-temporal modulation energy related to voice harmonics is extracted using a spectro-temporal modulation analysis framework developed for the Fourier spectrogram. The singing voice is separated from the background music by using a binary mask to group the estimated harmonics of the singing voice. The proposed system is evaluated on the MIR-1K dataset and is shown to outperform three other binary-mask based systems in the vocal/music separation task.
URI: http://hdl.handle.net/11536/146416
ISSN: 2308-457X
Journal: 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5
Start Page: 2919
End Page: 2922
Appears in Collections: Conferences Paper
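
The abstract's final separation step, grouping the estimated harmonics of the singing voice with a binary mask, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `harmonic_binary_mask`, the tolerance parameter `tol_hz`, and the convention that a pitch of 0 marks a frame without singing are all assumptions made for illustration. The sketch also assumes that singing detection and pitch tracking have already produced one pitch value per frame.

```python
import numpy as np

def harmonic_binary_mask(f0_hz, n_fft, sample_rate, tol_hz=40.0):
    """Build a 0/1 mask of shape (n_fft // 2 + 1, n_frames).

    f0_hz: per-frame pitch in Hz; 0 means "no singing" (assumed convention).
    tol_hz: hypothetical half-width around each harmonic, in Hz.
    """
    f0_hz = np.asarray(f0_hz, dtype=float)
    # Centre frequency of every STFT bin from 0 Hz up to Nyquist.
    bin_freqs = np.arange(n_fft // 2 + 1) * sample_rate / n_fft
    nyquist = sample_rate / 2.0
    mask = np.zeros((bin_freqs.size, f0_hz.size), dtype=np.uint8)

    for t, f0 in enumerate(f0_hz):
        if f0 <= 0 or f0 > nyquist:
            continue  # unvoiced frame or unusable pitch: leave the column at 0
        # Harmonic frequencies f0, 2*f0, ... up to Nyquist.
        harmonics = f0 * np.arange(1, int(nyquist // f0) + 1)
        # Distance from each bin to its nearest harmonic.
        dist = np.abs(bin_freqs[:, None] - harmonics[None, :]).min(axis=1)
        mask[dist <= tol_hz, t] = 1
    return mask
```

Under these assumptions, multiplying a magnitude spectrogram of matching shape by `mask` keeps the bins attributed to the voice, and multiplying by `1 - mask` keeps the remainder as accompaniment.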