标题: Spectro-temporal Modulation Based Singing Detection Combined with Pitch based Grouping for Singing Voice Separation
作者: Lin, Tse-En
Hsu, Chung-Chien
Chen, Yi-Cheng
Chen, Jian-Hueng
Chi, Tai-Shih
电机工程学系
Department of Electrical and Computer Engineering
关键字: singing voice detection;singing voice separation;spectro-temporal modulation;pitch tracking
公开日期: 1-一月-2013
摘要: A spectro-temporal modulation based singing voice detection cascaded with a Viterbi based pitch tracking algorithm is proposed in this paper for singing-voice separation from monaural recordings. To detect the singing voice, the spectrotemporal modulation energy related to voice harmonics is extracted using a spectro-temporal modulation analysis framework developed for the Fourier spectrogram. Separation of singing -voice from background music is conducted using a binary mask to group estimated harmonics of singing voice. The proposed system is evaluated using MIR-1K dataset and is shown outperforming three other binary-mask based systems in the vocal/music separation task.
URI: http://hdl.handle.net/11536/146416
ISSN: 2308-457X
期刊: 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5
起始页: 2919
结束页: 2922
显示于类别:Conferences Paper