標題: | Spectro-temporal Modulation Based Singing Detection Combined with Pitch based Grouping for Singing Voice Separation |
作者: | Lin, Tse-En Hsu, Chung-Chien Chen, Yi-Cheng Chen, Jian-Hueng Chi, Tai-Shih 電機工程學系 Department of Electrical and Computer Engineering |
關鍵字: | singing voice detection;singing voice separation;spectro-temporal modulation;pitch tracking |
公開日期: | 1-Jan-2013 |
摘要: | A spectro-temporal modulation based singing voice detection cascaded with a Viterbi based pitch tracking algorithm is proposed in this paper for singing-voice separation from monaural recordings. To detect the singing voice, the spectrotemporal modulation energy related to voice harmonics is extracted using a spectro-temporal modulation analysis framework developed for the Fourier spectrogram. Separation of singing -voice from background music is conducted using a binary mask to group estimated harmonics of singing voice. The proposed system is evaluated using MIR-1K dataset and is shown outperforming three other binary-mask based systems in the vocal/music separation task. |
URI: | http://hdl.handle.net/11536/146416 |
ISSN: | 2308-457X |
期刊: | 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 |
起始頁: | 2919 |
結束頁: | 2922 |
Appears in Collections: | Conferences Paper |