| Title: | Spectro-temporal Modulation Based Singing Detection Combined with Pitch based Grouping for Singing Voice Separation |
| Authors: | Lin, Tse-En; Hsu, Chung-Chien; Chen, Yi-Cheng; Chen, Jian-Hueng; Chi, Tai-Shih; Department of Electrical and Computer Engineering |
| Keywords: | singing voice detection; singing voice separation; spectro-temporal modulation; pitch tracking |
| Issue Date: | 1-Jan-2013 |
| Abstract: | This paper proposes a spectro-temporal modulation based singing voice detector cascaded with a Viterbi-based pitch tracking algorithm for singing voice separation from monaural recordings. To detect the singing voice, the spectro-temporal modulation energy related to voice harmonics is extracted using a spectro-temporal modulation analysis framework developed for the Fourier spectrogram. The singing voice is separated from the background music using a binary mask that groups the estimated harmonics of the singing voice. The proposed system is evaluated on the MIR-1K dataset and is shown to outperform three other binary-mask based systems in the vocal/music separation task. |
| URI: | http://hdl.handle.net/11536/146416 |
| ISSN: | 2308-457X |
| Journal: | 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 |
| Start Page: | 2919 |
| End Page: | 2922 |
| Appears in Collections: | Conferences Paper |

