標題: | A HYBRID NEURAL NETWORK BASED ON THE DUPLEX MODEL OF PITCH PERCEPTION FOR SINGING MELODY EXTRACTION |
作者: | Chou, Hsin Chen, Ming-Tso Chi, Tai-Shih 電機工程學系 Department of Electrical and Computer Engineering |
關鍵字: | pitch perception;duplex model;melody extraction;deep neural network;CNN |
公開日期: | 1-一月-2018 |
摘要: | In this paper, we build up a hybrid neural network (NN) for singing melody extraction from polyphonic music by imitating human pitch perception. For human hearing, there are two pitch perception models, the spectral model and the temporal model, in accordance with whether harmonics are resolved or not. Here, we first use NNs to implement individual models and evaluate their performance in the task of singing melody extraction. Then, we combine the NNs to constitute the composite NN to simulate the duplex model, which complements the pitch perception from unresolved harmonics of the spectral model using the temporal model. Simulation results show the proposed composite NN outperforms other conventional methods in singing melody extraction. |
URI: | http://hdl.handle.net/11536/150759 |
期刊: | 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) |
起始頁: | 381 |
結束頁: | 385 |
顯示於類別: | 會議論文 |