標題: Exploring redundancy of HRTFs for fast training DNN-based HRTF personalization
作者: Chen, Tzu-Yu
Hsiao, Po-Wen
Chi, Tai-Shih
電機工程學系
Department of Electrical and Computer Engineering
公開日期: 1-Jan-2018
摘要: A deep neural network (DNN) is constructed to predict the magnitude responses of the head-related transfer functions (HRTFs) of users for a specific direction and a specific ear. Using the CIPIC HRTF database (including 25 azimuth angles and 50 elevation angles for both ears), we trained 2500 DNNs to predict magnitude responses of all HRTFs of a user. To reduce training time, we propose to use the final weights of the trained DNN of a nearby direction as the initial weights of the current DNN under training since magnitude responses of the HRTFs are smoothly changing across nearby directions. Analysis of variance (ANOVA) was performed to show that the proposed training scheme produces equivalent magnitude responses of HRTFs as the standard training scheme with random initial weights in terms of the log-spectral distortion (LSD) measure. Meanwhile, the proposed training scheme can dramatically reduce training time by more than 95%.
URI: http://hdl.handle.net/11536/152431
ISBN: 978-9-8814-7685-2
ISSN: 2309-9402
期刊: 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC)
起始頁: 1929
結束頁: 1933
Appears in Collections:Conferences Paper