完整後設資料紀錄
DC 欄位語言
dc.contributor.authorLi, Naen_US
dc.contributor.authorMak, Man-Waien_US
dc.contributor.authorChien, Jen-Tzungen_US
dc.date.accessioned2018-08-21T05:56:40Z-
dc.date.available2018-08-21T05:56:40Z-
dc.date.issued2016-01-01en_US
dc.identifier.urihttp://hdl.handle.net/11536/146479-
dc.description.abstractIn speaker recognition, the mismatch between the enrollment and test utterances due to noise with different signal-to-noise ratios (SNRs) is a great challenge. Based on the observation that noise-level variability causes the i-vectors to form heterogeneous clusters, this paper proposes using an SNR-aware deep neural network (DNN) to guide the training of PLDA mixture models. Specifically, given an i-vector, the SNR posterior probabilities produced by the DNN are used as the posteriors of indicator variables of the mixture model. As a result, the proposed model provides a more reasonable soft division of the i-vector space compared to the conventional mixture of PLDA. During verification, given a test trial, the marginal likelihoods from individual PLDA models are linearly combined by the posterior probabilities of SNR levels computed by the DNN. Experimental results for SNR mismatch tasks based on NIST 2012 SRE suggest that the proposed model is more effective than PLDA and conventional mixture of PLDA for handling heterogeneous corpora.en_US
dc.language.isoen_USen_US
dc.subjectSpeaker verificationen_US
dc.subjecti-vectoren_US
dc.subjectmixture of PLDAen_US
dc.subjectdeep neural networksen_US
dc.subjectSNR mismatchen_US
dc.titleDEEP NEURAL NETWORK DRIVEN MIXTURE OF PLDA FOR ROBUST I-VECTOR SPEAKER VERIFICATIONen_US
dc.typeProceedings Paperen_US
dc.identifier.journal2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016)en_US
dc.citation.spage186en_US
dc.citation.epage191en_US
dc.contributor.department電機工程學系zh_TW
dc.contributor.departmentDepartment of Electrical and Computer Engineeringen_US
dc.identifier.wosnumberWOS:000399128000027en_US
顯示於類別:會議論文