Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lin, Weiwei | en_US |
dc.contributor.author | Mak, Man-Wai | en_US |
dc.contributor.author | Tu, Youzhi | en_US |
dc.contributor.author | Chien, Jen-Tzung | en_US |
dc.date.accessioned | 2019-10-05T00:09:44Z | - |
dc.date.available | 2019-10-05T00:09:44Z | - |
dc.date.issued | 2019-01-01 | en_US |
dc.identifier.isbn | 978-1-4799-8131-1 | en_US |
dc.identifier.issn | 1520-6149 | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/152933 | - |
dc.description.abstract | How to overcome the training and test data mismatch in speaker verification systems has been a focus of research recently. In this paper, we propose a semi-supervised nuisance attribute network ( SNAN) to reduce the domain mismatch in i-vectors and x-vectors. SNANs are based on the idea of nuisance attribute removal in inter-dataset variability compensation ( IDVC). But instead of measuring the domain variability through the dataset means, SNANs use the maximum mean discrepancy ( MMD) as part of their loss function, which enables the network to find nuisance directions in which domain variability is measured up to infinite moment. The architecture of SNANs also allows us to incorporate the out-of-domain speaker labels into the semi-supervised training process through the center loss and triplet loss. Using SNANs as a preprocessing step for PLDA training, we achieve a relative improvement of 11.8% in EER on NIST 2016 SRE compared to PLDA without adaptation. We also found that the semi-supervised approach can further improve SNANs' performance. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | Speaker verification | en_US |
dc.subject | x-vectors | en_US |
dc.subject | i-vectors | en_US |
dc.subject | domain adaptation | en_US |
dc.subject | maximum mean discrepancy | en_US |
dc.title | SEMI-SUPERVISED NUISANCE-ATTRIBUTE NETWORKS FOR DOMAIN ADAPTATION | en_US |
dc.type | Proceedings Paper | en_US |
dc.identifier.journal | 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | en_US |
dc.citation.spage | 6236 | en_US |
dc.citation.epage | 6240 | en_US |
dc.contributor.department | 電機工程學系 | zh_TW |
dc.contributor.department | Department of Electrical and Computer Engineering | en_US |
dc.identifier.wosnumber | WOS:000482554006093 | en_US |
dc.citation.woscount | 0 | en_US |
Appears in Collections: | Conferences Paper |