完整後設資料紀錄
DC 欄位語言
dc.contributor.authorChi, Tai-Shihen_US
dc.contributor.authorLin, Ting-Hanen_US
dc.contributor.authorHsu, Chung-Chienen_US
dc.date.accessioned2014-12-08T15:23:15Z-
dc.date.available2014-12-08T15:23:15Z-
dc.date.issued2012-05-01en_US
dc.identifier.issn0001-4966en_US
dc.identifier.urihttp://hdl.handle.net/11536/16329-
dc.description.abstractSpectro-temporal modulations of speech encode speech structures and speaker characteristics. An algorithm which distinguishes speech from non-speech based on spectro-temporal modulation energies is proposed and evaluated in robust text-independent closed-set speaker identification simulations using the TIMIT and GRID corpora. Simulation results show the proposed method produces much higher speaker identification rates in all signal-to-noise ratio (SNR) conditions than the baseline system using mel-frequency cepstral coefficients. In addition, the proposed method also outperforms the system, which uses auditory-based nonnegative tensor cepstral coefficients [Q. Wu and L. Zhang, "Auditory sparse representation for robust speaker recognition based on tensor structure," EURASIP J. Audio, Speech, Music Process. 2008, 578612 (2008)], in low SNR (<= 10 dB) conditions. (C) 2012 Acoustical Society of Americaen_US
dc.language.isoen_USen_US
dc.titleSpectro-temporal modulation energy based mask for robust speaker identificationen_US
dc.typeArticleen_US
dc.identifier.journalJOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICAen_US
dc.citation.volume131en_US
dc.citation.issue5en_US
dc.citation.epageEL368en_US
dc.contributor.department電機工程學系zh_TW
dc.contributor.departmentDepartment of Electrical and Computer Engineeringen_US
dc.identifier.wosnumberWOS:000303601600003-
dc.citation.woscount0-
顯示於類別:期刊論文


文件中的檔案:

  1. 000303601600003.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。