完整後設資料紀錄
DC 欄位語言
dc.contributor.authorChien, Jen-Tzungen_US
dc.contributor.authorHuang, Pei-Wenen_US
dc.contributor.authorLee, Tanen_US
dc.date.accessioned2018-08-21T05:56:52Z-
dc.date.available2018-08-21T05:56:52Z-
dc.date.issued2016-01-01en_US
dc.identifier.issn2308-457Xen_US
dc.identifier.urihttp://dx.doi.org/10.21437/Interspeech.2016-192en_US
dc.identifier.urihttp://hdl.handle.net/11536/146774-
dc.description.abstractOptimization procedure is crucial to achieve desirable performance for speech recognition based on deep neural networks (DNNs). Conventionally, DNNs are trained by using mini-batch stochastic gradient descent (SGD) which is stable but prone to be trapped into local optimum. A recent work based on Nesterov's accelerated gradient descent (NAG) algorithm is developed by merging the current momentum information into correction of SGD updating. NAG less likely jumps into local minimum so that convergence rate is improved. In general, optimization based on SGD is more stable while that based on NAG is faster and more accurate. This study aims to boost the performance of speech recognition by combining complimentary SGD and NAG. A new hybrid optimization is proposed by integrating the SGD with momentum and the NAG by using an interpolation scheme which is continuously run in each mini-batch according to the change rate of cost function in consecutive two learning epochs. Tradeoff between two algorithms can be balanced for mini-batch optimization. Experiments on speech recognition using CUSENT and Aurora-4 show the effectiveness of the hybrid accelerated optimization in DNN acoustic model.en_US
dc.language.isoen_USen_US
dc.subjecthybrid optimizationen_US
dc.subjectstochastic gradient descenten_US
dc.subjectdeep neural networken_US
dc.subjectspeech recognitionen_US
dc.titleHybrid Accelerated Optimization for Speech Recognitionen_US
dc.typeProceedings Paperen_US
dc.identifier.doi10.21437/Interspeech.2016-192en_US
dc.identifier.journal17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINESen_US
dc.citation.spage3399en_US
dc.citation.epage3403en_US
dc.contributor.department電機工程學系zh_TW
dc.contributor.departmentDepartment of Electrical and Computer Engineeringen_US
dc.identifier.wosnumberWOS:000409394402061en_US
顯示於類別:會議論文