REINFORCEMENT LEARNING BASED SPEECH ENHANCEMENT FOR ROBUST SPEECH RECOGNITION

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	Shen, Yih-Liang	en_US
dc.contributor.author	Huang, Chao-Yuan	en_US
dc.contributor.author	Wang, Syu-Siang	en_US
dc.contributor.author	Tsao, Yu	en_US
dc.contributor.author	Wang, Hsin-Min	en_US
dc.contributor.author	Chi, Tai-Shih	en_US
dc.date.accessioned	2019-10-05T00:09:44Z	-
dc.date.available	2019-10-05T00:09:44Z	-
dc.date.issued	2019-01-01	en_US
dc.identifier.isbn	978-1-4799-8131-1	en_US
dc.identifier.issn	1520-6149	en_US
dc.identifier.uri	http://hdl.handle.net/11536/152934	-
dc.description.abstract	Conventional deep neural network (DNN)-based speech enhancement (SE) approaches aim to minimize the mean square error (MSE) between enhanced speech and clean reference. The MSE-optimized model may not directly improve the performance of an automatic speech recognition (ASR) system. If the target is to minimize the recognition error, the recognition results should be used to design the objective function for optimizing the SE model. However, the structure of an ASR system, which consists of multiple units, such as acoustic and language models, is usually complex and not differentiable. In this study, we propose to adopt the reinforcement learning (RL) algorithm to optimize the SE model based on the recognition results. We evaluated the proposed RL-based SE system on the Mandarin Chinese broadcast news corpus (MATBN). Experimental results demonstrate that the proposed SE system can effectively improve the ASR results with a notable 12 : 40% and 19 : 23% error rate reductions for signal to noise ratio (SNR) at 0 dB and 5 dB conditions, respectively.	en_US
dc.language.iso	en_US	en_US
dc.subject	reinforcement learning	en_US
dc.subject	automatic speech recognition	en_US
dc.subject	speech enhancement	en_US
dc.subject	deep neural network	en_US
dc.subject	character error rate	en_US
dc.title	REINFORCEMENT LEARNING BASED SPEECH ENHANCEMENT FOR ROBUST SPEECH RECOGNITION	en_US
dc.type	Proceedings Paper	en_US
dc.identifier.journal	2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)	en_US
dc.citation.spage	6750	en_US
dc.citation.epage	6754	en_US
dc.contributor.department	電機工程學系	zh_TW
dc.contributor.department	Department of Electrical and Computer Engineering	en_US
dc.identifier.wosnumber	WOS:000482554006196	en_US
dc.citation.woscount	0	en_US
顯示於類別：	會議論文