MARKOV RECURRENT NEURAL NETWORKS

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	Kuo, Che-Yu	en_US
dc.contributor.author	Chien, Jen-Tzung	en_US
dc.date.accessioned	2019-04-02T06:04:19Z	-
dc.date.available	2019-04-02T06:04:19Z	-
dc.date.issued	2018-01-01	en_US
dc.identifier.issn	2161-0363	en_US
dc.identifier.uri	http://hdl.handle.net/11536/150841	-
dc.description.abstract	Deep learning has achieved great success in many real-world applications. For speech and language processing, recurrent neural networks are learned to characterize sequential patterns and extract the temporal information based on dynamic states which are evolved through time and stored as an internal memory. Traditionally, simple transition function using input-to-hidden and hidden-to-hidden weights is insufficient. To strengthen the learning capability, it is crucial to explore the diversity of latent structure in sequential signals and learn the stochastic trajectory of signal transitions to improve sequential prediction. This paper proposes the stochastic modeling of transitions in deep sequential learning. Our idea is to enhance latent variable representation by discovering the Markov state transitions in sequential data based on a K-state long short-term memory (LSTM) model. Such a latent state machine is capable of learning the complicated latent semantics in highly structured and heterogeneous sequential data. Gumbel-softmax is introduced to implement stochastic learning procedure with discrete states. Experimental results on visual and text language modeling illustrate the merit of the proposed stochastic transitions in sequential prediction with limited amount of parameters.	en_US
dc.language.iso	en_US	en_US
dc.subject	Deep learning	en_US
dc.subject	recurrent neural network	en_US
dc.subject	stochastic transition	en_US
dc.subject	discrete latent structure	en_US
dc.title	MARKOV RECURRENT NEURAL NETWORKS	en_US
dc.type	Proceedings Paper	en_US
dc.identifier.journal	2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP)	en_US
dc.contributor.department	電機工程學系	zh_TW
dc.contributor.department	Department of Electrical and Computer Engineering	en_US
dc.identifier.wosnumber	WOS:000450651000063	en_US
dc.citation.woscount	0	en_US
顯示於類別：	會議論文