Title: Stochastic Fusion for Multi-stream Neural Network in Video Classification
Authors: Huang, Yu-Min
Tseng, Huan-Hsin
Chien, Jen-Tzung
Department of Electrical and Computer Engineering
Issue Date: 1-Jan-2019
Abstract: Spatial image and optical flow provide complementary information for video representation and classification. Traditional methods separately encode two stream signals and then fuse them at the end of streams. This paper presents a new multi-stream recurrent neural network where streams are tightly coupled at each time step. Importantly, we propose a stochastic fusion mechanism for multiple streams of video data based on Gumbel samples to increase the prediction power. A stochastic backpropagation algorithm is implemented to carry out a multi-stream neural network with stochastic fusion based on a joint optimization of convolutional encoder and recurrent decoder. Experiments on the UCF101 dataset illustrate the merits of the proposed stochastic fusion in a recurrent neural network in terms of interpretation and classification performance.
URI: http://hdl.handle.net/11536/155267
ISBN: 978-1-7281-3248-8
ISSN: 2309-9402
Journal: 2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC)
Start page: 69
End page: 74
Appears in Collections: Conference Papers