Enhancing Utilization of SIMD-Like Accelerator for Sparse Convolutional Neural Networks

doi:10.1109/TVLSI.2019.2897052

標題:	Enhancing Utilization of SIMD-Like Accelerator for Sparse Convolutional Neural Networks
作者:	Lai, Bo-Cheng Pan, Jyun-Wei Lin, Chien-Yu 電子工程學系及電子研究所 Department of Electronics Engineering and Institute of Electronics
關鍵字:	Load balance;machine learning;single-instruction-multiple-data (SIMD) architecture;sparse convolutional neural networks (CNNs)
公開日期:	1-May-2019
摘要:	Although the existing single-instruction-multiple-data-like (SIMD) accelerators can handle the compressed format of sparse convolutional neural networks, the sparse and irregular distributions of nonzero elements cause low utilization of multipliers in a processing engine (PE) and imbalanced computation between PEs. This brief addresses the above issues by proposing a data screening and task mapping (DSTM) accelerator which integrates a series of techniques, including software refinement and hardware modules. An efficient indexing module is introduced to identify the effectual computation pairs and skip unnecessary computation in a fine-grained manner. The intra-PE load imbalance is alleviated with weight data rearrangement. An effective task sharing mechanism further balances the computation between PEs. When compared with the state-of-the-art SIMD-like accelerator, the proposed DSTM enhances the average PE utilization by 3.5x. The overall processing throughput is 59.7% higher than the previous design.
URI:	http://dx.doi.org/10.1109/TVLSI.2019.2897052 http://hdl.handle.net/11536/152414
ISSN:	1063-8210
DOI:	10.1109/TVLSI.2019.2897052
期刊:	IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS
Volume:	27
Issue:	5
起始頁:	1218
結束頁:	1222
Appears in Collections:	Articles

APA	Lai, B., Pan, J., & Lin, C. (2019). Enhancing Utilization of SIMD-Like Accelerator for Sparse Convolutional Neural Networks. WOS:000466226400020.
Bibtex	@article{Lai2019Enhancing, title={Enhancing Utilization of SIMD-Like Accelerator for Sparse Convolutional Neural Networks}, author={Lai, Bo-Cheng and Pan, Jyun-Wei and Lin, Chien-Yu}, journal={WOS:000466226400020}, year={2019}, url={https://ir.lib.nycu.edu.tw/handle/11536/152414}, }