Development of Big Data Multi-VM Platform for Rapid Prototyping of Distributed Deep Learning

doi:10.1007/978-3-319-94301-5_14

標題:	Development of Big Data Multi-VM Platform for Rapid Prototyping of Distributed Deep Learning
作者:	Wu, Chien-Heng Chuang, Chiao-Ning Chang, Wen-Yi Tsai, Whey-Fone 交大名義發表 National Chiao Tung University
關鍵字:	Big data multi-VM platform;Deep learning application Spark;In-memory computing;Hadoop Map/Reduce
公開日期:	1-一月-2018
摘要:	The present study utilizes VirtualBox virtual environment technology to develop the personal big data multi-VM platform with four-node Spark and Hadoop cluster that can effectively replicate and provide an environment for developers to easily design and implement the Spark and Hadoop Map/Reduce programming. Before running their Big Data and deep learning applications in physical multi-node Spark and Hadoop Cluster, developers can conduct Map/Reduce programing simply on the proposed multi-VM platform, which is exactly the same as the physical one. To demonstrate its capability and applicability, this study utilizes the deep learning application as an example for function illustration. In this study, the big data multi-VM platform provides the rapid prototyping of distributed deep learning by using a cutting-edge framework TensorFlowOnSpark (TFoS) for AI developers. To look into deep insight, this study performs the deep-learning benchmark in different types of cluster systems including the multi-node big data VM platform, physical standalone system and the physical small-cluster system. The results indicate that InputMode. SPARK can get 3.3 times faster than InputMode. TENSORFLOW on the big data VM platform and even achieve 6.1 times faster on the physical server.
URI:	http://dx.doi.org/10.1007/978-3-319-94301-5_14 http://hdl.handle.net/11536/150723
ISSN:	0302-9743
DOI:	10.1007/978-3-319-94301-5_14
期刊:	BIG DATA - BIGDATA 2018
Volume:	10968
起始頁:	182
結束頁:	193
顯示於類別：	會議論文