Title: | Development of Big Data Multi-VM Platform for Rapid Prototyping of Distributed Deep Learning |
Authors: | Wu, Chien-Heng; Chuang, Chiao-Ning; Chang, Wen-Yi; Tsai, Whey-Fone; Published under the NCTU name; National Chiao Tung University |
Keywords: | Big data multi-VM platform; Deep learning application Spark; In-memory computing; Hadoop Map/Reduce |
Issue Date: | 1-Jan-2018 |
Abstract: | This study uses VirtualBox virtualization to develop a personal big data multi-VM platform with a four-node Spark and Hadoop cluster, giving developers an environment in which they can easily design and implement Spark and Hadoop Map/Reduce programs. Before running their big data and deep learning applications on a physical multi-node Spark and Hadoop cluster, developers can do their Map/Reduce programming on the proposed multi-VM platform, which is exactly the same as the physical one. To demonstrate the platform's capability and applicability, the study uses a deep learning application as an illustrative example. The big data multi-VM platform supports rapid prototyping of distributed deep learning for AI developers through the TensorFlowOnSpark (TFoS) framework. For deeper insight, the study runs a deep learning benchmark on different types of cluster systems: the multi-node big data VM platform, a physical standalone system, and a physical small-cluster system. The results indicate that InputMode.SPARK is 3.3 times faster than InputMode.TENSORFLOW on the big data VM platform and 6.1 times faster on the physical server. |
URI: | http://dx.doi.org/10.1007/978-3-319-94301-5_14 http://hdl.handle.net/11536/150723 |
ISSN: | 0302-9743 |
DOI: | 10.1007/978-3-319-94301-5_14 |
Journal: | BIG DATA - BIGDATA 2018 |
Volume: | 10968 |
Start page: | 182 |
End page: | 193 |
Appears in Collections: | Conference Papers |