Title: Analyzing job completion reliability and job energy consumption for a heterogeneous MapReduce cluster under different intermediate-data replication policies
Authors: Lin, Jia-Chun
Leu, Fang-Yie
Chen, Ying-Ping
Department of Computer Science
Keywords: MapReduce; Hadoop; Job completion reliability; Job energy consumption; Intermediate data; Replication
Issue Date: 1-May-2015
Abstract: MapReduce has recently become a popular distributed programming framework for data-intensive applications. However, a large-scale MapReduce cluster inevitably experiences machine/node failures and consumes considerable energy. To address these problems, MapReduce employs several policies for replicating input data, storing/replicating intermediate data, and re-executing failed tasks. In this study, we concentrate on two typical policies for storing/replicating intermediate data, and derive the job completion reliability (JCR) and job energy consumption (JEC) of a MapReduce cluster when each policy is employed individually. The two policies are further analyzed and compared under various scenarios in which jobs with different input data sizes, numbers of reduce tasks, and other parameters run in a MapReduce cluster with two extreme parallel execution capabilities. From the analytical results, MapReduce managers can understand how the two policies influence the JCR and JEC of a MapReduce cluster.
URI: http://dx.doi.org/10.1007/s11227-014-1286-7
http://hdl.handle.net/11536/124654
ISSN: 0920-8542
DOI: 10.1007/s11227-014-1286-7
Journal: Journal of Supercomputing
Volume: 71
Begin Page: 1657
End Page: 1677
Appears in Collections: Articles