Title: Scheduling MapReduce Tasks on Virtual MapReduce Clusters from a Tenant's Perspective
Authors: Lin, Jia-Chun
Lee, Ming-Chang
Yahyapour, Ramin
Department of Computer Science
Keywords: MapReduce; virtual MapReduce cluster; data locality; map-task scheduling; reduce-task scheduling
Issue Date: 2014
Abstract: Renting a set of virtual private servers (VPSs for short) from a VPS provider to establish a virtual MapReduce cluster is cost-efficient for a company or organization. To shorten job turnaround time and keep data locality as high as possible in this type of environment, this paper proposes a Best-Fit Task Scheduling scheme (BFTS for short) from a tenant's perspective. BFTS schedules each map task to the VPS that can finish the task earlier than the other VPSs; it does so by predicting and comparing, in an online manner, the time every VPS requires to retrieve the map-input data, execute the map task, and become idle. Furthermore, BFTS schedules each reduce task to a VPS that is close to most of the VPSs that execute the related map tasks. We conduct extensive experiments to compare BFTS with several scheduling algorithms employed by Hadoop. The experimental results show that BFTS outperforms the other tested algorithms in terms of map-data locality, reduce-data locality, and job turnaround time. The overhead incurred by BFTS is also evaluated; it is inevitable but acceptable compared with that of the other algorithms.
URI: http://hdl.handle.net/11536/136136
ISBN: 978-1-4799-5666-1
Journal: 2014 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA)
Start Page: 141
End Page: 146
Appears in Collections: Conferences Paper
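
The two scheduling rules summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the `Vps` fields, the `zone` label used as a stand-in for "closeness", and the fetch costs in `fetch_time` are all hypothetical values chosen for illustration. The map rule picks the VPS with the smallest predicted sum of time-until-idle, input-retrieval time, and execution time; the reduce rule picks the VPS that shares a zone with the most map-side VPSs.

```python
from dataclasses import dataclass


@dataclass
class Vps:
    name: str
    zone: str          # hypothetical locality label (assumption, not from the paper)
    idle_in: float     # predicted seconds until this VPS becomes idle
    exec_time: float   # predicted seconds to execute one map task


def fetch_time(vps, data_holder, local=0.0, same_zone=2.0, remote=6.0):
    """Assumed cost in seconds for `vps` to retrieve input stored on `data_holder`."""
    if vps.name == data_holder.name:
        return local
    return same_zone if vps.zone == data_holder.zone else remote


def pick_map_vps(vpss, data_holder):
    """Return the VPS with the earliest predicted finish time for the map task."""
    return min(
        vpss,
        key=lambda v: v.idle_in + fetch_time(v, data_holder) + v.exec_time,
    )


def pick_reduce_vps(vpss, map_vpss):
    """Return the VPS sharing a zone with the most VPSs that ran the map tasks."""
    return max(vpss, key=lambda v: sum(m.zone == v.zone for m in map_vpss))


# Illustrative cluster: the input data lives on VPS "a", which is busy.
a = Vps("a", "z1", idle_in=10.0, exec_time=5.0)
b = Vps("b", "z1", idle_in=0.0, exec_time=5.0)
c = Vps("c", "z2", idle_in=0.0, exec_time=5.0)
cluster = [a, b, c]

map_choice = pick_map_vps(cluster, data_holder=a)
reduce_choice = pick_reduce_vps(cluster, map_vpss=[a, b, c])
print(map_choice.name, reduce_choice.name)
```

In this toy setup the data-local VPS `a` is not chosen for the map task because it stays busy for 10 seconds, so the idle same-zone VPS `b` has the earlier predicted finish time. This reflects the trade-off the abstract describes between data locality and turnaround time.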