標題: Performance-based data distribution for data mining applications on grid computing environments
作者: Shih, Wen-Chung
Yang, Chao-Tung
Tseng, Shian-Shyong
資訊工程學系
Department of Computer Science
關鍵字: Heuristic data distribution scheme;Data mining;Grid computing;MPI
公開日期: 1-May-2010
摘要: Effective data distribution techniques can significantly reduce the total execution time of a program on grid computing environments, especially for data mining applications. In this paper, we describe a linear programming formulation for the data distribution problem on grids. Furthermore, a heuristic method, named Heuristic Data Distribution Scheme (HDDS), is proposed to solve this problem. We implement two types of data mining applications, Association Rule Mining and Decision Tree Construction, and conduct experiments on grid testbeds. Experimental results show that data mining programs using the proposed HDDS to distribute data could execute more efficiently than traditional schemes could.
URI: http://dx.doi.org/10.1007/s11227-009-0286-5
http://hdl.handle.net/11536/5493
ISSN: 0920-8542
DOI: 10.1007/s11227-009-0286-5
期刊: JOURNAL OF SUPERCOMPUTING
Volume: 52
Issue: 2
起始頁: 171
結束頁: 198
Appears in Collections:Articles


Files in This Item:

  1. 000276498300005.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.