標題: | Performance-based data distribution for data mining applications on grid computing environments |
作者: | Shih, Wen-Chung Yang, Chao-Tung Tseng, Shian-Shyong 資訊工程學系 Department of Computer Science |
關鍵字: | Heuristic data distribution scheme;Data mining;Grid computing;MPI |
公開日期: | 1-May-2010 |
摘要: | Effective data distribution techniques can significantly reduce the total execution time of a program on grid computing environments, especially for data mining applications. In this paper, we describe a linear programming formulation for the data distribution problem on grids. Furthermore, a heuristic method, named Heuristic Data Distribution Scheme (HDDS), is proposed to solve this problem. We implement two types of data mining applications, Association Rule Mining and Decision Tree Construction, and conduct experiments on grid testbeds. Experimental results show that data mining programs using the proposed HDDS to distribute data could execute more efficiently than traditional schemes could. |
URI: | http://dx.doi.org/10.1007/s11227-009-0286-5 http://hdl.handle.net/11536/5493 |
ISSN: | 0920-8542 |
DOI: | 10.1007/s11227-009-0286-5 |
期刊: | JOURNAL OF SUPERCOMPUTING |
Volume: | 52 |
Issue: | 2 |
起始頁: | 171 |
結束頁: | 198 |
Appears in Collections: | Articles |
Files in This Item:
If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.