標題: Using knowledge-based systems for research on parallelizing compilers
作者: Yang, CT
Tseng, SS
Fann, YW
Tsai, TK
Hsieh, MH
Wu, CT
資訊工程學系
Department of Computer Science
關鍵字: parallelizing compiler;knowledge-based system;loop parallelization;multithreaded OS;program restructuring
公開日期: 1-Mar-2001
摘要: The main function of parallelizing compilers is to analyze sequential programs, in particular the loop structure, to detect hidden parallelism and automatically restructure sequential programs into parallel subtasks that are executed on a multiprocessor. This article describes the design and implementation of an efficient parallelizing compiler to parallelize loops and achieve high speedup rates on multiprocessor systems. It is well known that the execution efficiency of a loop can be enhanced if the loop is executed in parallel or partially parallel, such as in a DOALL or DOACROSS loop. This article also reviews a practical parallel loop detector (PPD) that is implemented in our PFPC on finding the parallelism in loops. The PPD can extract the potential DOALL and DOACROSS loops in a program by verifying array subscripts. In addition, a new model by using knowledge-based approach is proposed to exploit more loop parallelisms in this paper. The knowledge-based approach integrates existing loop transformations and loop scheduling algorithms to make good use of their ability to extract loop parallelisms. Two rule-based systems, called the KPLT and IPLS, are then developed using repertory grid analysis and attribute-ordering tables respectively, to construct the knowledge bases. These systems can choose an appropriate transform and loop schedule, and then apply the resulting methods to perform loop parallelization and obtain a high speedup rate. For example, the IPLS system can choose an appropriate loop schedule for running on multiprocessor systems. Finally, a runtime technique based on the inspector/executor scheme is proposed in this article for finding available parallelism on loops. Our inspector can determine the wavefronts of a loop with any complex indirected array-indexing pattern by building a DEF-USE table. The inspector is fully parallel without any synchronization. Experimental results show that the new method can resolve any complex data dependence patterns where no previous research can. One of the ultimate goals is to construct a high-performance and portable FORTRAN parallelizing compiler on shared-memory multiprocessors. We believe that our research may provide more insight into the development of a highperformance parallelizing compiler Copyright (C) 2001 John Wiley & Sons, Ltd.
URI: http://dx.doi.org/10.1002/cpe.563
http://hdl.handle.net/11536/29833
ISSN: 1532-0626
DOI: 10.1002/cpe.563
期刊: CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE
Volume: 13
Issue: 3
起始頁: 181
結束頁: 208
Appears in Collections:Articles


Files in This Item:

  1. 000168248400002.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.