標題: Hierarchical loop scheduling for clustered NUMA machines
作者: Wang, YM
Wang, HH
Chang, RC
資訊工程學系
Department of Computer Science
公開日期: 1-Dec-2000
摘要: Loop scheduling is an important issue in the development of high performance multiprocessors. As modern multiprocessors have high and non-uniform memory access (NUMA) costs, the communication costs dominate the execution of parallel programs. Previous affinity algorithms perform better than dynamic algorithms under non-clustered NUMA multiprocessors, but they suffer heavy overheads when migrating work load under clustered NUMA machines. In this paper, we propose a new loop scheduling policy, hierarchical policy, to improve Various affinity scheduling algorithms (AFSs) for clustered NUMA machines. We cyclically distribute the iteration chunks to clusters. When imbalance occurs, the migration of iterations is carried on hierarchically. We use hierarchical policy to improve AFS and modified AFS (MAFS), and we call them Hierarchical AFS (HAFS) and Hierarchical MAFS (HMAFS), respectively. AFS uses a deterministic assignment policy to assign repeated executions of loop iteration to the same processor. MAFS modifies the migration policy of AFS, and reduces the number of synchronization operations. We confirm our idea by running many applications under a clustered NUMA simulator. Our experimental result shows that hierarchical policy reduces the inter-cluster remote memory accesses, decreases the locks to the queues, and effectively balances the work load. We also show that HMAFS is the best choice among these algorithms in most cases. (C) 2000 Elsevier Science Inc. All rights reserved.
URI: http://dx.doi.org/10.1016/S0164-1212(00)00045-5
http://hdl.handle.net/11536/30071
ISSN: 0164-1212
DOI: 10.1016/S0164-1212(00)00045-5
期刊: JOURNAL OF SYSTEMS AND SOFTWARE
Volume: 55
Issue: 1
起始頁: 33
結束頁: 44
Appears in Collections:Articles


Files in This Item:

  1. 000165976600004.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.