標題: | 多核心系統架構上針對OpenMP程式之電源管控方法 A Power-Control Mechanism for OpenMP Applications on Multi-core Machines |
作者: | 莊睦昂 游逸平 資訊科學與工程研究所 |
關鍵字: | OpenMP 程式;多核心系統;電源節省;電源管控;OpenMP;multi-core processors;energy reduction;power control |
公開日期: | 2012 |
摘要: | 多核心系統在現代科技的應用已越來越普遍,舉凡自小型的手持裝置、筆記型電腦、桌上型電腦甚至於大型資料中心皆已使用多核心系統架構。然而,在多核心系統中執行平行程式可能會因為程式平行化的程度的影響使得每個核心所被分配到的工作是不平均的,進而可能造成有部分的核心沒有被善加利用導致電源的浪費。
在此篇論文我們提出一電源管控方法,用以改善在多核心系統架構上執行OpenMP程式的整體系統之功率消耗。其方法主要是在平行迴圈(Parallel-for loop)中,考慮到多核心上的每個核心所被分配到的工作量的不同,將未善加利用的核心在盡量不影響效能以及計算結果的前提之下,以電源匣控(Power-Gating)技術關閉以降低整體系統之功率消耗。實驗結果顯示我們提出的電源管控方法分別對於Rodinia benchmark suite以及NAS Parallel Benchmarks裡的程式可以有效的減少平均6.5%以及3.9%的耗能。 Multi-core processors are now widely used in modern computing, from embedded to large- scale computing systems. The computing power of such systems is noticeably increased as more cores being integrated in a chip and multiple threads running in parallel. However, the multiple cores might not be utilized efficiently due to the degree of parallelism of work-loads; that is, the workloads distributed to these multiple cores might be imbalanced, which reveals an opportunity for reducing the energy consumption that is dissipated when a core is idle. In this thesis, we present a history-based OpenMP power-control mechanism, called OpenMP-HBPCM, to reduce the energy consumption of OpenMP applications that run on multi-core machines. The mechanism includes two major parts: (1) a compile-time instrumentation that inserts the codes for energy management into possible parallel-for loops and (2) an OpenMP power-control library that decides whether to apply a power control to a core upon a history-based prediction. The experimental results demonstrate that the proposed history-based OpenMP power-control mechanism reduced energy consumption with only little performance degradation. The geometric average reduction in the energy consumption is 6.5% for the Rodinia benchmark suite (five selected benchmarks), and 3.9% for NAS Parallel Benchmarks, while the geometric average of performance degradation is only 0.7% for the Rodinia benchmark suite, and 1.4% for NAS Parallel Benchmarks. |
URI: | http://140.113.39.130/cdrfb3/record/nctu/#GT079955592 http://hdl.handle.net/11536/50499 |
Appears in Collections: | Thesis |