The Design and Implementation of Heterogeneous Multicore Systems for Energy-efficient Speculative Thread Execution

dc.citation.epageen_US
dc.citation.issue4en_US
dc.citation.volume10en_US
dc.citation.woscount0
dc.contributor.authorLuo, Yangchunen_US
dc.contributor.authorHsu, Wei-Chungen_US
dc.contributor.authorZhai, Antoniaen_US
dc.contributor.department交大名義發表zh_TW
dc.contributor.departmentNational Chiao Tung Universityen_US
dc.date.accessioned2014-12-08T15:34:39Z
dc.date.available2014-12-08T15:34:39Z
dc.date.issued2013-12-01en_US
dc.description.abstractWith the emergence of multicore processors, various aggressive execution models have been proposed to exploit fine-grained thread-level parallelism, taking advantage of the fast on-chip interconnection communication. However, the aggressive nature of these execution models often leads to excessive energy consumption incommensurate to execution time reduction. In the context of Thread-Level Speculation, we demonstrated that on a same-ISA heterogeneous multicore system, by dynamically deciding how on-chip resources are utilized, speculative threads can achieve performance gain in an energy-efficient way. Through a systematic design space exploration, we built a multicore architecture that integrates heterogeneous components of processing cores and first-level caches. To cope with processor reconfiguration overheads, we introduced runtime mechanisms to mitigate their impacts. To match program execution with the most energy-efficient processor configuration, the system was equipped with a dynamic resource allocation scheme that characterizes program behaviors using novel processor counters. We evaluated the proposed heterogeneous system with a diverse set of benchmark programs from SPEC CPU2000 and CPU20006 suites. Compared to the most efficient homogeneous TLS implementation, we achieved similar performance but consumed 18% less energy. Compared to the most efficient homogeneous uniprocessor running sequential programs, we improved performance by 29% and reduced energy consumption by 3.6%, which is a 42% improvement in energy-delay-squared product.en_US
dc.identifier.doi10.1145/2541228.2541233en_US
dc.identifier.issn1544-3566en_US
dc.identifier.journalACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATIONen_US
dc.identifier.urihttp://dx.doi.org/10.1145/2541228.2541233en_US
dc.identifier.urihttps://ir.lib.nycu.edu.tw/handle/11536/23647
dc.identifier.wosnumberWOS:000330509300005
dc.language.isoen_USen_US
dc.subjectThread-Level Speculationen_US
dc.subjectenergy efficiencyen_US
dc.subjectheterogeneous multicoreen_US
dc.subjectdynamic resource allocationen_US
dc.titleThe Design and Implementation of Heterogeneous Multicore Systems for Energy-efficient Speculative Thread Executionen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
000330509300005.pdf
Size:
1.74 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: