Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Luo, Yangchun | en_US |
dc.contributor.author | Hsu, Wei-Chung | en_US |
dc.contributor.author | Zhai, Antonia | en_US |
dc.date.accessioned | 2014-12-08T15:34:39Z | - |
dc.date.available | 2014-12-08T15:34:39Z | - |
dc.date.issued | 2013-12-01 | en_US |
dc.identifier.issn | 1544-3566 | en_US |
dc.identifier.uri | http://dx.doi.org/10.1145/2541228.2541233 | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/23647 | - |
dc.description.abstract | With the emergence of multicore processors, various aggressive execution models have been proposed to exploit fine-grained thread-level parallelism, taking advantage of the fast on-chip interconnection communication. However, the aggressive nature of these execution models often leads to excessive energy consumption incommensurate to execution time reduction. In the context of Thread-Level Speculation, we demonstrated that on a same-ISA heterogeneous multicore system, by dynamically deciding how on-chip resources are utilized, speculative threads can achieve performance gain in an energy-efficient way. Through a systematic design space exploration, we built a multicore architecture that integrates heterogeneous components of processing cores and first-level caches. To cope with processor reconfiguration overheads, we introduced runtime mechanisms to mitigate their impacts. To match program execution with the most energy-efficient processor configuration, the system was equipped with a dynamic resource allocation scheme that characterizes program behaviors using novel processor counters. We evaluated the proposed heterogeneous system with a diverse set of benchmark programs from SPEC CPU2000 and CPU20006 suites. Compared to the most efficient homogeneous TLS implementation, we achieved similar performance but consumed 18% less energy. Compared to the most efficient homogeneous uniprocessor running sequential programs, we improved performance by 29% and reduced energy consumption by 3.6%, which is a 42% improvement in energy-delay-squared product. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | Thread-Level Speculation | en_US |
dc.subject | energy efficiency | en_US |
dc.subject | heterogeneous multicore | en_US |
dc.subject | dynamic resource allocation | en_US |
dc.title | The Design and Implementation of Heterogeneous Multicore Systems for Energy-efficient Speculative Thread Execution | en_US |
dc.type | Article | en_US |
dc.identifier.doi | 10.1145/2541228.2541233 | en_US |
dc.identifier.journal | ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION | en_US |
dc.citation.volume | 10 | en_US |
dc.citation.issue | 4 | en_US |
dc.citation.epage | en_US | |
dc.contributor.department | 交大名義發表 | zh_TW |
dc.contributor.department | National Chiao Tung University | en_US |
dc.identifier.wosnumber | WOS:000330509300005 | - |
dc.citation.woscount | 0 | - |
Appears in Collections: | Articles |
Files in This Item:
If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.