完整後設資料紀錄
DC 欄位語言
dc.contributor.authorKuo, Hsien-Kaien_US
dc.contributor.authorChen, Kuan-Tingen_US
dc.contributor.authorLai, Bo-Cheng Charlesen_US
dc.contributor.authorJou, Jing-Yangen_US
dc.date.accessioned2014-12-08T15:24:33Z-
dc.date.available2014-12-08T15:24:33Z-
dc.date.issued2012en_US
dc.identifier.isbn978-1-4673-0772-7en_US
dc.identifier.issn2153-6961en_US
dc.identifier.urihttp://hdl.handle.net/11536/17018-
dc.description.abstractMemory Coalescing and on-chip shared Cache are two effective techniques to alleviate the memory bottleneck in modern GPGPUs. These two techniques are very useful on applications with regular memory accesses. However, they become ineffective on concurrent threads with large numbers of uncoordinated accesses and the potential performance benefit could be significantly degraded. This paper proposes a thread affinity mapping methodology to coordinate the irregular data accesses on shared cache GPGPUs. Based on the proposed affinity metrics, threads are congregated into execution groups which are able to fully exploit the memory coalescing and data sharing within an application. An average of 3.5x runtime speedup is achieved on a Fermi GPGPU. The speedup scales with the sizes of test cases, which makes the proposed methodology an effective and promising solution for the continually increasing complexities of applications in the future many-core systems.en_US
dc.language.isoen_USen_US
dc.titleThread Affinity Mapping for Irregular Data Access on Shared Cache GPGPUen_US
dc.typeProceedings Paperen_US
dc.identifier.journal2012 17TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC)en_US
dc.citation.spage659en_US
dc.citation.epage664en_US
dc.contributor.department電子工程學系及電子研究所zh_TW
dc.contributor.departmentDepartment of Electronics Engineering and Institute of Electronicsen_US
dc.identifier.wosnumberWOS:000309240000119-
顯示於類別:會議論文