Full metadata record
DC FieldValueLanguage
dc.contributor.authorHsu, Chia-Chenen_US
dc.contributor.authorLin, Cheng-Yenen_US
dc.contributor.authorChen, Shin Kaien_US
dc.contributor.authorLiu, Chih-Weien_US
dc.contributor.authorLee, Jenq-Kuenen_US
dc.date.accessioned2015-12-02T03:00:54Z-
dc.date.available2015-12-02T03:00:54Z-
dc.date.issued2014-01-01en_US
dc.identifier.isbn978-1-4799-6307-2en_US
dc.identifier.issn2325-1271en_US
dc.identifier.urihttp://hdl.handle.net/11536/128539-
dc.description.abstractHeterogeneous multi-core systems that contain multiple CPUs and GPUs are gaining momentum, as they are providing different computation power to meet the performance demand of modern applications. On such systems, developers try to fully utilize the computation power both for CPU and GPU by using the emerging programming models such as CUDA and OpenCL. To achieve the maximal performance, developers must carefully offload the appropriate workload to the compute devices according to the characteristics of target architecture. Under such scenario, seamlessly data motion between different processors become crucial. Additionally, re-organizing the data layout to fit the target architectures, such as array-of-structure (AOS) for CPU, structure-of-array (SOA) for GPU, and coordinate (COO) format to ELLPACK (ELL) for sparse computation, address such concern. In this paper, we propose a hardware memory manager, which efficiently optimizes the conversion of data layouts for heterogeneous multi-core systems on-the-fly. We address coalescing and sparse format conversion issue in our design. A novel ping-pong transpose architecture is devised to reorganize non-coalescing access pattern, and a histogram unit and sparse address generator are presented to process sparse storage format transformation. Our design reduces the overhead of data transfer and layout transformation among CPU and GPU. In our experiment, our design achieves 68.5 to 2.19 times speed up comparing to software-based library depending on data size.en_US
dc.language.isoen_USen_US
dc.titleOptimized Memory Access Support for Data Layout Conversion on Heterogeneous Multi-core Systemsen_US
dc.typeProceedings Paperen_US
dc.identifier.journal2014 IEEE 12TH SYMPOSIUM ON EMBEDDED SYSTEMS FOR REAL-TIME MULTIMEDIA (ESTIMEDIA)en_US
dc.citation.spage128en_US
dc.citation.epage137en_US
dc.contributor.department電子工程學系及電子研究所zh_TW
dc.contributor.departmentDepartment of Electronics Engineering and Institute of Electronicsen_US
dc.identifier.wosnumberWOS:000358220900016en_US
dc.citation.woscount0en_US
Appears in Collections:Conferences Paper