Title: Dynamic Memory Optimization and Parallelism Management for OpenCL
Authors: Hsu, Chao-Hung
Wu, I-Wei
Shann, Jean Jyh-Jiun
資訊工程學系
Department of Computer Science
Keywords: dynamic optimization;GPU;OpenCL;LLVM
Issue Date: 1-Jan-2014
Abstract: Recently, multiprocessor platforms have become trends for achieving high performance. OpenCL (Open Computing Language) is one of the programming standards for heterogeneous multiprocessors, and provides portability for these platforms. Our research focuses on platforms with CPUs and GPUs since GPUs are now widespread in use. On such a platform, two programming issues may affect the performance on GPU computing significantly. One is the work load distribution and another is the employment of GPU memory hierarchy. To fully utilize the characteristics of GPUs, programmers have to be not only proficient at parallel programming but also familiar with hardware specifications. Therefore, in this paper, we propose a compilation pass to automatically perform optimizations for OpenCL kernels. Our compilation pass will transform an input naive kernel function with optimizations, including kernel function analysis, work- group rearrangement, memory coalescing, and work-item merge. In addition, our framework is implemented on a runtime system so that it may dynamically adjust the optimizing parameters according to the hardware specifications. Considering the execution time, the optimized kernels generated by our design may have significant performance improvement over the naive versions. Although the optimizations performed in runtime may incur time overheads, the overheads may be covered by intensive kernel computation or massive input data in most cases.
URI: http://hdl.handle.net/11536/124998
ISBN: 978-1-4799-3197-2
ISSN: 
Journal: 2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3
Begin Page: 780
End Page: 784
Appears in Collections:Conferences Paper