標題: | Dynamic Memory Optimization and Parallelism Management for OpenCL |
作者: | Hsu, Chao-Hung Wu, I-Wei Shann, Jean Jyh-Jiun 資訊工程學系 Department of Computer Science |
關鍵字: | dynamic optimization;GPU;OpenCL;LLVM |
公開日期: | 1-Jan-2014 |
摘要: | Recently, multiprocessor platforms have become trends for achieving high performance. OpenCL (Open Computing Language) is one of the programming standards for heterogeneous multiprocessors, and provides portability for these platforms. Our research focuses on platforms with CPUs and GPUs since GPUs are now widespread in use. On such a platform, two programming issues may affect the performance on GPU computing significantly. One is the work load distribution and another is the employment of GPU memory hierarchy. To fully utilize the characteristics of GPUs, programmers have to be not only proficient at parallel programming but also familiar with hardware specifications. Therefore, in this paper, we propose a compilation pass to automatically perform optimizations for OpenCL kernels. Our compilation pass will transform an input naive kernel function with optimizations, including kernel function analysis, work- group rearrangement, memory coalescing, and work-item merge. In addition, our framework is implemented on a runtime system so that it may dynamically adjust the optimizing parameters according to the hardware specifications. Considering the execution time, the optimized kernels generated by our design may have significant performance improvement over the naive versions. Although the optimizations performed in runtime may incur time overheads, the overheads may be covered by intensive kernel computation or massive input data in most cases. |
URI: | http://hdl.handle.net/11536/124998 |
ISBN: | 978-1-4799-3197-2 |
ISSN: | |
期刊: | 2014 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE, ELECTRONICS AND ELECTRICAL ENGINEERING (ISEEE), VOLS 1-3 |
起始頁: | 780 |
結束頁: | 784 |
Appears in Collections: | Conferences Paper |