完整後設資料紀錄
DC 欄位語言
dc.contributor.authorHuang, Ming-Hsiangen_US
dc.contributor.authorYang, Wuuen_US
dc.date.accessioned2017-04-21T06:48:13Z-
dc.date.available2017-04-21T06:48:13Z-
dc.date.issued2016en_US
dc.identifier.isbn978-1-5090-2823-8en_US
dc.identifier.issn0190-3918en_US
dc.identifier.urihttp://dx.doi.org/10.1109/ICPP.2016.70en_US
dc.identifier.urihttp://hdl.handle.net/11536/136474-
dc.description.abstractSupporting irregular nested parallelism on modern GPUs requires much effort. One should distribute the parallel tasks evenly while preserving reasonable memory usage. Moreover, the task distribution should also fit the thread hierarchy of the underlying GPU to fully exploit its computing power. We propose partial flattening, an automatic code transformation which translates annotated C programs to CUDA kernels. Thread blocks are treated as flat SIMT processors. Iterations are dynamically organized into batches. Batches are executed in a sequential (depth-first) order. A kernel is treated as multiple independent SIMT processors with an additional task-stealing mechanism. Partial flattening allows easy expression of nested parallelism and synchronization by annotating nested parallel loops or parallel-recursive calls, while preserving reasonable memory usage by the depth-first execution order. Our 2-level task distribution scheme does not need special hardware support, and fits well with the CUDA thread hierarchy. Experiments show that partial flattening outperforms NESL significantly in most benchmarks, and obtains 2.15x and 67x speedup over CUDA dynamic parallelism in Quicksort and the Bron-Kerbosch algorithm, respectively.en_US
dc.language.isoen_USen_US
dc.titlePartial Flattening: A Compilation Technique for Irregular Nested Parallelism on GPGPUsen_US
dc.typeProceedings Paperen_US
dc.identifier.doi10.1109/ICPP.2016.70en_US
dc.identifier.journalPROCEEDINGS 45TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING - ICPP 2016en_US
dc.citation.spage552en_US
dc.citation.epage561en_US
dc.contributor.department交大名義發表zh_TW
dc.contributor.departmentNational Chiao Tung Universityen_US
dc.identifier.wosnumberWOS:000387089600063en_US
dc.citation.woscount0en_US
顯示於類別:會議論文