PFACC: An OpenACC-like programming model for irregular nested parallelism

doi:10.1002/spe.2868

Full metadata record

DC Field	Value	Language
dc.contributor.author	Huang, Ming Hsiang	en_US
dc.contributor.author	Yang, Wuu	en_US
dc.date.accessioned	2020-10-05T01:59:43Z	-
dc.date.available	2020-10-05T01:59:43Z	-
dc.date.issued	1970-01-01	en_US
dc.identifier.issn	0038-0644	en_US
dc.identifier.uri	http://dx.doi.org/10.1002/spe.2868	en_US
dc.identifier.uri	http://hdl.handle.net/11536/154851	-
dc.description.abstract	OpenACC is a directive-based programming model that allows programmers to write graphics processing unit (GPU) programs by simply annotating parallel loops. However, OpenACC has poor support for irregular nested parallel loops, which are a natural way to express nested parallelism. We propose PFACC, a programming model similar to OpenACC. PFACC directives can be used to annotate parallel loops and to guide data movement between different levels of the memory hierarchy. Parallel loops can be arbitrarily nested or placed inside functions that are called (possibly recursively) from other parallel loops. The PFACC translator translates C programs with PFACC directives into CUDA programs by inserting runtime iteration-sharing and memory allocation routines. The PFACC runtime iteration-sharing routine is a two-level mechanism. Thread blocks dynamically organize loop iterations into batches and execute the batches in depth-first order. Different thread blocks share iterations with one another through an iteration-stealing mechanism. PFACC generates CUDA programs with reasonable memory usage because of the depth-first execution order. The two-level iteration-sharing mechanism is implemented purely in software and fits well with the CUDA thread hierarchy. Experiments show that PFACC outperforms CUDA dynamic parallelism in terms of performance and code size on most benchmarks.	en_US
dc.language.iso	en_US	en_US
dc.subject	dynamic scheduling	en_US
dc.subject	GPGPU	en_US
dc.subject	irregular parallelism	en_US
dc.subject	nested parallelism	en_US
dc.subject	OpenACC	en_US
dc.subject	parallel programming model	en_US
dc.subject	PFACC	en_US
dc.title	PFACC: An OpenACC-like programming model for irregular nested parallelism	en_US
dc.type	Article	en_US
dc.identifier.doi	10.1002/spe.2868	en_US
dc.identifier.journal	SOFTWARE-PRACTICE & EXPERIENCE	en_US
dc.citation.spage	0	en_US
dc.citation.epage	0	en_US
dc.contributor.department	資訊工程學系	zh_TW
dc.contributor.department	Department of Computer Science	en_US
dc.identifier.wosnumber	WOS:000546570800001	en_US
dc.citation.woscount	0	en_US
Appears in Collections:	Journal Articles
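
The abstract refers to irregular nested parallel loops without showing one. As a minimal sketch of that loop shape, the C function below sums the rows of a sparse matrix in CSR form: the outer loop is parallel over rows, and the inner loop's trip count differs from row to row. It uses standard OpenACC directives only, not PFACC's own directive syntax, which this record does not show. The function name `csr_row_sums` and the CSR example are illustrative assumptions, not taken from the paper.

```c
#include <assert.h>
#include <stddef.h>

/* Illustrative only: sums each row of a sparse matrix stored in CSR form.
 * row_ptr has n_rows + 1 entries; row i owns values[row_ptr[i]] up to
 * values[row_ptr[i + 1] - 1]. The pragmas are standard OpenACC; a compiler
 * without OpenACC support ignores them and runs the loops sequentially. */
void csr_row_sums(size_t n_rows, const size_t *row_ptr,
                  const double *values, double *sums)
{
    assert(row_ptr != NULL && sums != NULL);
    size_t nnz = row_ptr[n_rows]; /* total number of stored values */

    /* Outer loop: one independent iteration per row. */
    #pragma acc parallel loop \
        copyin(row_ptr[0:n_rows + 1], values[0:nnz]) copyout(sums[0:n_rows])
    for (size_t i = 0; i < n_rows; ++i) {
        double acc = 0.0;

        /* Inner loop: the trip count varies per row, which makes the
         * nesting irregular. */
        #pragma acc loop reduction(+:acc)
        for (size_t k = row_ptr[i]; k < row_ptr[i + 1]; ++k) {
            acc += values[k];
        }
        sums[i] = acc;
    }
}
```

Rows with very different lengths give the inner loop very uneven amounts of work, which is the kind of imbalance the abstract's iteration-sharing mechanism is described as addressing.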