Full metadata record
DC FieldValueLanguage
dc.contributor.authorChen, Shin-Kaien_US
dc.contributor.authorHung, Cheng-Yuen_US
dc.contributor.authorChen, Ching-Chihen_US
dc.contributor.authorLiu, Chih-Weien_US
dc.date.accessioned2014-12-08T15:36:46Z-
dc.date.available2014-12-08T15:36:46Z-
dc.date.issued2014-12-01en_US
dc.identifier.issn0885-7458en_US
dc.identifier.urihttp://dx.doi.org/10.1007/s10766-013-0256-7en_US
dc.identifier.urihttp://hdl.handle.net/11536/25147-
dc.description.abstractMulticore processors can provide sufficient computing power and flexibility for complex streaming applications, such as high-definition video processing. For less hardware complexity and power consumption, the distributed scratchpad memory architecture is considered, instead of the cache memory architecture. However, the distributed design poses new challenges to programming. It is difficult to exploit all available capabilities and achieve maximal throughput, due to the combined complexity of inter-processor communication, synchronization, and workload balancing. In this study, we developed an efficient design flow for parallelizing multimedia applications on a distributed scratchpad memory multicore architecture. An application is first partitioned into streaming components and then mapped onto multicore processors. Various hardware-dependent factors and application-specific characteristics are involved in generating efficient task partitions and allocating resources appropriately. To test and verify the proposed design flow, three popular multimedia applications were implemented: a full-HD motion JPEG decoder, an object detector, and a full-HD H.264/AVC decoder. For demonstration purposes, SONY PlayStation3 was selected as the target platform. Simulation results show that, on PS3, the full-HD motion JPEG decoder with the proposed design flow can decode about 108.9 frames per second (fps) in the 1080p format. The object detection application can perform real-time object detection at 2.84 fps at resolution, 11.75 fps at resolution, and 62.52 fps at resolution. The full-HD H.264/AVC decoder applications can achieve nearly 50 fps.en_US
dc.language.isoen_USen_US
dc.subjectParallel programmingen_US
dc.subjectStreaming applicationen_US
dc.subjectMulticore architectureen_US
dc.subjectDistributed scratchpad memory architectureen_US
dc.titleParallelizing Complex Streaming Applications on Distributed Scratchpad Memory Multicore Architectureen_US
dc.typeArticleen_US
dc.identifier.doi10.1007/s10766-013-0256-7en_US
dc.identifier.journalINTERNATIONAL JOURNAL OF PARALLEL PROGRAMMINGen_US
dc.citation.volume42en_US
dc.citation.issue6en_US
dc.citation.spage875en_US
dc.citation.epage899en_US
dc.contributor.department交大名義發表zh_TW
dc.contributor.department電子工程學系及電子研究所zh_TW
dc.contributor.departmentNational Chiao Tung Universityen_US
dc.contributor.departmentDepartment of Electronics Engineering and Institute of Electronicsen_US
dc.identifier.wosnumberWOS:000341498400001-
dc.citation.woscount0-
Appears in Collections:Articles


Files in This Item:

  1. 000341498400001.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.