Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lai, Bo-Cheng Charles | en_US |
dc.contributor.author | Chiang, Chih-Hsuan | en_US |
dc.contributor.author | Li, Guan-Ru | en_US |
dc.date.accessioned | 2014-12-08T15:21:18Z | - |
dc.date.available | 2014-12-08T15:21:18Z | - |
dc.date.issued | 2011 | en_US |
dc.identifier.isbn | 978-0-7695-4576-9 | en_US |
dc.identifier.issn | 1521-9097 | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/15122 | - |
dc.identifier.uri | http://dx.doi.org/10.1109/ICPADS.2011.43 | en_US |
dc.description.abstract | Object detection has become an enabling function for modern smart embedded devices to perform intelligent applications and interact with the environment appropriately and promptly. However, the limited computation resource of embedded devices has become a barrier to execute the computation intensive object detection algorithm. Leveraging the multi-threading scheme on embedded multi-core systems provides an opportunity to boost the performance. However, the memory bottleneck limits the performance scalability. Improving data locality of applications and maximizing the data reuse for on-chip caches have therefore become critical design concerns. This paper comprehensively analyzes the memory behavior and data locality of a multi-threaded object detection algorithm. A novel Classifier-Grouping scheme is proposed to significantly enhance the data reuse for on-chip caches of embedded multi-core systems. By executing a multi-threaded object detection algorithm on a cycle-accurate multi-core simulator, the proposed approach can achieve up to 62% better performance when compared with the original parallel program. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | data locality | en_US |
dc.subject | object detection | en_US |
dc.subject | parallel processing | en_US |
dc.subject | multi-core | en_US |
dc.subject | embedded device | en_US |
dc.title | Classifier Grouping to Enhance Data Locality for A Multi-Threaded Object Detection Algorithm | en_US |
dc.type | Proceedings Paper | en_US |
dc.identifier.doi | 10.1109/ICPADS.2011.43 | en_US |
dc.identifier.journal | 2011 IEEE 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS) | en_US |
dc.citation.spage | 268 | en_US |
dc.citation.epage | 275 | en_US |
dc.contributor.department | 電子工程學系及電子研究所 | zh_TW |
dc.contributor.department | Department of Electronics Engineering and Institute of Electronics | en_US |
dc.identifier.wosnumber | WOS:000299395900035 | - |
Appears in Collections: | Conferences Paper |
Files in This Item:
If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.