Title: | CASTA: CUDA-Accelerated Static Timing Analysis for VLSI Designs |
Authors: | Wang, Hunta H.-W.; Lin, Louis Y.-Z.; Huang, Ryan H.-M.; Wen, Charles H.-P.; Department of Computer Science |
Keywords: | CUDA; GPU; STA; Parallel Computing |
Issue Date: | 2014 |
Abstract: | General-purpose computing on graphics processing units (GPGPU) makes parallel Static Timing Analysis (STA) of VLSI designs possible. However, memory access and synchronization across a massive number of cores make STA challenging to parallelize. In this work, we developed a fast CUDA-accelerated STA engine, named CASTA, that incorporates four novel techniques to enable high parallelism: Table-Index Remapping (TIR), Texture-Accelerated Rendering (TAR), Cell Levelization & Type Sorting (CLTS), and Timing-Table Restructuring (TTR). CLTS levelizes cells and sorts them by type so that the same timing library can be accessed efficiently. TTR modifies the data structure for the timing signals of cells to increase memory throughput. TIR remaps the axes of timing tables to retrieve data more efficiently, while TAR expands look-up tables (LUTs) to avoid extrapolation and stores the LUTs in texture memory for speed. Experimental results indicate that CASTA successfully enables high parallelism and outperforms a commercial tool by three orders of magnitude on average over several benchmark circuits. |
URI: | http://dx.doi.org/10.1109/ICPP.2014.28 http://hdl.handle.net/11536/135294 |
ISBN: | 978-1-4799-5618-0 |
ISSN: | 0190-3918 |
DOI: | 10.1109/ICPP.2014.28 |
Journal: | 2014 43RD INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP) |
Start Page: | 192 |
End Page: | 200 |
Appears in Collections: | Conference Papers |
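
The abstract describes levelizing cells and sorting them by type so that cells processed together share the same timing library. The following is a minimal host-side sketch of that general idea, not the CASTA implementation: the function name `levelize_and_sort`, the dictionary-based netlist layout, and the cell names are all hypothetical, and the levelization is a standard topological (Kahn-style) pass assumed here for illustration.

```python
from collections import defaultdict, deque


def levelize_and_sort(cells, fanin):
    """Group cells by logic level, then order each level by cell type.

    cells: dict mapping cell name -> cell type (hypothetical layout).
    fanin: dict mapping cell name -> list of driving cell names.
    Returns a list of levels; each level is a list of cell names.
    """
    fanout = defaultdict(list)
    pending = {name: 0 for name in cells}  # unresolved drivers per cell
    for name, drivers in fanin.items():
        for driver in drivers:
            fanout[driver].append(name)
            pending[name] += 1

    # Cells without drivers sit at level 0.
    level = {name: 0 for name in cells if pending[name] == 0}
    queue = deque(sorted(level))
    processed = 0
    while queue:
        name = queue.popleft()
        processed += 1
        for sink in fanout[name]:
            # A cell's level is one more than its deepest driver.
            level[sink] = max(level.get(sink, 0), level[name] + 1)
            pending[sink] -= 1
            if pending[sink] == 0:
                queue.append(sink)
    if processed != len(cells):
        raise ValueError("netlist contains a combinational loop")

    by_level = defaultdict(list)
    for name, lv in level.items():
        by_level[lv].append(name)
    # Within a level, cells of the same type become adjacent.
    return [
        sorted(by_level[lv], key=lambda n: (cells[n], n))
        for lv in sorted(by_level)
    ]
```

In a GPU setting, each returned level could be dispatched as one batch, since cells in the same level do not depend on each other; placing same-type cells next to each other is meant to let neighboring threads read the same timing tables.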