標題: | A Systolic Array Based GTD Processor With a Parallel Algorithm |
作者: | Yang, Chia-Hsiang Chou, Chun-Wei Hsu, Chia-Shen Chen, Chiao-En 電子工程學系及電子研究所 Department of Electronics Engineering and Institute of Electronics |
關鍵字: | Generalized triangular decomposition (GTD);geometric mean decomposition (GMD);multiple-input multiple-output (MIMO);reconfigurable architecture |
公開日期: | 1-Apr-2015 |
摘要: | Generalized triangular decomposition (GTD) has been found to be useful in the field of signal processing, but the feasibility of the related hardware has not yet been established. This paper presents (for the first time) a GTD processor architecture with a parallel algorithm. The proposed parallel GTD algorithm achieves an increase in speed of up to 1.66 times, compared to the speed of its conventional sequential counterpart for an 8x8 matrix. For hardware implementation, the proposed reconfigurable architecture is capable of computing singular value decomposition (SVD), geometric mean decomposition (GMD), and GTD for matrix sizes from 1x1 to 8x8. The proposed GTD processor is composed of 16 processing cores in a heterogeneous systolic array. Computations are distributed over area-efficient coordinate rotation digital computers (CORDICs) to achieve a high throughput. To establish the validity of the concept, a GTD processor was designed and implemented. The latency constraint of 16 mu s specified in the 802.11ac standard is adopted for the hardware realization. The proposed design achieves a maximum throughput of 83.3k matrices/s for an 8x8 matrix at 112.4 MHz. The estimated power and core area are 172.7 mW and 1.96 mm(2), respectively, based on standard 90 nm CMOS technology. |
URI: | http://dx.doi.org/10.1109/TCSI.2015.2388831 http://hdl.handle.net/11536/124463 |
ISSN: | 1549-8328 |
DOI: | 10.1109/TCSI.2015.2388831 |
期刊: | IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS |
Volume: | 62 |
起始頁: | 1099 |
結束頁: | 1108 |
Appears in Collections: | Articles |