标题: 使用重叠架构之超越Gb/s低密度同位元检查回旋码解码器设计
An over Gb/s LDPC-CC Decoder Design using the Overlapped Architecture
作者: 刘荣杰
张锡嘉
Liu, Rong-Jie
Chang, Hsie-Chia
电子工程学系 电子研究所
关键字: 低密度同位元检查回旋码;低密度同位元检查码;重叠架构;LDPC-CC;LDPC;overlapped architecture
公开日期: 2015
摘要: 在无线通讯系统中,通道编码模组扮演重要的角色,为了提供具有技术竞争
力的解决方案,不仅需要达到高吞吐量传输也必须降低伴随而来的功率消耗。由
于具有优越的错误更正能力与适合平行运算的架构,低密度奇偶校验区块码
(LDPC block codes,简称LDPC-BCs)因而广受瞩目,然而此解码器在实作时会面临高绕线复杂度的问题。此外,在设计支援多码率之LDPC-BCs 也面临许多挑战,低密度奇偶校验回旋码(LDPC convolutional codes,简称LDPC-CCs)在1999年被提出,此码不只可对任意长度的资料区块做编解码,而且容易经由穿孔(puncturing)机制提供弹性的码率。相较于LDPC-BCs,LDPC-CCs 具有较低的绕线复杂度并且依然具有优秀的错误更正能力。
束缚长度长的LDPC-CC 会有好的错误地板现象,而且适合小错误率的应用,然而储存单元的需求量会随着记忆体容量成线性增加。为了克服此问题,本论文提出重叠架构不仅减少储存单元的数量,也减少解码延迟时间。此设计经由UMC65 nm 的低漏电制程下线,目前仍然还处于量测阶段,从post-layout 模拟结果中,我们提出的解码器在227 MHz 的操作频率下,其资料吞吐量可达到5.44 Gb/s,其中包含5 个处理器,在整体晶片面积1.79 mm^2 中仅占1.19 mm^2,功率消耗为195mW,能源效率为0.007 nJ/bit/proc。
In wireless communication system, the channel coding module play an important role. For providing a highly competitive solution, both high throughput transmission and low power consumption are required. Due to the capacity-approaching performance and inherent parallel architecture, LDPC block codes (LDPC-BCs) have attracted great interests in recent years. However, the problem of high routing complexity is a serious design challenge in VLSI implementations. Moreover, the complexity of designing multiple code-rates LDPC-BCs is increased because of different parity-check matrices are needed to be jointly considered. LDPC convolutional codes (LDPC-CCs) were introduced in 1999, which are not only capable of handling variable length of data frame but also possess flexible code-rates through puncturing. While performing the capacity-approaching performance, the routing complexity of the convolutional version is much lower than that of the LDPC-BCs.
The long constraint length LDPC-CC has great error floor performance, which is very suitable for deep error rate applications. However, the requirement of the storage units grow large linearly according to the memory size. To overcome this problem, our work proposed the overlapped architecture to not only reduce the number of storage units but also decoding latency. The design is fabricated in UMC 65nm LL CMOS process and still under measurement. The post-layout simulations show that our proposed decoder could achieve 5.44 Gb/s under 227MHz operating frequency. The decoder containing 6 processors only occupies an area of 1.19 mm^2 within the core area of total 1.14*1.57 mm^2. It draws 195 mW of power with an
energy efficiency of 0.007nJ/bit/proc.
URI: http://etd.lib.nctu.edu.tw/cdrfb3/record/nctu/#GT070250204
http://hdl.handle.net/11536/138485
显示于类别:Thesis