標題: 應用於即時數位視訊編解碼系統之高效能正反離散餘弦轉換架構與設計方法及相關計算機輔助軟體設計
A High-Throughput DCT/IDCT Architecture and Design Methodology with Application to Real-Time Digital Video Codec System and Associated CAD Design
作者: 朱啟誠
Ju, Chi-Cheng
李鎮宜
Chen-Yi Lee
電子研究所
關鍵字: 離散餘弦轉換;多常數乘法;分類向量量化;Discrete cosine transform;multiple constant multiplication;classified vector quantization;DCT;MCM;MPEG
公開日期: 1996
摘要: 離散餘弦轉換(DCT)及其反轉換已經廣泛的運用在許多低位元率的視訊壓
縮孤統而且已成為許多國際標準的核心部份。在高畫質數位電視及更高取
樣速率的應用中,高效能的離散餘弦轉換處理器對高品質的多媒體通訊是
十分重要的。在本論文中,我們先提出一個新的使用精確邊緣匹配向量量
化預測器的頻域分類向量量化演算法來說明離散餘弦轉換在影像處理運用
上的有效性,這個演算法在主觀和客觀上皆優於現存的演算法。之後,基
於一個新的系統化設計方法及內嵌式離散餘弦轉換模組的系統要求,我們
提出了一個應用於數位及時編解碼視訊系統的高效能二維正反離散餘弦轉
換架構。這個架構主要包含了七個低硬體成本的常數乘法器及分時多工乘
法累加器,而且具有無轉置動作,低通訊複雜度,低輸出入頻寬,低面積
時間複雜度(area-time complexity)和適合用於超大型積體電路的製作等
等迷人的特性。和傳統的架構比較,所提出的架構不僅能減少系統整合的
硬體成本而且能更容易的達成即時編碼的要求。此外,我們更提出了一個
新的多路徑搜尋的遞迴多常數乘法(MCM)的演算法來最佳化離散餘弦轉換
架構中的常數乘法器。就我們所知,這個演算法優於現存於文獻中的任何
演算法。在TSMC SPDM 0.6mm製程技術之下,我們利用COMPASSTM 0.6mm高
性能的標準元件 (Standard Cells) 製作此架構,此架構使用了100K電晶
體,晶片核心(core)大小為3.9mm x 4.8mm,即18.71 mm2。而模擬結果顯
示本架構之晶片時脈速度可以達到100MHz,這意味了本架構不但可以應用
於許多數位編解碼視訊系統,而且也可以達成即時(real-time)編解碼的
要求。
Discrete cosine transform (DCT) has been widely used in the
implementation of low bit rate codecs for video compression and
become an integral part of several international standards.
With the imminent arrival of HDTV and the considerably higher
sample rates applications, high throughput processors dedicated
to DCT are of great importance to achieve the high-quality
multimedia communication. In this thesis, first, we present a
new DCT domain classified vector quantization (CVQ) algorithm
with accurate side match VQ predictor to demonstrate the
effectiveness of DCT in many image processing applications. This
algorithm outperforms the previously proposed algorithms
subjectively and objectively. Then, based on a new systematic
design methodology and the system requirements of an embedded
DCT module, a high throughput 2D-DCT/IDCT VLSI architecture is
proposed for the applications of a real-time digital video codec
system. The proposed architecture mainly consists of seven cost
coefficient constant multipliers and time-multiplexed
multiplieraccumulators. It also possesses the appealing
properties such as no transpose operation, low communication
complexity, low I/O bandwidth, low area-time complexity and very
suitable for VLSI implementation. As compared with conventional
architectures, this architecture not only reduces the system
integration hardware cost but also achieves the real-time
encoding requirement more easily. Moreover, a new multiple
constant multiplication (MCM) recursive algorithm with multi-
path search strategy is proposed to optimize the constant
multipliers used in the proposed DCT/IDCT architecture. To our
knowledge, this algorithm outperforms any existing algorithms in
literature. Based on TSMC SPDM 0.6mm process technology, we use
COMPASSTM 0.6mm high performance cell library to implement the
proposed architecture. It integrates about 100k transistors in a
3.9mm x 4.8mm, i.e. 18.71mm2, silicon area. Simulation results
show that clock rate up to 100Mhz can be achieved, implying that
the proposed architecture can meet the real-time requirement of
many digital video codec systems.
URI: http://140.113.39.130/cdrfb3/record/nctu/#NT850428070
http://hdl.handle.net/11536/61940
Appears in Collections:Thesis