标题: 频域成分位移估测与变形网目视讯编码技术之研究
A Study of Frequency Component Motion Estimation and Deformable Mesh Video Coding
作者: 周永铭
Chou, Yung-Ming
杭学鸣, 张文钟
Hsueh-Ming Hang, Wen-Thong Chang
电子研究所
关键字: 位移估测;影像形变;视讯编码;motion estimation;image warping;video coding
公开日期: 1996
摘要: 物体位移估测(motion estimation)已广泛地使用在现有的视讯处理系统
上,目前最常用的方法有区块比对法(block matching method)及微分法(
differential method)。在本论文的第一部分中,我们利用频域成分的分
析方式,对物体位移估测的理论做定性与定量的探讨。除了研究影响估测
的两大原因-准确性(accuracy)与混淆性(ambiguity)二者之间的相互关
系以外,并推导不同估测方法在理论上的极限。最后藉由此分析设计一种
新的频域成分估测演算法(frequency component algorithm),并针对现
有的演算法提出可行的改良之道。在传统的位移补偿式压缩编码(motion-
compensated coding)系统中,通常利用规则的区块做为编码的基本单位
。由于影像内容的时空结构以及人类视觉系统的特征均未列入考虑,使得
区块式位移补偿受到这些限制而降低其效率和品质。此外,这类的编码架
构亦无法满足目前视讯压缩编码的新趋势-物体萃取(object
scalability)。因此,在本论文的第二部分□,我们提出了若干新技术适
用于以影像形变(image warping)位移补偿为基础的变形网目视讯编码(
deformable mesh video coding)系统中。这些新技术包括:前景/背景
分离术(foreground/background separation),经过重叠区域消除(
overlapped patch elimination)处理之参考点追踪术(nodal point
tracking)以及非矩形离散余弦编码法(nonrectangular DCT coding)。模
拟结果显示采用这些技术的视讯压缩架构十分适合应用在低速率之物体萃
取式编码上。
Motion estimation techniques are widely used in today's video
processing systems. The most frequently used techniques are the
block matching and the differential method. In the first part
of this thesis, we have studied this topic form a viewpoint
different from the above to explore the fundamental limits and
tradeoffs in image motion estimation. The underlying principles
behind two conflict requirements in motion estimation, accuracy
and ambiguity, become clear when they are analyzed using this
tool --- frequency component analysis. This analysis also
suggests new motion estimation algorithm and ways to improve the
existing algorithms. Ths so-called frequency component motion
estimation algorithm is thus proposed. Comparing to the
conventional block matching and phase correlation algorithms,
this approach provides more reliable displacement estimates
particularly for the noisy pictures. In a conventional motion-
compensated coding scheme, images are often partitioned into
artificial units such as rectangular blocks. Without
considering the natural spatio-temporal structure of images and
the characteristics of human visual system, the performance of
block based motion compensation is often limited by this
fundamental restriction. In addition, this coding structure
does not match the object scalability requirement which is a new
trend in video coding. Thus, in the second part of this thesis,
we develop several techniques to form a new deformable mesh
video coding algorithm. These techniques include foreground/
background separation, nodal point tracking with overlapped
patch elimination, and nonrectangular DCT coding schemes. The
simulation results indicate that this coding structure is quite
suitable for object scalable coding at low bit rates.
URI: http://140.113.39.130/cdrfb3/record/nctu/#NT850428126
http://hdl.handle.net/11536/62003
显示于类别:Thesis