标题: | 频域成分位移估测与变形网目视讯编码技术之研究 A Study of Frequency Component Motion Estimation and Deformable Mesh Video Coding |
作者: | 周永铭 Chou, Yung-Ming 杭学鸣, 张文钟 Hsueh-Ming Hang, Wen-Thong Chang 电子研究所 |
关键字: | 位移估测;影像形变;视讯编码;motion estimation;image warping;video coding |
公开日期: | 1996 |
摘要: | 物体位移估测(motion estimation)已广泛地使用在现有的视讯处理系统 上,目前最常用的方法有区块比对法(block matching method)及微分法( differential method)。在本论文的第一部分中,我们利用频域成分的分 析方式,对物体位移估测的理论做定性与定量的探讨。除了研究影响估测 的两大原因-准确性(accuracy)与混淆性(ambiguity)二者之间的相互关 系以外,并推导不同估测方法在理论上的极限。最后藉由此分析设计一种 新的频域成分估测演算法(frequency component algorithm),并针对现 有的演算法提出可行的改良之道。在传统的位移补偿式压缩编码(motion- compensated coding)系统中,通常利用规则的区块做为编码的基本单位 。由于影像内容的时空结构以及人类视觉系统的特征均未列入考虑,使得 区块式位移补偿受到这些限制而降低其效率和品质。此外,这类的编码架 构亦无法满足目前视讯压缩编码的新趋势-物体萃取(object scalability)。因此,在本论文的第二部分□,我们提出了若干新技术适 用于以影像形变(image warping)位移补偿为基础的变形网目视讯编码( deformable mesh video coding)系统中。这些新技术包括:前景/背景 分离术(foreground/background separation),经过重叠区域消除( overlapped patch elimination)处理之参考点追踪术(nodal point tracking)以及非矩形离散余弦编码法(nonrectangular DCT coding)。模 拟结果显示采用这些技术的视讯压缩架构十分适合应用在低速率之物体萃 取式编码上。 Motion estimation techniques are widely used in today's video processing systems. The most frequently used techniques are the block matching and the differential method. In the first part of this thesis, we have studied this topic form a viewpoint different from the above to explore the fundamental limits and tradeoffs in image motion estimation. The underlying principles behind two conflict requirements in motion estimation, accuracy and ambiguity, become clear when they are analyzed using this tool --- frequency component analysis. This analysis also suggests new motion estimation algorithm and ways to improve the existing algorithms. Ths so-called frequency component motion estimation algorithm is thus proposed. Comparing to the conventional block matching and phase correlation algorithms, this approach provides more reliable displacement estimates particularly for the noisy pictures. In a conventional motion- compensated coding scheme, images are often partitioned into artificial units such as rectangular blocks. Without considering the natural spatio-temporal structure of images and the characteristics of human visual system, the performance of block based motion compensation is often limited by this fundamental restriction. In addition, this coding structure does not match the object scalability requirement which is a new trend in video coding. Thus, in the second part of this thesis, we develop several techniques to form a new deformable mesh video coding algorithm. These techniques include foreground/ background separation, nodal point tracking with overlapped patch elimination, and nonrectangular DCT coding schemes. The simulation results indicate that this coding structure is quite suitable for object scalable coding at low bit rates. |
URI: | http://140.113.39.130/cdrfb3/record/nctu/#NT850428126 http://hdl.handle.net/11536/62003 |
显示于类别: | Thesis |