標題: 適用於H.264/MPEG4-AVC及其可調式視訊編碼之高效頻寬移動估測研究
The Study of Bandwidth Efficient Motion Estimation for H.264/MPEG4-AVC Video Coding and Its Scalable Extension
作者: 李國龍
Li, Gwo-Long
張添烜
陳美娟
Chang, Tian-Sheuan
Chen, Mei-Juan
電子研究所
關鍵字: 移動估測;高效頻寬;H.264/MPEG4-AVC;可調式視訊編碼;Motion estimation;Data efficient;H.264/MPEG4-AVC;Scalable video coding
公開日期: 2011
摘要: 移動估計演算法的高計算複雜度與資料頻寬存取量,一直以來是視訊編碼研究領域中非常重要的研究議題。然而,隨著高解析度視訊影像應用需求的增加,使得移動估計演算法的高資料頻寬存取量,成為影響整體視訊編解碼器效能的重要關鍵。此外,為了達到應用可調性,可調式視訊編碼所採用的層間預測模式,亦為移動估計帶來更高之資料頻寬存取量及計算複雜度。為了解決視訊編碼之高資料頻寬存取量所造成編碼系統效能下降之問題,本論文提出若干適用於整數點及分數點移動估計演算法之資料存取及計算量減少演算法。 針對整數點移動估計資料頻寬之問題,本論文提出一位元失真率最佳化的頻寬有效率之移動估計演算法。在此演算法中,本論文提出一數學模型以描述位元失真率與頻寬之關係。藉由此模型的建立,進而發展出一低頻寬需求之移動估計演算法。此外,經由此數學模型的幫助,本論文亦發展出一頻寬感知移動估計演算法。此演算法可以在頻寬有限的條件之下,適當的分配頻寬資源給移動估計演算法,進而達到更佳的失真位元率表現。透過本論文所發展之方法,可以達到78.82%頻寬節省。 在可調式視訊編碼中,由於其額外採用的層間預測方式,使得原本整數點移動估計之高頻寬存取所帶來的問題更加嚴重。因此,本論文提出若干個適用於可調式視訊編碼之高效頻寬層間預測移動估計演算法。透過利用畫面層間之高相關性,達到較高的資料共用性進而減少多餘的資料存取。實驗結果顯示,本論文提出之適用於可調式視訊編碼之高效頻寬層間預測演算法,可至少達到50.55%的資料頻寬之節省。 除了可調式視訊編碼中額外採用的層間預測方法所帶的高資料頻寬存取量之問題外,分數型移動估計演算法的高計算複雜度,亦大幅度的造成計算複雜度的增加進而影響可調式視訊編碼器的系統效率。因此,本論文提出一分數型移動估計模式預先選擇演算法,以預先過濾掉潛在可忽略之預測模式。藉由觀察不同模式之整數型移動估計成本及分數型移動估計成本之間之關係,進而提出若干個模式過濾機制。透過本論文提出之演算法,在不造成太多位元失真率效能失真的情況之下,平均可達到65.97%模式減少。 對於分數型移動估計演算法因硬體設計考量,導致參考資料未被從外部記憶體下載進來所造成的效能下降而言,本論文提出了搜尋範圍重新決定演算法用以減少視訊編碼器之效能下降。在此演算法中,本論文透過觀察移動向量子與非重疊區域尺寸之關係,提出一數學公式予以描述。因此,透過移動向量子的大小,即可計算出非重疊區域尺寸,進而重新計算所需之搜尋範圍尺寸。此外,本論文亦提出一搜尋範圍長寬比決定演算法,透過移動向量子與非重疊區域尺寸關係之數學求解,可得到較佳之非對稱式搜尋範圍長寬比。經由本論文所提出之搜尋範圍與長寬比決定之演算法,可達到90%的位元率下降改善。 整體而言,透過本論文所提之演算法,除了可大幅度的減少資料頻寬之存取外,亦可減少整數及分數點的計算複雜度,進而達到更佳視訊編碼效能之改善。
In the video coding system, the overall system performance is dominated by the motion estimation module due to its high computational complexity and memory bandwidth intensive data accesses. Furthermore, with the increasing demands of high definition TV, the system performance drop caused by the intensive data bandwidth access requirement becomes even more significant. In addition, the additional adopted Inter-layer prediction modes of scalable video coding also significant increase the data access bandwidth overhead and computational complexity. To solve the high computation complexity and intensive data bandwidth access problems, this dissertation proposes several data access bandwidth and computational complexity reduction algorithms for both of integer and fractional motion estimation. First, this dissertation proposes a rate distortion bandwidth efficient motion estimation algorithm to reduce the data bandwidth requirements in integer motion estimation. In this algorithm, a mathematical model is proposed to describe the relationship between rate distortion cost and data bandwidth. Through the modeling results, a data bandwidth efficient motion estimation algorithm is thus proposed. In addition, a bandwidth aware motion estimation algorithm based on the modeling results is also proposed to efficiently allocate the data bandwidth for motion estimation under the available bandwidth constraint. Simulation results show that our proposed algorithm can achieve 78.82% data bandwidth saving. In scalable video coding standard, the additional included Inter-layer prediction modes significantly deteriorate the video system coding performance since much more data have to be accessed for the prediction purpose. Therefore, this dissertation proposes several data efficient Inter-layer prediction algorithms to lighten the intensive data bandwidth requirement problem in scalable video coding. By observing the relationship between spatial layers, several data reusing algorithms have been proposed and thus achieve more data bandwidth requirement reduction. Simulation results demonstrate that our proposed algorithm can achieve 50.55% data bandwidth reduction at least. In addition to the system performance degradation caused by intensive data bandwidth access problem, the high computational complexity of fractional motion estimation also noticeably increases the system performance drop in scalable video coding. Therefore, this dissertation proposes a mode pre-selection algorithm for fractional motion estimation in scalable video coding. In our proposed algorithm, the rate distortion cost relationship between different prediction modes are observed and analyzed first. Based on the observing and analytical results, several mode pre-selection rules are proposed to filter out the potentially skippable prediction modes. Simulation results provide that our proposed mode pre-selection algorithm can reduce 65.97% prediction modes with ignorable rate distortion performance degradation. Finally, for the video coding system performance drop problem caused by the fractional motion estimation process skipping due to hardware implementation consideration, this dissertation proposes a search range adjust algorithm to adjust the search range for the motion estimation so that the new decided search range can cover the absent reference data as much as possible for fractional motion estimation. By mathematically modeling the relationship between motion vector predictor and non-overlapping area size, the new search range can thus be adjusted. In addition, a search range aspect ratio adjust algorithm is also proposed in this dissertation by means of solving the mathematical equations. Through the proposed search range adjust algorithm, up to 90.56% of bitrate increasing can be reduced when compared to fractional motion estimation skipping mechanism. Furthermore, the proposed search range aspect ratio adjust algorithm can achieve better rate distortion performance when compared to the exhaustive search method under the same search range area constraint. In summary, through the algorithms proposed in this dissertation, not only the data access bandwidth but the computational complexity of integer and fractional motion estimation can be reduced and thus improve the overall video coding system performance significantly.
URI: http://140.113.39.130/cdrfb3/record/nctu/#GT079811835
http://hdl.handle.net/11536/46858
Appears in Collections:Thesis


Files in This Item:

  1. 183501.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.