具邊界前瞻之非失真影像預測編碼技術

標題:	具邊界前瞻之非失真影像預測編碼技術 Predictively Encoded Techniques with Edge-look-ahead for Lossless Compression of Images
作者:	高立人 Lih-Jen Kau 林源倍 Yuan-Pei Lin 電控工程研究所
關鍵字:	非失真影像壓縮;預測編碼;最小平方法;邊界前瞻;環境建模;誤差補償;熵值編碼;算數編碼;長度編碼;P型控制器;Lossless image compression;Predictive coding;Least squares optimization;Edge-look-ahead;Context modeling;Error compensation;Entropy coding;Arithmetic coding;Run-length encoding;P-controller
公開日期:	2008
摘要:	非失真影像編碼在許多場合皆有其應用之需求性；例如醫學影像編碼、遠端感測，以及影像壓縮等。在進行非失真影像編碼過程中，如何有效地移除統計累贅至今仍是訊號源編碼研究領域的一個主要挑戰課題。因此已有許多關於非失真影像編碼的方法被提出。其中有部分的研究係使用可還原小波轉換。然而由各項文獻中可以發現，使用轉換編碼所得到的結果往往卻不若以空間域之預測編碼結合環境建模（predictive coding with context modeling in spatial domain）所得到之結果來的好。在本篇論文中，我們將針對近年來非失真影像編碼之概況做一簡介。此外我們亦將提出一基於線性預測之架構並以最小平方法進行此預測器係數之修正。由於使用最小平方法進行預測器係數之修正具有所謂邊界導向之特性（edge-directed characteristic），因此對於位處影像邊界附近像素之預測具有非常良好的效果。然而若將整張圖像皆以最小平方法進行預測器係數之修正勢將導致極高之複雜度，因此我們提出當編碼過程遭遇影像之邊界時才以最小平方法進行預測器係數之修正；如此運算複雜度將可大幅地降低。為了能夠於編碼過程中事先偵測影像邊界之存在與否，我們提出了一個非常簡易、有效而且僅使用已掃瞄/編碼像素（causal pixels）的邊界偵測法。如此一來我們的系統便可以預知影像邊界存在與否，並在遭遇邊界時，事先以最小平方法進行預測器係數之修正以防範較大預測誤差之發生。在我們所提出的方法中僅使用到已掃瞄/編碼像素來進行預測編碼；因此並無需傳送額外之資訊。經由實驗證明，我們所提出的方法能夠有效地在預測結果與運算複雜度之間取得良好的平衡點。此外我們也透過大量的實驗並針對當前非失真影像壓縮領域最先進的預測器（predictor）與編碼器（coder）進行比較來證明所提出方法之可用性。除上述所提出具邊界前瞻（edge-look-ahead）能力之非失真預測編碼架構外，我們發現較大的預測誤差通常好發於影像中具有邊界之處。因此，我們在本篇論文中提出一創新概念，亦即利用控制工程的技術來改進影像中位於邊界附近像素之預測結果。之所以有這樣的想法是因為我們瞭解控制系統的目的本就是希望系統的輸出能夠準確地遵循所輸入之控制命令；因此在目的上與預測編碼是一致的。此外影像中的邊界（亦即像素的急速變化）亦可視為控制系統中的步階命令（step command）。基於前述的觀察使我們產生了嘗試以控制方法來改進預測編碼效能的想法。為了實現這樣想法，我們也以Takagi以及Sugeno兩位學者所提出之模糊類神經網路實做了一適應性預測編碼架構。此外我們也將控制領域中經常被使用的Proportional Controller（P型控制器）實現於此一模糊類神經網路中以強化影像中位於邊界像素之預測結果。我們發現這樣的作法對於預測編碼的效能改善確實是有所幫助的；雖然目前的改進效果並不是那麼地顯著，但是這樣的概念的確為非失真影像預測編碼領域開啟了一個全然不同的問題思考以及解決方式。 Lossless image coding is required by many applications, such as medical imaging, remote sensing, and image archiving. It has remained a major challenge to source coding community for the difficulty of removing statistical redundancy effectively and efficiently. Therefore, many approaches have been proposed for lossless compression of images. Among proposed approaches, some of which are based on reversible transform coding, like integer wavelet transformation. However, we find in literatures that the results obtained by using transform coding are typically inferior to that of obtained by predictively encoded techniques with context modeling in spatial domain. In this dissertation, an introduction on recent advances in lossless image coding will be given. Moreover, we will propose an approach based on linear predictive coding with least-squares (LS) optimization for the adaptation of predictor coefficients. The LS-based adaptive predictor, for its edge-directed characteristic, has been shown to be useful for the prediction of pixels around boundaries. Instead of performing LS adaptation in a pixel-by-pixel manner, we adapt the predictor coefficients only when an edge is detected so that the computational complexity can be significantly reduced. For this, we propose a simple yet effective edge detector using only causal pixels. This way, the proposed system can look ahead to determine if the coding pixel is around an edge and initiate the LS adaptation in advance to prevent the occurrence of a large prediction error. Furthermore, only causal pixels are used for estimating the coding pixels in the proposed encoder; no additional side information needs to be transmitted. As we will see later in the experiments, a very good trade-off between prediction results and the computational complexity can be obtained with the proposed approach. Besides, extensive experiments as well as comparisons to existing state-of-the-art predictors and coders will be given to demonstrate the usefulness of the proposed approach. In addition to the proposed edge-look-ahead approach, we find a large prediction error can usually take place for pixels around boundaries. Therefore, we also propose in this dissertation a novel concept of using control technologies to improve prediction result for pixels around boundaries. This idea comes from the fact that the purpose of a control system is to follow the input command as precisely as possible, which has the same objective with predictive coding. Moreover, an edge or a boundary can be regarded as a step command in control system. The above observations lead to the idea of solving this problem using control technologies. To realize this idea, we also implement an adaptive predictor using Takagi-Sugeno fuzzy neural network (TS-FNN). Moreover, the widely used proportional controller in control theory is applied implicitly in the consequent part of the network as a compensator to enhance the prediction result around edges. The effectiveness of the proposed novel approach, though not very conspicuous at present, can be further improved if a more sophisticated compensator is applied, and what's more, we have brought up an idea of solving this problem in a quite different aspect for lossless compression of images.
URI:	http://140.113.39.130/cdrfb3/record/nctu/#GT008912801 http://hdl.handle.net/11536/77079
顯示於類別：	畢業論文

文件中的檔案：

280101.pdf

若為 zip 檔案，請下載檔案解壓縮後，用瀏覽器開啟資料夾中的 index.html 瀏覽全文。