以影像內涵之形狀及色彩為基礎的影像檢索系統

標題:	以影像內涵之形狀及色彩為基礎的影像檢索系統 Content-based Image Retrieval Using Shape and Color
作者:	石昭玲 Jau-Ling Shih Ling-Hwei Chen 陳玲慧資訊科學與工程研究所
關鍵字:	影像檢索;特徵擷取;商標檢索;Content-based Image Retreival;Feature Extraction;Trademark retreival
公開日期:	2001
摘要:	近年來，隨著數位圖書館的建立，如何有效率地管理一個大型的影像資料庫變得越來越重要。在影像資料庫中可能包含商標，彩色圖片，照片，郵票以及油畫等等。事實上，若將傳統處理文字資料的圖書館分類系統用於影像資料庫上並不太適當，因若想用傳統文字檢索鍵(keyword)的方法來搜尋影像資料庫，勢必要將每一筆影像資料用文字建立檢索鍵。如此一來不只要花費大量的時間和金錢建立文字檢索鍵，而且所建立的檢索鍵又可能太過主觀不符合實際需求。因此對於影像資料庫的管理，首先要建立一個有效率的搜尋方法，而捨棄傳統以文字為檢索鍵的方法。目前以影像本身的內容為檢索鍵的搜尋方式是影像資料庫管理上的最佳利器！配合著數位圖書館的建立網際網路應用的普及，此項技術已吸引許多研究單位的投入。其中大部份的系統都是處理彩色影像資料庫，事實上商標搜尋本身，也是有著很大的應用市場。由於著作及智慧財產權觀念的建立，如何設計一個商標，不致於去模倣到現有已註冊商標也變成越來越重要。因此，如何設計一良好的商標搜尋系統以幫助使用者在極短的時間內，找尋出相似的商標已成為一重要研究課題。因此，本論文首先將提出商標檢索系統。此系統含三部分：商標切割、特徵擷取及商標檢索。第一個部分將提出半自動的切割系統，使用者可利用此系統，將商標中最具代表性的數個圖樣找出來。接下來，對每個圖樣，我們會以幾個特徵來描述它，包括不變矩量、邊界方向的直方統計圖，還有兩種較不受幾何變形影響的轉換係數。綜合考量每個特徵所做的搜尋結果，我們可以找到數個相似的商標。最後，可依使用者對目前搜尋結果的意見，自動調整每個特徵值的權重，以期使用者能在下一次的搜尋中，得到更滿意的結果。在另一方面，我們亦提出兩個彩色影像檢索系統，此兩系統主要是利用影像本身的子影像(sub-image)作搜尋。在第一個系統中，首先，對每張影像中的每個像素，取其上下左右的像素及自己本身構成一個內容單元。由於在一張影像中，有很多相似的內容單元，因此我們利用一個分群演算法將影像中具代表性的內容單元 (稱為基本單元) 先找出來，再利用這些基本單元搜尋相似的影像。同樣的，我們也可利用使用者對目前搜尋結果的意見，調整影像中每個基本單元在搜尋時的權重，以符合使用者之需求。在第二個系統中，我們將提出一個以區塊為單位的搜尋法，希望能搜尋與原影像部分相似的影像。首先，我們先將一張影像切成好幾塊，對每一小塊影像我們計算其彩色矩量當作特徵，因為在一張影像中可能會有很多部分是相似的，因此我們可先將這些彩色矩量做一個分群的動作，找到其中具代表性的數個彩色矩量，然後利用它們來做搜尋的動作。由於每張影像有其不同的特性，因此任何方法都不可能滿足所有影像的需求。基於此，最後我們將提出一個回饋演算法，依使用者對目前綜合各種方法的搜尋結果的意見，決定對目前這張搜尋影像最有利的方法。實驗結果顯現，我們所提出的各式搜尋方法的確相當有效。 In recent year, with the construction of digital libraries, the management of the large multimedia databases such as trademarks, color photographs, stamps and paintings becomes an important issue. Thus, the demand for an automatic and user-friendly image retrieval system based on image content become urgent. Most of the current image retrieval systems search the similar images based on the keyword of each image. However, the keywords must be defined by persons. If the image database contains many images, it will be time consuming to define keywords for the entire database. Moreover, what are the proper keywords for an image is difficult to decide. Thus, the demand for an automatic and user-friendly image retrieval system based on image content, such as color, texture, shape, spatial relationship, motion, and so on, become urgent. The major problem for a content-based image retrieval system is how to extract proper features to represent the visual content in an image and query the similar images by these features. Hence, in this dissertation, we will propose methods to extract the shape and color features for images. First, with the gradual increase of the number of trademarks, trademark imitation becomes a serious problem. Thus, to build an efficient trademark retrieval system is imperative. In this dissertation, such a system will first be presented. It consists of three phases: trademark segmentation, feature extraction, and trademark retrieval. A semi-automatic segmentation method will be proposed to extract the shapes of those representative objects, called "masks", in each trademark. Next, some features will be selected to describe a mask. These include invariant moments, the histogram of edge directions, and two kinds of transform coefficients that are robust to geometric deformation. Then, based on the rank of the feature distance, a similarity measure is provided to do the similar trademark retrieval. A feedback algorithm will be also proposed to automatically determine the weight of each feature according to the user's response. Secondly, a color image retrieval method based on the primitives of images will be proposed. The context of each pixel in an image will be defined. Then, the contexts in the image are clustered into several classes based on the algorithm of fast non-iterative clustering. The mean of the contexts in the same class is considered as a primitive of the image. The primitives are used as feature vectors. Since the numbers of primitives between images are different, a specially designed similarity measure is then proposed to do color image retrieval. To better adapt to the preferences of users, a relevance feedback algorithm is provided to automatically determine the weight of each primitive according to the user’s response. Finally, a region-based color image retrieval method will be proposed to find those images that are partly similar to the query one. An image is divided into several blocks. Then, the color features of all blocks are extracted and clustered into several classes. The mean vectors of each class are considered as a primitive of the image. All primitives are used as features. Furthermore, since for few special types of images, other methods may have better results occasionally, a relevance feedback algorithm is also provided to automatically determine the most appropriate method according to the user’s response. Experiment results show that these proposed methods are superior to others.
URI:	http://140.113.39.130/cdrfb3/record/nctu/#NT900394063 http://hdl.handle.net/11536/68589
顯示於類別：	畢業論文