Full metadata record
DC FieldValueLanguage
dc.contributor.author蕭遜文en_US
dc.contributor.authorHsun-Wen Hsiaoen_US
dc.contributor.author李素瑛en_US
dc.contributor.authorSuh-Yin Leeen_US
dc.date.accessioned2014-12-12T01:26:44Z-
dc.date.available2014-12-12T01:26:44Z-
dc.date.issued2006en_US
dc.identifier.urihttp://140.113.39.130/cdrfb3/record/nctu/#GT009067586en_US
dc.identifier.urihttp://hdl.handle.net/11536/41591-
dc.description.abstract傳統的搜尋引擎主要以關鍵字查詢為主,雖然提供布林運算式的查詢,但無法查詢關鍵字在文件中的順序(order)關係。XML文件搜尋引擎查詢時,除了必須具備傳統搜尋引擎關鍵字的查詢功能之外,必須考量XML文件中資料的階層關係,因此查詢時需透過由W3C所制定的XML查詢語言XPath[5]的語法查詢關鍵字在XML文件中的順序關係,以彌補傳統搜尋引擎所欠缺的功能。 本論文主要在大量XML文件資料庫加速查詢,利用所謂Begin-End-Level(BEL)[22]區間編碼方式,建立XML文件資料庫的索引結構。XML文件經BEL編碼之後,將索引資料值儲存於關聯式資料庫系統(Relation DataBase Man-agement System,RDBMS)。再利用XPath表示式轉換為SQL Command存取資料庫,重建(reconstruct)所得到的資料錄(records),可以獲得跟原先一致的XML文件內容。 為了加速XML搜尋引擎的查詢,引入signature file的索引機制,做為過濾機制,過濾掉不必要的資料庫查詢。zh_TW
dc.description.abstractTraditional search engine mainly query by keywords.Although Boolean opera-tions are provided, it is unable to query the ordering of keywords or attributes in XML documents. In XML document serach engine, besides the keyword query function of the traditional search engine, the ordering of data or the hierarchical relation of data in the XML documents must also be considered. XML query strings expressed in Xpath, which is the W3C XML query language, can query the order of keywords,the structure of XML documents. In this thesis, we are focused on the speed up of query operations in large XML documents database. We use Begin-End-Level (BEL) interval encoding method to build the index structure for each XML document.After the BEL coding of the XML documents, the indexes are saved into Relation Database. The query in the XPath ex-pression is transformed into the SQL query Commands. The stored records can recon-struct the original and consistent contents of XML documents. In order to speed up the query, the index mechanism of signature file is employed to filter out the unqualified documents first and avoid nonessential query operations.en_US
dc.language.isozh_TWen_US
dc.subject搜尋引擎zh_TW
dc.subject編碼zh_TW
dc.subject樹狀結構zh_TW
dc.subject節點zh_TW
dc.subject序號zh_TW
dc.subject路徑zh_TW
dc.subjectxmlen_US
dc.subjectBELen_US
dc.subjectsignature fileen_US
dc.subjectxpathen_US
dc.subjectorderen_US
dc.subjectshreddingen_US
dc.titleXML文件搜尋引擎的研究zh_TW
dc.titleStudy on the Search Engine of the XML Documentsen_US
dc.typeThesisen_US
dc.contributor.department資訊學院資訊學程zh_TW
Appears in Collections:Thesis


Files in This Item:

  1. 758601.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.