標題: Supporting unified interface to wrapper generator in Integrated Information Retrieval
作者: Chang, YS
Ho, MH
Sun, WC
Yuan, SM
資訊工程學系
Department of Computer Science
關鍵字: XML;information retrieval;wrapper generation;CORBA
公開日期: 1-九月-2002
摘要: Given the ever-increasing scale and diversity of information and applications on the Internet, improving the technology of information retrieval is an urgent research objective. Retrieved information is either semi-structured or unstructured in format and its sources are extremely heterogeneous. In consequence, the task of efficiently gathering and extracting information from documents can be both difficult and tedious. Given this variety of sources and formats, many choose to use mediator/wrapper architecture (Y. Papakonstantinou, A. Gupta, H. Garcia-Molina, J. Ullman, A Query Translation Scheme for Rapid Implementation of Wrappers, International Conference on Deductive and Object-Oriented Databases, Singapore, 1995), but its use demands a fast means of generating efficient wrappers. In this paper, we present a design for an automatic eXtensible Markup Language (XML)-based framework with which to generate wrappers rapidly. Wrappers created with this framework support a unified interface for a meta-search information retrieval system based on the Internet Search Service using the Common Object Request Broker Architecture (CORBA) standard. Greatly advantaged by the compatibility of CORBA and XML, a user can quickly and easily develop information-gathering applications, such as a meta-search engine or any other information source retrieval method. The two main things our design provides are a method of wrapper generation that is fast, simple, and efficient, and a wrapper generator that is CORBA and XML-compliant and that supports a unified interface. (C) 2002 Elsevier Science B.V. All rights reserved.
URI: http://dx.doi.org/10.1016/S0920-5489(02)00016-8
http://hdl.handle.net/11536/28547
ISSN: 0920-5489
DOI: 10.1016/S0920-5489(02)00016-8
期刊: COMPUTER STANDARDS & INTERFACES
Volume: 24
Issue: 4
起始頁: 291
結束頁: 309
顯示於類別:期刊論文


文件中的檔案:

  1. 000176870600003.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。