標題: An inverted file cache for fast information retrieval
作者: Shieh, WY
Shann, JJJ
Chung, CP
資訊工程學系
Department of Computer Science
關鍵字: information retrieval system;inverted file;cache;hashing;memory management
公開日期: 1-七月-2003
摘要: The inverted file is the most popular indexing mechanism used for document search in an information retrieval system (IRS). However, the disk I/O for accessing the inverted file becomes a bottleneck in an IRS. To avoid using the disk I/O, we propose a caching mechanism for accessing the inverted file, called the inverted file cache (IF cache). In this cache, a proposed hashing scheme using a linked list structure to handle collisions in the hash table speeds up entry indexing. Furthermore, the replacement and storage mechanisms of this cache are designed specifically for the inverted file structure. We experimentally verify our design, based on documents collected from the TREC (Text REtrieval Conference) and search requests generated by the Zipf-like distribution. Simulation results show that the IF cache can improve the performance of a test IRS by about 60% in terms of the average searching response time.
URI: http://hdl.handle.net/11536/27778
ISSN: 1016-2364
期刊: JOURNAL OF INFORMATION SCIENCE AND ENGINEERING
Volume: 19
Issue: 4
起始頁: 681
結束頁: 695
顯示於類別:期刊論文


文件中的檔案:

  1. 000184321600009.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。