完整後設資料紀錄
DC 欄位語言
dc.contributor.authorShieh, WYen_US
dc.contributor.authorChung, CPen_US
dc.date.accessioned2014-12-08T15:19:35Z-
dc.date.available2014-12-08T15:19:35Z-
dc.date.issued2005-03-01en_US
dc.identifier.issn0306-4573en_US
dc.identifier.urihttp://dx.doi.org/10.1016/j.ipm.2003.10.004en_US
dc.identifier.urihttp://hdl.handle.net/11536/13941-
dc.description.abstractMany information retrieval systems use the inverted file as indexing structure. The inverted file, however, requires inefficient reorganization when new documents are to be added to an existing collection. Most studies suggest dealing with this problem by sparing free space in an inverted file for incremental updates. In this paper, we propose a run-time statistics-based approach to allocate the spare space. This approach estimates the space requirements in an inverted file using only a little most recent statistical data on space usage and document update request rate. For best indexing speed and space efficiency, the amount of the spare space to be allocated is determined by adaptively balancing the trade-offs between reorganization reduction and space utilization. Experiment results show that the proposed space-sparing approach significantly avoids reorganization in updating an inverted file, and in the meantime, unused free space can be well controlled such that the file access speed is not affected. (C) 2003 Elsevier Ltd. All rights reserved.en_US
dc.language.isoen_USen_US
dc.subjectinformation retrievalen_US
dc.subjectinverted fileen_US
dc.subjectincremental updateen_US
dc.subjectstatistical approachen_US
dc.subjectspare spaceen_US
dc.titleA statistics-based approach to incrementally update inverted filesen_US
dc.typeArticleen_US
dc.identifier.doi10.1016/j.ipm.2003.10.004en_US
dc.identifier.journalINFORMATION PROCESSING & MANAGEMENTen_US
dc.citation.volume41en_US
dc.citation.issue2en_US
dc.citation.spage275en_US
dc.citation.epage288en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:000225323100007-
dc.citation.woscount10-
顯示於類別:期刊論文


文件中的檔案:

  1. 000225323100007.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。