標題: Incremental update on sequential patterns in large databases by implicit merging and efficient counting
作者: Lin, MY
Lee, SY
資訊工程學系
Department of Computer Science
關鍵字: data mining;sequential patterns;incremental update;sequence discovery;sequence merging
公開日期: 1-七月-2004
摘要: Current approaches for sequential pattern mining usually assume that the mining is performed in a static sequence database. However, databases are not static due to update so that the discovered patterns might become invalid and new patterns could be created. In addition to higher complexity, the maintenance of sequential patterns is more challenging than that of association rules owing to sequence merging. Sequence merging, which is unique in sequence databases, requires the appended new sequences to be merged with the existing ones if their customer ids are the same. Re-mining of the whole database appears to be inevitable since the information collected in previous discovery will be corrupted by sequence merging. Instead of re-mining, the proposed IncSP (Incremental Sequential Pattern Update) algorithm solves the maintenance problem through effective implicit merging and efficient separate counting over appended sequences. Patterns found previously are incrementally updated rather than re-mined from scratch. Moreover, the technique of early candidate pruning further speeds up the discovery of new patterns. Empirical evaluation using comprehensive synthetic data shows that IncSP is fast and scalable. (C) 2003 Elsevier Ltd. All rights reserved.
URI: http://dx.doi.org/10.1016/S0306-4379(03)00036-X
http://hdl.handle.net/11536/26610
ISSN: 0306-4379
DOI: 10.1016/S0306-4379(03)00036-X
期刊: INFORMATION SYSTEMS
Volume: 29
Issue: 5
起始頁: 385
結束頁: 404
顯示於類別:期刊論文


文件中的檔案:

  1. 000221009200002.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。