Title: | Interactive sequence discovery by incremental mining |
Authors: | Lin, MY; Lee, SY; Department of Computer Science |
Keywords: | data mining;sequential pattern;interactive discovery;knowledge base;incremental mining |
Issue Date: | 19-Oct-2004 |
Abstract: | Sequential pattern mining has become a challenging task in data mining due to its complexity. Essentially, the mining algorithms discover all the frequent patterns meeting the user-specified minimum support threshold. However, it is very unlikely that the user will obtain satisfactory patterns in just one query. Usually the user must try various support thresholds to mine the database for the final desired set of patterns. Consequently, the time-consuming mining process has to be repeated several times, and current approaches are inadequate for such interactive mining because of the long processing time required for each query. In order to reduce the response time for each query during the interactive process, we propose a knowledge-base-assisted mining algorithm for interactive sequence discovery. The proposed approach utilizes the knowledge acquired from each mining process, accumulates the counting information to facilitate efficient counting of patterns, and speeds up the whole interactive mining process. Furthermore, the knowledge base makes possible the direct generation of new candidate sets and the concurrent support counting of variable-sized candidates. For some queries, database access is not required at all, because the pattern information is already kept in the knowledge base. The conducted experiments show that our approach outperforms GSP, a state-of-the-art sequential pattern mining algorithm, by several orders of magnitude for interactive sequence discovery. (C) 2003 Elsevier Inc. All rights reserved. |
URI: | http://dx.doi.org/10.1016/j.ins.2003.09.021 http://hdl.handle.net/11536/25749 |
ISSN: | 0020-0255 |
DOI: | 10.1016/j.ins.2003.09.021 |
Journal: | INFORMATION SCIENCES |
Volume: | 165 |
Issue: | 3-4 |
Start Page: | 187 |
End Page: | 205 |
Appears in Collections: | Journal Articles |
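
The abstract's central idea, keeping the support counts from earlier queries so that later queries with different thresholds need fewer (or no) database scans, can be illustrated with a minimal sketch. This is not the authors' algorithm: the class `KnowledgeBase`, its fields, the `scans` counter, and the simplified data model (each sequence is a tuple of single items, and the threshold is an absolute count of at least 1) are all assumptions made for illustration. Candidate generation uses a plain GSP-style join.

```python
def is_subsequence(pattern, sequence):
    """True if pattern occurs in sequence in order (gaps allowed)."""
    it = iter(sequence)
    return all(item in it for item in pattern)


class KnowledgeBase:
    """Hypothetical cache of support counts shared across queries."""

    def __init__(self, database):
        self.database = database
        self.counts = {}            # pattern (tuple) -> support count
        self.mined_down_to = None   # lowest threshold fully mined so far
        self.scans = 0              # database passes, for illustration only

    def _count(self, candidates):
        # Only candidates not yet in the knowledge base need a database pass.
        new = [c for c in candidates if c not in self.counts]
        if not new:
            return
        self.scans += 1
        for c in new:
            self.counts[c] = 0
        for seq in self.database:
            for c in new:
                if is_subsequence(c, seq):
                    self.counts[c] += 1

    def query(self, min_count):
        """Return {pattern: support} for patterns with support >= min_count (>= 1)."""
        # A threshold at or above one already mined is answered from the cache.
        if self.mined_down_to is not None and min_count >= self.mined_down_to:
            return {p: n for p, n in self.counts.items() if n >= min_count}
        items = sorted({i for seq in self.database for i in seq})
        level = [(i,) for i in items]
        while level:
            self._count(level)
            frequent = [c for c in level if self.counts[c] >= min_count]
            # Join p and q when p minus its first item equals q minus its last.
            level = [p + (q[-1],) for p in frequent for q in frequent
                     if p[1:] == q[:-1]]
        self.mined_down_to = min_count
        return {p: n for p, n in self.counts.items() if n >= min_count}


if __name__ == "__main__":
    db = [("a", "b", "c"), ("a", "c"), ("a", "b", "c", "d"), ("b", "d")]
    kb = KnowledgeBase(db)
    for threshold in (2, 3, 1):
        result = kb.query(threshold)
        print(threshold, len(result), "patterns;", kb.scans, "scans so far")
```

In this sketch, raising the threshold after a query costs no database pass at all, and lowering it only scans for candidates that were never counted before. The paper's knowledge base is described as doing more than this (direct candidate generation and concurrent counting of variable-sized candidates), which the sketch does not attempt.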