標題: | An incremental mining algorithm for maintaining sequential patterns using pre-large sequences |
作者: | Hong, Tzung-Pei Wang, Ching-Yao Tseng, Shian-Shyong 資訊工程學系 Department of Computer Science |
關鍵字: | Data mining;Sequential pattern;Large sequence;Pre-large sequence;Incremental mining |
公開日期: | 1-Jun-2011 |
摘要: | Mining useful information and helpful knowledge from large databases has evolved into an important research area in recent years. Among the classes of knowledge derived, finding sequential patterns in temporal transaction databases is very important since it can help model customer behavior. In the past, researchers usually assumed databases were static to simplify data-mining problems. In real-world applications, new transactions may be added into databases frequently. Designing an efficient and effective mining algorithm that can maintain sequential patterns as a database grows is thus important. In this paper, we propose a novel incremental mining algorithm for maintaining sequential patterns based on the concept of pre-large sequences to reduce the need for rescanning original databases. Pre-large sequences are defined by a lower support threshold and an upper support threshold that act as gaps to avoid the movements of sequences directly from large to small and vice versa. The proposed algorithm does not require rescanning original databases until the accumulative amount of newly added customer sequences exceeds a safety bound, which depends on database size. Thus, as databases grow larger, the numbers of new transactions allowed before database rescanning is required also grow. The proposed approach thus becomes increasingly efficient as databases grow. (C) 2010 Elsevier Ltd. All rights reserved. |
URI: | http://dx.doi.org/10.1016/j.eswa.2010.12.008 http://hdl.handle.net/11536/8829 |
ISSN: | 0957-4174 |
DOI: | 10.1016/j.eswa.2010.12.008 |
期刊: | EXPERT SYSTEMS WITH APPLICATIONS |
Volume: | 38 |
Issue: | 6 |
起始頁: | 7051 |
結束頁: | 7058 |
Appears in Collections: | Articles |
Files in This Item:
If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.