Title: | An efficient and sensitive decision tree approach to mining concept-drifting data streams |
Authors: | Tsai, Cheng-Jung; Lee, Chien-I; Yang, Wei-Pang; Department of Computer Science |
Keywords: | data mining; data streams; incremental learning; decision tree; concept drift |
Publication date: | 2008 |
Abstract: | Data stream mining has become a novel research topic of growing interest in knowledge discovery. Most proposed algorithms for data stream mining assume that each data block is essentially a random sample from a stationary distribution, but many available databases violate this assumption. That is, the class of an instance may change over time, a phenomenon known as concept drift. In this paper, we propose a Sensitive Concept Drift Probing Decision Tree algorithm (SCRIPT), which is based on the statistical χ² test, to handle the concept drift problem on data streams. Compared with previously proposed methods, the advantages of SCRIPT include: a) it avoids unnecessary system cost for stable data streams; b) it immediately and efficiently corrects the original classifier when data streams are unstable; c) it is better suited to applications in which sensitive detection of concept drift is required. |
URI: | http://hdl.handle.net/11536/9882 |
ISSN: | 0868-4952 |
Journal: | INFORMATICA |
Volume: | 19 |
Issue: | 1 |
Start page: | 135 |
End page: | 156 |
Appears in collections: | Journal articles |