標題: | Detecting Drifting Concepts on the Internet |
作者: | Lee, Chien-I Tsai, Cheng-Jung Hsieh, Chien-Hui 資訊工程學系 Department of Computer Science |
關鍵字: | Internet;Incremental learning;Concept drift |
公開日期: | 2008 |
摘要: | With the explosive growth of information sources available on the World Wide Web, it has become increasingly necessary to utilize automated tools to discovery interesting and potentially useful patterns from data on the Internet. Since the data on the Internet such as communication packages, email, and e-commerce transactions come consecutively, an efficient and accurate incremental learning approach is required. Moreover, since the labels of these data may change over time, the problem of concept drift must be considered while incrementally learning from the data on the Internet. In this paper, we give a detailed discussion of the concept-drifting problem on the Internet. We also address a new problem called two-way drift. An approach adapted to the occurrence of concept drift is then proposed as the solution to incrementally learn from the data on the Internet. Our approach works as a preprocessor to detect the occurrence of concept drift and can be incorporated into any existing classification techniques. Our approach can also reveal which attribute values cause concept drift and therefore enables systems or decision makers to adopt proper decision in advance. |
URI: | http://hdl.handle.net/11536/9781 |
ISSN: | 1607-9264 |
期刊: | JOURNAL OF INTERNET TECHNOLOGY |
Volume: | 9 |
Issue: | 3 |
起始頁: | 229 |
結束頁: | 236 |
顯示於類別: | 期刊論文 |