標題: An extended Chi2 algorithm for discretization of real value attributes
作者: Su, CT
Hsu, JH
工業工程與管理學系
Department of Industrial Engineering and Management
關鍵字: VPRS model;RST;data mining;discretization
公開日期: 1-Mar-2005
摘要: The Variable Precision Rough Sets (VPRS) model is a powerful tool for data mining, as it has been widely applied to acquire knowledge. Despite its diverse applications in many domains, the VPRS model unfortunately cannot be applied to real-world classification tasks involving continuous attributes. This requires a discretization method to preprocess the data. Discretization is an effective technique to deal with continuous attributes for data mining, especially for the classification problem. The modified Chi2 algorithm is one of the modifications to the Chi2 algorithm, replacing the inconsistency check in the Chi2 algorithm by using the quality of approximation, coined from the Rough Sets Theory (RST), in which it takes into account the effect of degrees of freedom. However, the classification with a controlled degree of uncertainty, or a misclassification error, is outside the realm of RST. This algorithm also ignores the effect of variance in the two merged intervals. In this study, we propose a new algorithm, named the extended Chi2 algorithm, to overcome these two drawbacks. By running the software of See5, our proposed algorithm possesses a better performance than the original and modified Chi2 algorithms.
URI: http://dx.doi.org/10.1109/TKDE.2005.39
http://hdl.handle.net/11536/13970
ISSN: 1041-4347
DOI: 10.1109/TKDE.2005.39
期刊: IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING
Volume: 17
Issue: 3
起始頁: 437
結束頁: 441
Appears in Collections:Articles


Files in This Item:

  1. 000226358200011.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.