標題: An extended Chi2 algorithm for discretization of real value attributes
作者: Su, CT
Hsu, JH
工業工程與管理學系
Department of Industrial Engineering and Management
關鍵字: VPRS model;RST;data mining;discretization
公開日期: 1-三月-2005
摘要: The Variable Precision Rough Sets (VPRS) model is a powerful tool for data mining, as it has been widely applied to acquire knowledge. Despite its diverse applications in many domains, the VPRS model unfortunately cannot be applied to real-world classification tasks involving continuous attributes. This requires a discretization method to preprocess the data. Discretization is an effective technique to deal with continuous attributes for data mining, especially for the classification problem. The modified Chi2 algorithm is one of the modifications to the Chi2 algorithm, replacing the inconsistency check in the Chi2 algorithm by using the quality of approximation, coined from the Rough Sets Theory (RST), in which it takes into account the effect of degrees of freedom. However, the classification with a controlled degree of uncertainty, or a misclassification error, is outside the realm of RST. This algorithm also ignores the effect of variance in the two merged intervals. In this study, we propose a new algorithm, named the extended Chi2 algorithm, to overcome these two drawbacks. By running the software of See5, our proposed algorithm possesses a better performance than the original and modified Chi2 algorithms.
URI: http://dx.doi.org/10.1109/TKDE.2005.39
http://hdl.handle.net/11536/13970
ISSN: 1041-4347
DOI: 10.1109/TKDE.2005.39
期刊: IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING
Volume: 17
Issue: 3
起始頁: 437
結束頁: 441
顯示於類別:期刊論文


文件中的檔案:

  1. 000226358200011.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。