標題: A discretization algorithm based on Class-Attribute Contingency Coefficient
作者: Tsai, Cheng-Jung
Lee, Chien-I.
Yang, Wei-Pang
資訊工程學系
Department of Computer Science
關鍵字: data mining;classification;decision tree;discretization;Contingency Coefficient
公開日期: 1-二月-2008
摘要: Discretization algorithms have played an important role in data mining and knowledge discovery. They not only produce a concise summarization of continuous attributes to help the experts understand the data more easily, but also make learning more accurate and faster. In this paper, we propose a static, global, incremental, supervised and top-down discretization algorithm based on Class-Attribute Contingency Coefficient. Empirical evaluation of seven discretization algorithms on 13 real datasets and four artificial datasets showed that the proposed algorithm could generate a better discretization scheme that improved the accuracy of classification. As to the execution time of discretization, the number of generated rules, and the training time of C5.0, our approach also achieved promising results. (c) 2007 Elsevier Inc. All rights reserved.
URI: http://dx.doi.org/10.1016/j.ins.2007.09.004
http://hdl.handle.net/11536/9729
ISSN: 0020-0255
DOI: 10.1016/j.ins.2007.09.004
期刊: INFORMATION SCIENCES
Volume: 178
Issue: 3
起始頁: 714
結束頁: 731
顯示於類別:期刊論文


文件中的檔案:

  1. 000251621700009.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。