2001年資料探勘競賽研究

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	張文賢	en_US
dc.contributor.author	林心宇	en_US
dc.date.accessioned	2014-12-12T03:04:05Z	-
dc.date.available	2014-12-12T03:04:05Z	-
dc.date.issued	2003	en_US
dc.identifier.uri	http://140.113.39.130/cdrfb3/record/nctu/#GT009012537	en_US
dc.identifier.uri	http://hdl.handle.net/11536/80836	-
dc.description.abstract	資料探勘是一種分析的程序，用來幫助我們發現大型資料庫中的特徵及知識。因為有關生物學的資料探勘快速的發展，2001年資料探勘競賽聚焦在基因及藥物設計資料上。我們所熱衷的是一個分類問題，這個問題有三個有趣的特性（1）大量的遺漏值（2）大量的屬性（3）混合兩種不同型態的資料，而我們最感興趣的分類方法就是決策樹分類法，我們修改了決策樹演算法，並引入“少數服從多數”技巧來提昇分類正確性。為了結合上述兩種分類方法我們發展出“主要-輔助”分類系統。	zh_TW
dc.description.abstract	Data mining is an analysis process which helps discovering patterns and knowledge in large databases. Because of the rapid growth of interest in mining biological databases, KDD Cup 2001 was focused on data from genomics and drug design. We were involved in a classification problem. The problem has three interesting features: (1) the dataset contains many missing values; (2) this dataset has a lot of attributes; and (3) the dataset is a mixture of two types of data, while the classification method we interested in most is Decision Tree. We modify the Decision Tree algorithm and cite the majority vote to improve the classification accuracy. For integrating the above two classification methods we develop " Primary-Secondary " classification system.	en_US
dc.language.iso	en_US	en_US
dc.subject	資料探勘競賽	zh_TW
dc.subject	決策樹	zh_TW
dc.subject	少數服從多數	zh_TW
dc.subject	"主要-輔助”分類系統	zh_TW
dc.subject	KDD Cup	en_US
dc.subject	Decision Tree	en_US
dc.subject	Majority vote	en_US
dc.subject	"Primary-Secondary " classification system	en_US
dc.title	2001年資料探勘競賽研究	zh_TW
dc.title	Study on KDD cup 2001	en_US
dc.type	Thesis	en_US
dc.contributor.department	電控工程研究所	zh_TW
顯示於類別：	畢業論文

文件中的檔案：

253701.pdf

253702.pdf

若為 zip 檔案，請下載檔案解壓縮後，用瀏覽器開啟資料夾中的 index.html 瀏覽全文。