标题: 假说选择之于整体式学习之研究
Hypothesis Selection for Ensemble Learning
作者: 于主恩
庄仁辉
李嘉晃
刘建良
Chu, En-Yu
Chuang, Jen-Hui
Lee, Chia-Hoang
Chian-Liang Liu
资讯科学与工程研究所
关键字: 整体式学习;假设选择;假设发散;阶层式假设结构;Ensemble Learning;Hypothesis Selection;Hypothesis Divergence;Hypothesis Hierarchical Structure
公开日期: 2017
摘要: 机器学习研究的目标之一是提高分类的准确性。过去几十年,许多研究着重于根据问题领域和统计学习理论开发新的演算法,以不断提高分类效能。最近很多研究人员发现,只有使用单一的分类算法时才会出现校能瓶颈,因为每种演算法虽然都有其优点,但也有其弱点。整体式学习,结合了几个分类器或假设成为一个强大的分类器或学习器,依赖于各种假设的组合,而不是使用最新的算法。在整体式学习中,假设选择对于效能非常重要,选择假设的多样性是一个重要的选择标准。本篇论文提出了三种算法,其重点在于产生阶层性假设结构以实现假设选择的目标,其中基于特定标准来合并两个假说。我们对8个数据集进行实验,实验结果证明,该方法优于随机森林演算法。
One of the goals for the machine learning research is to improve the accuracy of the classification. Many research studies have focused on developing novel algorithms according to problem domains and statistical learning theory to continuously improve classification performance over the past decades. Recently, many researchers have found that performance bottleneck often occurs when only using a single classification algorithm, since each algorithm has its strength, but it also has its weakness. Ensemble learning, which combines several classifiers or hypotheses to become a strong classifier or learner, relies on the combination of various hypotheses rather than using state-of-the-art algorithms. In ensemble learning, hypothesis selection is crucial to performance, and the diversity of the selected hypotheses is an important selection criterion. This work proposes three algorithms focusing on generating a hierarchical hypothesis structure to achieve the goal of hypothesis selection, in which the two hypotheses are combined based on particular criterion. We conduct experiments on 8 data sets, and the experimental results indicate that the proposed method outperforms random forest, which is a state-of-the-art method.
URI: http://etd.lib.nctu.edu.tw/cdrfb3/record/nctu/#GT070456172
http://hdl.handle.net/11536/142360
显示于类别:Thesis