標題: Enhancing Text Classification with the Universum
作者: Liu, Chien-Liang
Lee, Ching-Hsien
工業工程與管理學系
Department of Industrial Engineering and Management
公開日期: 2016
摘要: The Universum is a data set that shares the same domain as the target problem, but does not comprise any category of interest. Recently, the concept of inference through contradictions has shown that the Universum provides a means for machine learning algorithms to encode prior knowledge into the model to improve performance. This work investigates whether text classification algorithms can benefit from the Universum when one has only a few labeled examples at hand. Additionally, this work proposes a confidence scheme to incorporate Universum into the learning process, and further devises a learning with Universum algorithm called Universum logistic regression (U-LR). The confidence scheme provides another means for machine learning algorithms to incorporate Universum into learning process. We conduct experiments on three data sets with several combinations. The experimental results indicate that the proposed method outperforms the other learning with Universum methods.
URI: http://hdl.handle.net/11536/136453
ISBN: 978-1-5090-4093-3
期刊: 2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD)
起始頁: 1147
結束頁: 1153
顯示於類別:會議論文