完整後設資料紀錄
DC 欄位語言
dc.contributor.authorHuang, Er-Chenen_US
dc.contributor.authorPao, Hsing-Kuoen_US
dc.contributor.authorLee, Yuh-Jyeen_US
dc.date.accessioned2018-08-21T05:57:13Z-
dc.date.available2018-08-21T05:57:13Z-
dc.date.issued2017-01-01en_US
dc.identifier.urihttp://hdl.handle.net/11536/147187-
dc.description.abstractActive learning is a common strategy to deal with large-scale data with limited labeling effort. In each iteration of active learning, a query is ready for oracle to answer such as what the label is for a given unlabeled data. Given the method, we can request the labels only for those data that are essential and save the labeling effort from oracle. We focus on pool-based active learning where a set of unlabeled data is selected for querying in each run of active learning. To apply pool-based active learning to massive high-dimensional data, especially when the unlabeled data set is much larger than the labeled set, we propose the APRAL and MLP strategies so that the computation for active learning can be dramatically reduced while keeping the model power more or less the same. In APRAL, we avoid unnecessary data re-ranking given an unlabeled data selection criteria. To further improve the efficiency, with MLP, we organize the unlabeled data in a multi-layer pool based on a dimensionality reduction technique and the most valuable data to know their label information are more likely to store in the top layers. Given the APRAL and MLP strategies, the active learning computation time is reduced by about 83% if compared to the traditional active learning ones; at the same time, the model effectiveness remains.en_US
dc.language.isoen_USen_US
dc.subjectactive learningen_US
dc.subjecthigh dimensionalityen_US
dc.subjectlarge-scale dataen_US
dc.subjectpool-based samplingen_US
dc.titleBig Active Learningen_US
dc.typeProceedings Paperen_US
dc.identifier.journal2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA)en_US
dc.citation.spage94en_US
dc.citation.epage101en_US
dc.contributor.department應用數學系zh_TW
dc.contributor.departmentDepartment of Applied Mathematicsen_US
dc.identifier.wosnumberWOS:000428073700017en_US
顯示於類別:會議論文