Full metadata record
| DC Field | Value | Language |
| --- | --- | --- |
| dc.contributor.author | Lee, Chih-Ning | en_US |
| dc.contributor.author | Chen, Yi-Ruei | en_US |
| dc.contributor.author | Tzeng, Wen-Guey | en_US |
| dc.date.accessioned | 2019-04-02T06:04:49Z | - |
| dc.date.available | 2019-04-02T06:04:49Z | - |
| dc.date.issued | 2017-01-01 | en_US |
| dc.identifier.uri | http://hdl.handle.net/11536/150831 | - |
| dc.description.abstract | This paper proposes an online subject-based spam filter built upon an extended version of the weighted naive Bayesian (WNB) classifier. The spam filter checks email subjects only. It is faster than spam filters that scan the whole body of emails, and it remains useful even when spam senders tamper with email bodies to avoid filtering. In addition to the widely used bag-of-words feature, we further consider statistical and natural language features to discover new characteristics from email subjects. In online learning, we use an extended WNB classifier. It is not only computationally efficient, but also more adaptive to changes in spam from new malicious campaigns. The proposed classifier is immune to spam from unanticipated malicious campaigns. We evaluate the performance of our spam filter on 8 well-known ham-spam email datasets from the TREC and Enron-Spam corpora. Our approach achieves 94.85% accuracy and 95.8% F1-measure on the TREC datasets, and 95.74% accuracy and 97.2% F1-measure on the Enron-Spam datasets. Compared with previous work along the same line, our approach has 2.43%, 2.3%, and 3.2% improvements on accuracy, true positive rate, and false positive rate, respectively. | en_US |
| dc.language.iso | en_US | en_US |
| dc.subject | Spam filter | en_US |
| dc.subject | Email | en_US |
| dc.subject | Subject | en_US |
| dc.subject | Naive Bayesian | en_US |
| dc.subject | Natural language | en_US |
| dc.title | An Online Subject-Based Spam Filter Using Natural Language Features | en_US |
| dc.type | Proceedings Paper | en_US |
| dc.identifier.journal | 2017 IEEE CONFERENCE ON DEPENDABLE AND SECURE COMPUTING | en_US |
| dc.citation.spage | 479 | en_US |
| dc.citation.epage | 484 | en_US |
| dc.contributor.department | 資訊工程學系 | zh_TW |
| dc.contributor.department | Department of Computer Science | en_US |
| dc.identifier.wosnumber | WOS:000450296400070 | en_US |
| dc.citation.woscount | 0 | en_US |
Appears in Collections: Conference Papers
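
The abstract describes classifying email from the subject line alone, with a classifier that is updated online as new labeled mail arrives. The record does not give the details of the extended WNB classifier or of the statistical and natural language features, so the following is only a minimal sketch of the general idea: a plain (unweighted) multinomial naive Bayes model over bag-of-words subject tokens, updated one email at a time. The class name `OnlineSubjectNB`, the tokenizer, and the smoothing constant `alpha` are illustrative assumptions, not the authors' method.

```python
import math
import re
from collections import Counter


class OnlineSubjectNB:
    """Plain multinomial naive Bayes over subject-line tokens.

    Illustrative stand-in only: the paper's extended weighted naive
    Bayesian classifier and its extra features are not reproduced here.
    """

    LABELS = ("spam", "ham")

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # Laplace smoothing constant (assumed value)
        self.class_counts = Counter()  # emails seen per label
        self.token_counts = {label: Counter() for label in self.LABELS}
        self.vocab = set()  # all tokens seen so far

    @staticmethod
    def tokenize(subject):
        # Hypothetical tokenizer: lowercase runs of letters, digits, '$', '!'.
        return re.findall(r"[a-z0-9$!]+", subject.lower())

    def update(self, subject, label):
        """Online step: fold one labeled subject into the counts."""
        tokens = self.tokenize(subject)
        self.class_counts[label] += 1
        self.token_counts[label].update(tokens)
        self.vocab.update(tokens)

    def log_score(self, subject, label):
        """Smoothed log prior plus smoothed log likelihood of the tokens."""
        total_docs = sum(self.class_counts.values())
        n_labels = len(self.LABELS)
        score = math.log(
            (self.class_counts[label] + self.alpha)
            / (total_docs + n_labels * self.alpha)
        )
        total_tokens = sum(self.token_counts[label].values())
        # One extra vocabulary slot is reserved for unseen tokens.
        denom = total_tokens + self.alpha * (len(self.vocab) + 1)
        for tok in self.tokenize(subject):
            score += math.log((self.token_counts[label][tok] + self.alpha) / denom)
        return score

    def predict(self, subject):
        return max(self.LABELS, key=lambda label: self.log_score(subject, label))
```

Because `update` only increments counters, each new labeled subject is absorbed in time proportional to its length, which is the property that makes this family of classifiers convenient for online use.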