標題: Accelerating web content filtering by the early decision algorithm
作者: Lin, Po-Ching
Liu, Ming-Dao
Lin, Ying-Dar
Lai, Yuan-Cheng
資訊工程學系
Department of Computer Science
關鍵字: webfiltering;text classification;world wide web;early decision
公開日期: 1-二月-2008
摘要: Real-time content analysis is typically a bottleneck in Web filtering. To accelerate the filtering process, this work presents a simple, but effective early decision algorithm that analyzes only part of the Web content. This algorithm can make the filtering decision, either to block or to pass the Web content, as soon as it is confident with a high probability that the content really belongs to a banned or an allowed category. Experiments show the algorithm needs to examine only around one-fourth of the Web content on average, while the accuracy remains fairly good: 89% for the banned content and 93% for the allowed content. This algorithm can complement other Web filtering approaches, such as URL blocking, to filter the Web content with high accuracy and efficiency. Text classification algorithms in other applications can also follow the principle of early decision to accelerate their applications.
URI: http://dx.doi.org/10.1093/ietisy/e91-d.2.251
http://hdl.handle.net/11536/9686
ISSN: 0916-8532
DOI: 10.1093/ietisy/e91-d.2.251
期刊: IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS
Volume: E91D
Issue: 2
起始頁: 251
結束頁: 257
顯示於類別:期刊論文


文件中的檔案:

  1. 000253655800012.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。