標題: Multi-class named entities extraction from biomedical literature
作者: Liang, Tyne
Chen, Jian-Shin
資訊工程學系
Department of Computer Science
關鍵字: named entity extraction;biomedical literature;statistical model;term variant;classification
公開日期: 1-十一月-2006
摘要: With rapid growth of electronic literature in recent years, efficient named entities extraction becomes an indispensable part of knowledge base construction automation. In this paper an entity extraction system useful as biomedical knowledge acquisition was presented. Unlike most entity extraction systems which do not concern term variants, the proposed system was incorporated with a rule-based resolver to recover the full forms of those target entities from the coordination variants. The resolution approach was proved with GENIA Corpus 3.0 to be feasible by showing 88.51% recall and 57.04% precision. On the other hand, the kernel part of the system was based on Hidden Markov Model (HMMs) by setting appropriate set of input features extracted from training corpus. With various experiments on different corpora the proposed system achieved promising results at entity boundary identification and at classification as well.
URI: http://hdl.handle.net/11536/11644
ISSN: 1016-2364
期刊: JOURNAL OF INFORMATION SCIENCE AND ENGINEERING
Volume: 22
Issue: 6
起始頁: 1339
結束頁: 1353
顯示於類別:期刊論文


文件中的檔案:

  1. 000242243500003.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。