Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Liang, T | en_US |
dc.contributor.author | Shih, PK | en_US |
dc.date.accessioned | 2014-12-08T15:37:08Z | - |
dc.date.available | 2014-12-08T15:37:08Z | - |
dc.date.issued | 2005 | en_US |
dc.identifier.isbn | 3-540-26031-5 | en_US |
dc.identifier.issn | 0302-9743 | en_US |
dc.identifier.uri | http://hdl.handle.net/11536/25518 | - |
dc.description.abstract | Named Entity Recognition (NER) in biomedical literature is crucial to biomedical knowledge base automation. In this paper, both empirical rule-based and statistical approaches to protein entity recognition are presented and investigated on a general corpus, GENIA 3.02p, and a new domain-specific corpus, SRC. Experimental results show that the rules derived from SRC are useful even though they are simpler and more general than those used by other rule-based approaches. In addition, a concise HMM-based model with a rich set of features is presented and shown to be robust and competitive when compared with other successful hybrid models. The resolution of coordination variants, which are common in entity recognition, is also addressed. The presented resolver, which applies heuristic rules and a clustering strategy, is shown to be feasible. | en_US |
dc.language.iso | en_US | en_US |
dc.title | Empirical textual mining to protein entities recognition from PubMed corpus | en_US |
dc.type | Article; Proceedings Paper | en_US |
dc.identifier.journal | NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS | en_US |
dc.citation.volume | 3513 | en_US |
dc.citation.spage | 56 | en_US |
dc.citation.epage | 66 | en_US |
dc.contributor.department | 資訊工程學系 | zh_TW |
dc.contributor.department | Department of Computer Science | en_US |
dc.identifier.wosnumber | WOS:000230413100006 | - |
Appears in Collections: | Conferences Paper |