Empirical textual mining to protein entities recognition from PubMed corpus

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	Liang, T	en_US
dc.contributor.author	Shih, PK	en_US
dc.date.accessioned	2014-12-08T15:37:08Z	-
dc.date.available	2014-12-08T15:37:08Z	-
dc.date.issued	2005	en_US
dc.identifier.isbn	3-540-26031-5	en_US
dc.identifier.issn	0302-9743	en_US
dc.identifier.uri	http://hdl.handle.net/11536/25518	-
dc.description.abstract	Named Entity Recognition (NER) from biomedical literature is crucial in biomedical knowledge base automation. In this paper, both empirical rule and statistical approaches to protein entity recognition are presented and investigated on a general corpus GENIA 3.02p and a new domain-specific corpus SRC. Experimental results show the rules derived from SRC are useful though they are simpler and more general than the one used by other rule-based approaches. Meanwhile, a concise HMM-based model with rich set of features is presented and proved to be robust and competitive while comparing it to other successful hybrid models. Besides, the resolution of coordination variants common in entities recognition is addressed. By applying heuristic rules and clustering strategy, the presented resolver is proved to be feasible.	en_US
dc.language.iso	en_US	en_US
dc.title	Empirical textual mining to protein entities recognition from PubMed corpus	en_US
dc.type	Article; Proceedings Paper	en_US
dc.identifier.journal	NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS	en_US
dc.citation.volume	3513	en_US
dc.citation.spage	56	en_US
dc.citation.epage	66	en_US
dc.contributor.department	資訊工程學系	zh_TW
dc.contributor.department	Department of Computer Science	en_US
dc.identifier.wosnumber	WOS:000230413100006	-
顯示於類別：	會議論文