完整後設資料紀錄
DC 欄位語言
dc.contributor.authorHuang, Wen-Linen_US
dc.contributor.authorTung, Chun-Weien_US
dc.contributor.authorHo, Shih-Wenen_US
dc.contributor.authorHo, Shinn-Yingen_US
dc.date.accessioned2017-04-21T06:49:15Z-
dc.date.available2017-04-21T06:49:15Z-
dc.date.issued2008en_US
dc.identifier.isbn978-1-4244-1778-0en_US
dc.identifier.urihttp://hdl.handle.net/11536/135063-
dc.description.abstractGene Ontology (GO) annotation is a controlled vocabulary of terms and phrases describing the function of genes and gene products, which has been succeeded in predicting subcellualr and subnuclear localization. Generally, each gene product is annotated by very few GO terms from more than 25,000 annotations available at present. How to represent a protein sequence using GO terms as features plays an important role in designing prediction systems for protein subnuclear localization. Our previous work ProLoc-GO can select a small number m out of a large number n GO terms, where m n. However, its off-line time for training is large up to several days even though running on high speedily PC clusters. Therefore, this study proposes an efficient system (ProLoc-rGO) by using the decision tree method to speedily mine m informative GO terms and acquire interpretable rule-based knowledge for predicting subnuclear localization. The ProLoc-rGO performing on SNL9_80 (714 proteins in nine compartments with <80 identity) can mine m=17 informative GO terms, 17 interpretable rules and yield training and test accuracies of 84.9% and 78.2%. For comparison, an accuracy 82.6% (Matthews correlation coefficient (MCC) = 0.711) for ProLoc-rGO performed on SNL9_80 (714 proteins in nine compartments with <80 identity) is obtained, which is better than 67.4% (MCC = 0.50) for Nuc-PLoc that fuses the pseudo-amino acid composition of a protein and its position-specific scoring matrix.en_US
dc.language.isoen_USen_US
dc.titleProLoc-rGO: Using rule-based knowledge with Gene Ontology terms for prediction of protein subnuclear localizationen_US
dc.typeProceedings Paperen_US
dc.identifier.journal2008 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGYen_US
dc.citation.spage73en_US
dc.citation.epage+en_US
dc.contributor.department生物科技學系zh_TW
dc.contributor.department生物資訊及系統生物研究所zh_TW
dc.contributor.departmentDepartment of Biological Science and Technologyen_US
dc.contributor.departmentInstitude of Bioinformatics and Systems Biologyen_US
dc.identifier.wosnumberWOS:000263713400011en_US
dc.citation.woscount0en_US
顯示於類別:會議論文