Full metadata record
DC FieldValueLanguage
dc.contributor.authorHuang, Wen-Linen_US
dc.contributor.authorTung, Chun-Weien_US
dc.contributor.authorHuang, Hui-Lingen_US
dc.contributor.authorHo, Shinn-Yingen_US
dc.date.accessioned2014-12-08T15:08:18Z-
dc.date.available2014-12-08T15:08:18Z-
dc.date.issued2009-11-01en_US
dc.identifier.issn0303-2647en_US
dc.identifier.urihttp://dx.doi.org/10.1016/j.biosystems.2009.06.007en_US
dc.identifier.urihttp://hdl.handle.net/11536/6448-
dc.description.abstractThe nucleus guides life processes of cells. Many of the nuclear proteins participating in the life processes tend to concentrate on subnuclear compartments. The subnuclear localization of nuclear proteins is hence important for deeply understanding the construction and functions of the nucleus. Recently, Gene Ontology (GO) annotation has been used for prediction of subnuclear localization. However, the effective use of GO terms in solving sequence-based prediction problems remains challenging, especially when query protein sequences have no accession number or annotated GO term. This study obtains homologies of query proteins with known accession numbers using BLAST to retrieve GO terms for sequence-based subnuclear localization prediction. A prediction method PGAC, which involves mining informative GO terms associated with amino acid composition features, is proposed to design a support vector machine-based classifier. PGAC yields 55 informative GO terms with training and test accuracies of 85.7% and 76.3%, respectively, using a data set SNL-35 (561 proteins in 9 localizations) with 35% sequence identity. Upon comparison with Nuc-PLoc, which combines amphiphilic pseudo amino acid composition of a protein with its position-specific scoring matrix, PGAC using the data set SNL_80 yields a leave-one-out cross-validation accuracy of 81.1%, which is better than that of Nuc-PLoc, 67.4%. Experimental results show that the set of informative GO terms are effective features for protein subnuclear localization. The prediction server based on PGAC has been implemented at http://iclab.life.nctu.edu.tw/prolocgac. (C) 2009 Elsevier Ireland Ltd. All rights reserved.en_US
dc.language.isoen_USen_US
dc.subjectGene Ontologyen_US
dc.subjectSubnuclear localizationen_US
dc.subjectAmino acid compositionen_US
dc.titlePredicting protein subnuclear localization using GO-amino-acid composition featuresen_US
dc.typeArticleen_US
dc.identifier.doi10.1016/j.biosystems.2009.06.007en_US
dc.identifier.journalBIOSYSTEMSen_US
dc.citation.volume98en_US
dc.citation.issue2en_US
dc.citation.spage73en_US
dc.citation.epage79en_US
dc.contributor.department生物科技學系zh_TW
dc.contributor.department生物資訊及系統生物研究所zh_TW
dc.contributor.departmentDepartment of Biological Science and Technologyen_US
dc.contributor.departmentInstitude of Bioinformatics and Systems Biologyen_US
dc.identifier.wosnumberWOS:000271554900003-
dc.citation.woscount13-
Appears in Collections:Articles


Files in This Item:

  1. 000271554900003.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.