Full metadata record
DC FieldValueLanguage
dc.contributor.authorTsai, Keng-Changen_US
dc.contributor.authorJian, Jhih-Weien_US
dc.contributor.authorYang, Ei-Wenen_US
dc.contributor.authorHsu, Po-Chiangen_US
dc.contributor.authorPeng, Hung-Pinen_US
dc.contributor.authorChen, Ching-Taien_US
dc.contributor.authorChen, Jun-Boen_US
dc.contributor.authorChang, Jeng-Yihen_US
dc.contributor.authorHsu, Wen-Lianen_US
dc.contributor.authorYang, An-Sueien_US
dc.date.accessioned2014-12-08T15:28:04Z-
dc.date.available2014-12-08T15:28:04Z-
dc.date.issued2012-07-25en_US
dc.identifier.issn1932-6203en_US
dc.identifier.urihttp://dx.doi.org/10.1371/journal.pone.0040846en_US
dc.identifier.urihttp://hdl.handle.net/11536/20327-
dc.description.abstractNon-covalent protein-carbohydrate interactions mediate molecular targeting in many biological processes. Prediction of non-covalent carbohydrate binding sites on protein surfaces not only provides insights into the functions of the query proteins; information on key carbohydrate-binding residues could suggest site-directed mutagenesis experiments, design therapeutics targeting carbohydrate-binding proteins, and provide guidance in engineering protein-carbohydrate interactions. In this work, we show that non-covalent carbohydrate binding sites on protein surfaces can be predicted with relatively high accuracy when the query protein structures are known. The prediction capabilities were based on a novel encoding scheme of the three-dimensional probability density maps describing the distributions of 36 non-covalent interacting atom types around protein surfaces. One machine learning model was trained for each of the 30 protein atom types. The machine learning algorithms predicted tentative carbohydrate binding sites on query proteins by recognizing the characteristic interacting atom distribution patterns specific for carbohydrate binding sites from known protein structures. The prediction results for all protein atom types were integrated into surface patches as tentative carbohydrate binding sites based on normalized prediction confidence level. The prediction capabilities of the predictors were benchmarked by a 10-fold cross validation on 497 non-redundant proteins with known carbohydrate binding sites. The predictors were further tested on an independent test set with 108 proteins. The residue-based Matthews correlation coefficient (MCC) for the independent test was 0.45, with prediction precision and sensitivity (or recall) of 0.45 and 0.49 respectively. In addition, 111 unbound carbohydrate-binding protein structures for which the structures were determined in the absence of the carbohydrate ligands were predicted with the trained predictors. The overall prediction MCC was 0.49. Independent tests on anti-carbohydrate antibodies showed that the carbohydrate antigen binding sites were predicted with comparable accuracy. These results demonstrate that the predictors are among the best in carbohydrate binding site predictions to date.en_US
dc.language.isoen_USen_US
dc.titlePrediction of Carbohydrate Binding Sites on Protein Surfaces with 3-Dimensional Probability Density Distributions of Interacting Atomsen_US
dc.typeArticleen_US
dc.identifier.doi10.1371/journal.pone.0040846en_US
dc.identifier.journalPLOS ONEen_US
dc.citation.volume7en_US
dc.citation.issue7en_US
dc.citation.epageen_US
dc.contributor.department生物資訊及系統生物研究所zh_TW
dc.contributor.departmentInstitude of Bioinformatics and Systems Biologyen_US
dc.identifier.wosnumberWOS:000306806600025-
dc.citation.woscount5-
Appears in Collections:Articles


Files in This Item:

  1. 000306806600025.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.