Full metadata record
DC FieldValueLanguage
dc.contributor.authorLiou, Yi-Fanen_US
dc.contributor.authorCharoenkwan, Phasiten_US
dc.contributor.authorSrinivasulu, Yerukala Sathipatien_US
dc.contributor.authorVasylenko, Tamaraen_US
dc.contributor.authorLai, Shih-Chungen_US
dc.contributor.authorLee, Hua-Chinen_US
dc.contributor.authorChen, Yi-Hsiungen_US
dc.contributor.authorHuang, Hui-Lingen_US
dc.contributor.authorHo, Shinn-Yingen_US
dc.date.accessioned2019-04-03T06:40:27Z-
dc.date.available2019-04-03T06:40:27Z-
dc.date.issued2014-12-08en_US
dc.identifier.issn1471-2105en_US
dc.identifier.urihttp://dx.doi.org/10.1186/1471-2105-15-S16-S4en_US
dc.identifier.urihttp://hdl.handle.net/11536/124077-
dc.description.abstractBackground: Heme binding proteins (HBPs) are metalloproteins that contain a heme ligand (an iron-porphyrin complex) as the prosthetic group. Several computational methods have been proposed to predict heme binding residues and thereby to understand the interactions between heme and its host proteins. However, few in silico methods for identifying HBPs have been proposed. Results: This work proposes a scoring card method (SCM) based method (named SCMHBP) for predicting and analyzing HBPs from sequences. A balanced dataset of 747 HBPs (selected using a Gene Ontology term GO: 0020037) and 747 non-HBPs (selected from 91,414 putative non-HBPs) with an identity of 25% was firstly established. Consequently, a set of scores that quantified the propensity of amino acids and dipeptides to be HBPs is estimated using SCM to maximize the predictive accuracy of SCMHBP. Finally, the informative physicochemical properties of 20 amino acids are identified by utilizing the estimated propensity scores to be used to categorize HBPs. The training and mean test accuracies of SCMHBP applied to three independent test datasets are 85.90% and 71.57%, respectively. SCMHBP performs well relative to comparison with such methods as support vector machine (SVM), decision tree J48, and Bayes classifiers. The putative non-HBPs with high sequence propensity scores are potential HBPs, which can be further validated by experimental confirmation. The propensity scores of individual amino acids and dipeptides are examined to elucidate the interactions between heme and its host proteins. The following characteristics of HBPs are derived from the propensity scores: 1) aromatic side chains are important to the effectiveness of specific HBP functions; 2) a hydrophobic environment is important in the interaction between heme and binding sites; and 3) the whole HBP has low flexibility whereas the heme binding residues are relatively flexible. Conclusions: SCMHBP yields knowledge that improves our understanding of HBPs rather than merely improves the prediction accuracy in predicting HBPs.en_US
dc.language.isoen_USen_US
dc.titleSCMHBP: prediction and analysis of heme binding proteins using propensity scores of dipeptidesen_US
dc.typeArticleen_US
dc.identifier.doi10.1186/1471-2105-15-S16-S4en_US
dc.identifier.journalBMC BIOINFORMATICSen_US
dc.citation.volume15en_US
dc.citation.spage0en_US
dc.citation.epage0en_US
dc.contributor.department生物科技學系zh_TW
dc.contributor.department生物資訊及系統生物研究所zh_TW
dc.contributor.departmentDepartment of Biological Science and Technologyen_US
dc.contributor.departmentInstitude of Bioinformatics and Systems Biologyen_US
dc.identifier.wosnumberWOS:000346168200005en_US
dc.citation.woscount8en_US
Appears in Collections:Articles


Files in This Item:

  1. e35f16230dd8c28055592fc7d125820d.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.