Full metadata record
DC FieldValueLanguage
dc.contributor.authorHuang, Hui-Lingen_US
dc.date.accessioned2014-12-08T15:36:23Z-
dc.date.available2014-12-08T15:36:23Z-
dc.date.issued2014-05-14en_US
dc.identifier.issn1932-6203en_US
dc.identifier.urihttp://dx.doi.org/10.1371/journal.pone.0097158en_US
dc.identifier.urihttp://hdl.handle.net/11536/24717-
dc.description.abstractBioluminescent proteins (BLPs) are a class of proteins with various mechanisms of light emission such as bioluminescence and fluorescence from luminous organisms. While valuable for commercial and medical applications, identification of BLPs, including luciferases and fluorescent proteins (FPs), is rather challenging, owing to their high variety of protein sequences. Moreover, characterization of BLPs facilitates mutagenesis analysis to enhance bioluminescence and fluorescence. Therefore, this study proposes a novel methodological approach to estimating the propensity scores of 400 dipeptides and 20 amino acids in order to design two prediction methods and characterize BLPs based on a scoring card method (SCM). The SCMBLP method for predicting BLPs achieves an accuracy of 90.83% for 10-fold cross-validation higher than existing support vector machine based methods and a test accuracy of 82.85%. A dataset consisting of 269 luciferases and 216 FPs is also established to design the SCMLFP prediction method, which achieves training and test accuracies of 97.10% and 96.28%, respectively. Additionally, four informative physicochemical properties of 20 amino acids are identified using the estimated propensity scores to characterize BLPs as follows: 1) high transfer free energy from inside to the protein surface, 2) high occurrence frequency of residues in the transmembrane regions of the protein, 3) large hydrophobicity scale from the native protein structure, and 4) high correlation coefficient (R = 0.921) between the amino acid compositions of BLPs and integral membrane proteins. Further analyzing BLPs reveals that luciferases have a larger value of R (0.937) than FPs (0.635), suggesting that luciferases tend to locate near the cell membrane location rather than FPs for convenient receipt of extracellular ions. Importantly, the propensity scores of dipeptides and amino acids and the identified properties facilitate efforts to predict, characterize, and apply BLPs, including luciferases, photoproteins, and FPs. The web server is available at http://iclab.life.nctu.edu.tw/SCMBLP/index.html.en_US
dc.language.isoen_USen_US
dc.titlePropensity Scores for Prediction and Characterization of Bioluminescent Proteins from Sequencesen_US
dc.typeArticleen_US
dc.identifier.doi10.1371/journal.pone.0097158en_US
dc.identifier.journalPLOS ONEen_US
dc.citation.volume9en_US
dc.citation.issue5en_US
dc.citation.epageen_US
dc.contributor.department生物科技學系zh_TW
dc.contributor.department生物資訊及系統生物研究所zh_TW
dc.contributor.departmentDepartment of Biological Science and Technologyen_US
dc.contributor.departmentInstitude of Bioinformatics and Systems Biologyen_US
dc.identifier.wosnumberWOS:000336857400060-
dc.citation.woscount1-
Appears in Collections:Articles


Files in This Item:

  1. 000336857400060.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.