標題: | An expert system to predict protein thermostability using decision tree |
作者: | Wu, Li-Cheng Lee, Jian-Xin Huang, Hsien-Da Liu, Baw-Juine Horng, Jorng-Tzong 生物資訊及系統生物研究所 Institude of Bioinformatics and Systems Biology |
關鍵字: | Expert system;Machine learning;Bioinformatics;Protein thermostability;Decision Tree |
公開日期: | 1-Jul-2009 |
摘要: | Protein thermostability information is closely linked to commercial production of many biomaterials. Recent developments have shown that amino acid composition, special sequence patterns and hydrogen bonds, disulfide bonds, salt bridges and so on are of considerable importance to thermostability. In this study, we present a system to integrate these various factors that predict protein thermostability. In this study, the features of proteins in the PGTdb are analyzed. We consider both structure and sequence features and correlation coefficients are incorporated into the feature selection algorithm. Machine learning algorithms are then used to develop identification systems and performances between the different algorithms are compared. In this research, two features, (E + F + M + R)/residue and charged/non-charged, are found to be critical to the thermostability of proteins. Although the sequence and structural models achieve a higher accuracy, sequence-only models provides sufficient accuracy for sequence-only thermostability prediction. (C) 2008 Elsevier Ltd. All rights reserved. |
URI: | http://dx.doi.org/10.1016/j.eswa.2008.12.020 http://hdl.handle.net/11536/7028 |
ISSN: | 0957-4174 |
DOI: | 10.1016/j.eswa.2008.12.020 |
期刊: | EXPERT SYSTEMS WITH APPLICATIONS |
Volume: | 36 |
Issue: | 5 |
起始頁: | 9007 |
結束頁: | 9014 |
Appears in Collections: | Articles |
Files in This Item:
If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.