標題: Designing predictors of halophilic and non-halophilic proteins using support vector machines
作者: Huang, Hui-Ling
Charoenkwan, Phasit
Srinivasulu, Yerukala Sathipati
Lee, Hua-Chin
Ho, Shinn-Ying
生物資訊及系統生物研究所
Institude of Bioinformatics and Systems Biology
關鍵字: Halophilic proteins;SVM;Physicochemical properties;Genetic algorithms
公開日期: 2013
摘要: Finding the molecular features causes the halophilicity in the halostable organisms is helpful to understand the halophilic adaption. In this study, we proposed a prediction method for halophilic proteins by using a machine learning method. The stages of this study are six-fold. First, we establish a non-redundant dataset of the halophilic proteins, collected from NCBI, Uniprotkb and EMBL-EBI databases. The dataset consists of 245 positive and negative proteins with sequence identity < 25%. Second, the protein sequences are represented by three types of feature vector sets which include amino acid composition, dipeptide composition, and physicochemical properties. Third, we propose three classifiers based on support vector machine (SVM) to classify the halophilic proteins and non-halophilic proteins. Fourth, the independent test accuracies of the three efficient classifiers are larger than 83%. Fifth, an inheritable bi-objective combinatory genetic algorithm is utilized to select a set of 11 physicochemical properties (PCPs). Sixth, these abundant amino acids, high different dipeptides (amino acid pair) and 11 informative PCPs can support to analyze the halophilic and non-halophilic proteins.
URI: http://hdl.handle.net/11536/24151
ISBN: 978-1-4673-5875-0
期刊: PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB)
起始頁: 230
結束頁: 237
顯示於類別:會議論文