標題: Prediction of Protein Subcellular Localizations
作者: Yu, Chin-Sheng
Hwang, Jenn-Kang
生物科技學系
Department of Biological Science and Technology
公開日期: 2008
摘要: The support vector machine (SVM) method based on n-peptide composition (Yu et al, Proteins: Struct. Funct. Genet. 2003:50:531-536) is used to predict the subcellular localizations of proteins. For an unbiased assessment of the results, we apply our approach to two independent data sets: one set consisting of two parts (Reinhardt and Hubbard, Nucleic Acids Res. 1998; 26:2230-2236): the prokaryotic set includes 997 protein sequences in three categories and the eukaryotic set includes 2427. sequences in four localization categories; another set comprising 2191 proteins in 12 subcellular localizations (Chou and Cai, J. Biol. Chem. 2002; 277:45765-45769). Our approach provides excellent results for both data sets. For the first data set, our approach gives an overall prediction accuracy 93.2% for prokaryotic sequences, 88.1% for eukaryotic sequences. Our approach also yields significantly better Matthews correlation coefficient for each subcellular localization than the existing approaches. For the second data set, our approach achieves an overall prediction accuracy 83.2%, which is also around 10% higher than the best existing result. Our approaches should be valuable in the high throughput analysis of genomics and proteomics.
URI: http://dx.doi.org/10.1109/ISDA.2008.306
http://hdl.handle.net/11536/135089
ISBN: 978-0-7695-3382-7
DOI: 10.1109/ISDA.2008.306
期刊: ISDA 2008: EIGHTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, PROCEEDINGS
起始頁: 165
結束頁: +
Appears in Collections:Conferences Paper