Full metadata record
DC FieldValueLanguage
dc.contributor.authorHo, Shinn-Yingen_US
dc.contributor.authorYu, Fu-Chiehen_US
dc.contributor.authorChang, Chia-Yunen_US
dc.contributor.authorHuang, Hui-Lingen_US
dc.date.accessioned2019-04-02T06:01:00Z-
dc.date.available2019-04-02T06:01:00Z-
dc.date.issued2007-07-01en_US
dc.identifier.issn0303-2647en_US
dc.identifier.urihttp://dx.doi.org/10.1016/j.biosystems.2006.08.007en_US
dc.identifier.urihttp://hdl.handle.net/11536/149202-
dc.description.abstractIn this paper, we investigate the design of accurate predictors for DNA-binding sites in proteins from amino acid sequences. As a result, we propose a hybrid method using support vector machine (SVM) in conjunction with evolutionary information of amino acid sequences in terms of their position-specific scoring matrices (PSSMs) for prediction of DNA-binding sites. Considering the numbers of binding and non-binding residues in proteins are significantly unequal, two additional weights as well as SVM parameters are analyzed and adopted to maximize net prediction (NP, an average of sensitivity and specificity) accuracy. To evaluate the generalization ability of the proposed method SVM-PSSM, a DNA-binding dataset PDC-59 consisting of 59 protein chains with low sequence identity on each other is additionally established. The SVM-based method using the same six-fold cross-validation procedure and PSSM features has NP = 80.15% for the training dataset PDNA-62 and NP = 69.54% for the test dataset PDC-59, which are much better than the existing neural network-based method by increasing the NP values for training and test accuracies up to 13.45% and 16.53%, respectively. Simulation results reveal that SVM-PSSM performs well in predicting DNA-binding sites of novel proteins from amino acid sequences. (c) 2006 Elsevier Ireland Ltd. All rights reserved.en_US
dc.language.isoen_USen_US
dc.subjectamino acid sequenceen_US
dc.subjectDNA-binding predictionen_US
dc.subjectposition-specific scoring matrices (PSSM)en_US
dc.subjectproteinen_US
dc.subjectsupport vector machine (SVM)en_US
dc.titleDesign of accurate predictors for DNA-binding sites in proteins using hybrid SVM-PSSM methoden_US
dc.typeArticleen_US
dc.identifier.doi10.1016/j.biosystems.2006.08.007en_US
dc.identifier.journalBIOSYSTEMSen_US
dc.citation.volume90en_US
dc.citation.spage234en_US
dc.citation.epage241en_US
dc.contributor.department生物科技學系zh_TW
dc.contributor.department生物資訊及系統生物研究所zh_TW
dc.contributor.departmentDepartment of Biological Science and Technologyen_US
dc.contributor.departmentInstitude of Bioinformatics and Systems Biologyen_US
dc.identifier.wosnumberWOS:000248966800021en_US
dc.citation.woscount27en_US
Appears in Collections:Articles