標題: | Incorporating Evolutionary Information and Functional Domains for Identifying RNA Splicing Factors in Humans |
作者: | Hsu, Justin Bo-Kai Bretana, Neil Arvin Lee, Tzong-Yi Huang, Hsien-Da 生物科技學系 生物資訊及系統生物研究所 Department of Biological Science and Technology Institude of Bioinformatics and Systems Biology |
公開日期: | 16-Nov-2011 |
摘要: | Regulation of pre-mRNA splicing is achieved through the interaction of RNA sequence elements and a variety of RNA-splicing related proteins (splicing factors). The splicing machinery in humans is not yet fully elucidated, partly because splicing factors in humans have not been exhaustively identified. Furthermore, experimental methods for splicing factor identification are time-consuming and lab-intensive. Although many computational methods have been proposed for the identification of RNA-binding proteins, there exists no development that focuses on the identification of RNA-splicing related proteins so far. Therefore, we are motivated to design a method that focuses on the identification of human splicing factors using experimentally verified splicing factors. The investigation of amino acid composition reveals that there are remarkable differences between splicing factors and non-splicing proteins. A support vector machine (SVM) is utilized to construct a predictive model, and the five-fold cross-validation evaluation indicates that the SVM model trained with amino acid composition could provide a promising accuracy (80.22%). Another basic feature, amino acid dipeptide composition, is also examined to yield a similar predictive performance to amino acid composition. In addition, this work presents that the incorporation of evolutionary information and domain information could improve the predictive performance. The constructed models have been demonstrated to effectively classify (73.65% accuracy) an independent data set of human splicing factors. The result of independent testing indicates that in silico identification could be a feasible means of conducting preliminary analyses of splicing factors and significantly reducing the number of potential targets that require further in vivo or in vitro confirmation. |
URI: | http://dx.doi.org/10.1371/journal.pone.0027567 http://hdl.handle.net/11536/14931 |
ISSN: | 1932-6203 |
DOI: | 10.1371/journal.pone.0027567 |
期刊: | PLOS ONE |
Volume: | 6 |
Issue: | 11 |
結束頁: | |
Appears in Collections: | Articles |
Files in This Item:
If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.