标题: | PredCRP: predicting and analysing the regulatory roles of CRP from its binding sites in Escherichia coli |
作者: | Tsai, Ming-Ju Wang, Jyun-Rong Yang, Chi-Dung Kao, Kuo-Ching Huang, Wen-Lin Huang, Hsi-Yuan Tseng, Ching-Ping Huang, Hsien-Da Ho, Shinn-Ying 生物科技学系 生物资讯及系统生物研究所 Department of Biological Science and Technology Institude of Bioinformatics and Systems Biology |
公开日期: | 17-一月-2018 |
摘要: | Cyclic AMP receptor protein (CRP), a global regulator in Escherichia coli, regulates more than 180 genes via two roles: activation and repression. Few methods are available for predicting the regulatory roles from the binding sites of transcription factors. This work proposes an accurate method PredCRP to derive an optimised model (named PredCRP-model) and a set of four interpretable rules (named PredCRP-ruleset) for predicting and analysing the regulatory roles of CRP from sequences of CRP-binding sites. A dataset consisting of 169 CRP-binding sites with regulatory roles strongly supported by evidence was compiled. The PredCRP-model, using 12 informative features of CRP-binding sites, and cooperating with a support vector machine achieved a training and test accuracy of 0.98 and 0.93, respectively. PredCRP-ruleset has two activation rules and two repression rules derived using the 12 features and the decision tree method C4.5. This work further screened and identified 23 previously unobserved regulatory interactions in Escherichia coli. Using quantitative PCR for validation, PredCRP-model and PredCRP-ruleset achieved a test accuracy of 0.96 (=22/23) and 0.91 (=21/23), respectively. The proposed method is suitable for designing predictors for regulatory roles of all global regulators in Escherichia coli. PredCRP can be accessed at https://github.com/NctuICLab/PredCRP. |
URI: | http://dx.doi.org/10.1038/s41598-017-18648-5 http://hdl.handle.net/11536/144376 |
ISSN: | 2045-2322 |
DOI: | 10.1038/s41598-017-18648-5 |
期刊: | SCIENTIFIC REPORTS |
Volume: | 8 |
显示于类别: | Articles |