标题: PredCRP: predicting and analysing the regulatory roles of CRP from its binding sites in Escherichia coli
作者: Tsai, Ming-Ju
Wang, Jyun-Rong
Yang, Chi-Dung
Kao, Kuo-Ching
Huang, Wen-Lin
Huang, Hsi-Yuan
Tseng, Ching-Ping
Huang, Hsien-Da
Ho, Shinn-Ying
生物科技学系
生物资讯及系统生物研究所
Department of Biological Science and Technology
Institude of Bioinformatics and Systems Biology
公开日期: 17-一月-2018
摘要: Cyclic AMP receptor protein (CRP), a global regulator in Escherichia coli, regulates more than 180 genes via two roles: activation and repression. Few methods are available for predicting the regulatory roles from the binding sites of transcription factors. This work proposes an accurate method PredCRP to derive an optimised model (named PredCRP-model) and a set of four interpretable rules (named PredCRP-ruleset) for predicting and analysing the regulatory roles of CRP from sequences of CRP-binding sites. A dataset consisting of 169 CRP-binding sites with regulatory roles strongly supported by evidence was compiled. The PredCRP-model, using 12 informative features of CRP-binding sites, and cooperating with a support vector machine achieved a training and test accuracy of 0.98 and 0.93, respectively. PredCRP-ruleset has two activation rules and two repression rules derived using the 12 features and the decision tree method C4.5. This work further screened and identified 23 previously unobserved regulatory interactions in Escherichia coli. Using quantitative PCR for validation, PredCRP-model and PredCRP-ruleset achieved a test accuracy of 0.96 (=22/23) and 0.91 (=21/23), respectively. The proposed method is suitable for designing predictors for regulatory roles of all global regulators in Escherichia coli. PredCRP can be accessed at https://github.com/NctuICLab/PredCRP.
URI: http://dx.doi.org/10.1038/s41598-017-18648-5
http://hdl.handle.net/11536/144376
ISSN: 2045-2322
DOI: 10.1038/s41598-017-18648-5
期刊: SCIENTIFIC REPORTS
Volume: 8
显示于类别:Articles