標題: Biogenesis mechanisms of circular RNA can be categorized through feature extraction of a machine learning model
作者: Liu, Chengyu
Liu, Yu-Chen
Huang, Hsien-Da
Wang, Wei
生物資訊及系統生物研究所
Institude of Bioinformatics and Systems Biology
公開日期: 1-Dec-2019
摘要: Motivation: In recent years, multiple circular RNAs (circRNA) biogenesis mechanisms have been discovered. Although each reported mechanism has been experimentally verified in different circRNAs, no single biogenesis mechanism has been proposed that can universally explain the biogenesis of all tens of thousands of discovered circRNAs. Under the hypothesis that human circRNAs can be categorized according to different biogenesis mechanisms, we designed a contextual regression model trained to predict the formation of circular RNA from a random genomic locus on human genome, with potential biogenesis factors of circular RNA as the features of the training data. Results: After achieving high prediction accuracy, we found through the feature extraction technique that the examined human circRNAs can be categorized into seven subgroups, according to the presence of the following sequence features: RNA editing sites, simple repeat sequences, self-chains, RNA binding protein binding sites and CpG islands within the flanking regions of the circular RNA back-spliced junction sites. These results support all of the previously reported biogenesis mechanisms of circRNA and solidify the idea that multiple biogenesis mechanisms co-exist for different subset of human circRNAs. Furthermore, we uncover a potential new links between circRNA biogenesis and flanking CpG island. We have also identified RNA binding proteins putatively correlated with circRNA biogenesis.
URI: http://dx.doi.org/10.1093/bioinformatics/btz705
http://hdl.handle.net/11536/153524
ISSN: 1367-4803
DOI: 10.1093/bioinformatics/btz705
期刊: BIOINFORMATICS
Volume: 35
Issue: 23
起始頁: 4867
結束頁: 4870
Appears in Collections:Articles