标题: | 利用重叠基因建构原核生物的基因体树之研究 On the Study of Constructing Genome Trees of Prokaryotes Based on Overlapping Genes |
作者: | 姜礼玮 Li-Wei Jiang 卢锦隆 Chin Lung Lu 生物资讯及系统生物研究所 |
关键字: | 生物资讯;演算法;基因体树;直系同源;重叠基因;bioinformatics;algorithm;genome tree;ortholog;overlapping gene |
公开日期: | 2007 |
摘要: | 随着DNA定序技术的进步,越来越多物种的完整基因体序列变得更容易取得。因此,藉由完整基因体来建构出物种之间的演化树,将有助于了解物种演化的亲属关系。除了以序列为主的方法之外,还有利用整个基因体基因内容和基因次序,这些都能被用来建构出更准确和稳定的演化树。然而已有文献指出,只利用基因内容或基因次序来建构微生物的基因体树可能是不合适的。为了克服这个问题,Luo所属的研究团队最近提出一个利用重叠基因的内容来建构出细菌演化树的新方法。所谓的重叠基因是指在染色体位置相邻的两个基因,它们的序列会部份或全部重叠。实际上,重叠基因在微生物的基因体上是非常普遍的,而且他们比非重叠基因在演化上是更具有保留性的,这意味着重叠基因在微生物中是比非重叠基因更适合当作建构物种演化关系的特征。事实上,物种的基因在演化过程中是会很容易地发生基因体的重组,这导致了即使在两个亲属关系很近的物种上,他们之间的直向同源基因的次序可能会不同,这当然也会造成他们之间的直向同源重叠基因的次序也会不同。这似乎意味着不仅是重叠基因的内容而且重叠基因的次序也应该被考虑用来建构原核生物的基因体树。因此,在这篇论文中,我们结合在物种整个基因体上重叠基因的内容与次序定义出一个新的衡量两个基因体之间距离的方法,我们称之为重叠基因距离。然后我们根据原核生物基因体两两之间的重叠基因距离并且使用UPGMA、NJ和FM的方法来建构出他们之间的基因体树。 根据上面所描述的方法,我们发展出一个网站伺服器的工具称之为OGtree,其可利用原核生物整个基因体之间的重叠基因距离建构出原核生物的基因体树。除此之外,我们也利用一些蛋白细菌的基因体来测试OGtree在建构基因体树的品质。相较于Luo所属的研究团队所建构出的演化树,我们OGtree所建构出来的基因体树与利用16s rRNA以及串接多个蛋白质序列所建构出来的演化树是相当一致的。这些结果已说明了我们的OGtree可以做为一个有用的工具来建构出更准确与更稳定的原核生物基因体树。 As more and more complete genomes of species are available, phylogenetic tree inference by comparing whole genome can be helpful for the reconstruction of evolutionary relationships of species. In addition to sequence-based phylogenomic approaches, methods based on whole genomes, like those based on gene content and gene orders, can be used to construct more precise and robust phylogenetic trees. However, it has been reported in the literature that the genome trees constructed only based on gene content or gene order may not be suitable for microbial genomes. To address these problems, Luo et al. [6, 7] have recently proposed an alternative way to reconstruct genome trees of bacteria using a measure based on the presence and absence of overlapping genes. The overlapping genes (OGs) are defined as adjacent genes whose coding sequences overlap partially or entirely. Actually, OGs are ubiquitous in microbial genomes and more conserved between species than non-overlapping genes, implying that OGs can serve as better phylogenetic characters than non-overlapping genes for reconstructing the evolutionary relationships among microbial genomes. In fact, during evolutionary process, species genomes are subject to genome rearrangements that alter the order and orientation of genes on the genomes, leading to that the orders of orthologous genes, as well as the ones of orthologous OG pairs certainly, even between two closely related species may not be conserved. This suggests that not only OG content but also orthologous OG order should be considered to reconstruct the genome trees of prokaryotic species. Therefore, in this thesis, we define a new distance measure, called as overlapping-gene distance, between two genomes based on a combination of OG content and OG order in their whole genomes. We then use UPGMA, as well as NJ and FM (Fitch-Margolias), to build the genome tree of prokaryotic genomes according to their pairwise OG distance. Based on the method described above, we have implemented a web-based tool, called OGtree, for constructing the genome trees of prokaryotes based on OG distance between prokaryotic complete genomes. In addition, we have tested our OGtree on several Proteobacteria complete genomes to assess its quality of genome tree reconstruction. Compared with the phylogenetic trees produced by Luo et al. [6, 7], the genome trees constructed by our OGtree are quite consistent with those reference trees that were reconstructed based on 16S rRNAs as well as concatenation of multiple proteins. All these results have suggested that our OGtree can serve as a useful tool for constructing more precise and robust genome trees for prokaryotic genomes. |
URI: | http://140.113.39.130/cdrfb3/record/nctu/#GT009551509 http://hdl.handle.net/11536/39433 |
显示于类别: | Thesis |
文件中的档案:
If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.