Title: DeF-GPU: Efficient and effective deletions finding in hepatitis B viral genomic DNA using a GPU architecture
Authors: Cheng, Chun-Pei
Lan, Kuo-Lun
Liu, Wen-Chun
Chang, Ting-Tsung
Tseng, Vincent S.
Department of Computer Science
Keywords: Big data mining;Deletion finding;Genome;Graphics processing unit;Hepatitis B virus;Next-generation sequencing
Issue Date: 1-Dec-2016
Abstract: Hepatitis B virus (HBV) infection is strongly associated with an increased risk of liver diseases such as cirrhosis and hepatocellular carcinoma (HCC). Many lines of evidence suggest that deletions in HBV genomic DNA are closely associated with HBV activity through the interplay between the release of aberrant viral proteins and the human immune system. Finding deletions in HBV whole-genome sequences is therefore an important problem, although mining such large and complex biological data is challenging. Some next-generation sequencing (NGS) tools have recently been designed to identify structural variations such as insertions and deletions, but they have generally been validated on human sequences. This design may not be suitable for viruses, because the species differ. We propose a graphics processing unit (GPU)-based data mining method called DeF-GPU to identify HBV deletions efficiently and precisely from large NGS datasets, which generally contain millions of reads. To fit the single instruction, multiple data (SIMD) model, the sequencing reads are treated as the multiple data and the deletion-finding procedure as the single instruction. We use Compute Unified Device Architecture (CUDA) to parallelize the procedure, and validate DeF-GPU on five synthetic datasets and one real dataset. Our results suggest that DeF-GPU outperforms the commonly used method Pindel and can exactly identify the deletions in our ground truth within a few seconds. The source code and other related materials are available at https://sourceforge.net/projects/defgpu/. © 2016 Elsevier Inc. All rights reserved.
URI: http://dx.doi.org/10.1016/j.ymeth.2016.07.020
http://hdl.handle.net/11536/132767
ISSN: 1046-2023
DOI: 10.1016/j.ymeth.2016.07.020
Journal: METHODS
Volume: 111
Start page: 56
End page: 63
Appears in Collections: Journal Articles