G-FQZip: Lossless Reference-Based Compression of FASTQ Files Using GPUs

  • Cong Peng
  • Qingjin Deng
  • Zhian Huang
  • Yiwen Sun*
  • Zexuan Zhu

* Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

The exponential growth of high-throughput sequencing data calls for efficient, specialized compression methods to address the challenges of storing and transmitting such data. In this work, we develop G-FQZip, a GPU version of a lossless reference-based compression method, by introducing GPU-based arithmetic coding, a template matching approach, and a parallel light-weight mapping model. Comparison experiments demonstrate that G-FQZip improves (de)compression speed while maintaining comparable compression ratios. A follow-up evaluation further demonstrates the efficiency of the GPU-based arithmetic coding and the template matching approach.

Original language: English
Title of host publication: Proceedings - 13th International Conference on Computational Intelligence and Security, CIS 2017
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 553-556
Number of pages: 4
ISBN (Electronic): 9781538648223
State: Published - 2 Jul 2017
Externally published: Yes
Event: 13th International Conference on Computational Intelligence and Security, CIS 2017 - Hong Kong, Hong Kong
Duration: 15 Dec 2017 to 18 Dec 2017

Publication series

Name: Proceedings - 13th International Conference on Computational Intelligence and Security, CIS 2017
Volume: 2018-January

Conference

Conference: 13th International Conference on Computational Intelligence and Security, CIS 2017
Country/Territory: Hong Kong
City: Hong Kong
Period: 15/12/17 to 18/12/17

Keywords

  • GPU acceleration
  • High-throughput sequencing
  • Lossless compression
  • Reference-based DNA sequence compression
