Next Generation Sequencing Data Management:Post-Sequencing Data Analysis Pipeline

Abstract

We present a platform independant data analysis for use with the next generation sequencing (NGS) technology. This pipeline was developed by Eureka Genomics in collaboration with the University of Houston's Bioinformatics lab.

The key features of the pipeline include:

  • Read quality estimation and filtration;
  • Condensed representation of any given set of reads making data manipulation process simple and easy;
  • Mapping (aligning) reads with mismatches (insertions, deletions and subsitituions at any position and in any combination);
  • No limitation on the length and number of sequences;
  • Automatic detection of the genomic variation (SNPs, InDel).

The key advantages include:

    (a) ability to incorporate the quality of individual reads in the analysis;

    (b) highly efficient in time: it takes only 25 min on a desktop PC to map 200Mb of reads to a 5 Mb long reference microbial genome with one mismatch;and

    (c) minimal hardware requirements.

Read more about the Next Generation Sequencing Data Management: Post-Sequencing Data Analysis Pipeline (pdf)

 

 

© 2010, 2009, 2008 Eureka Genomics Corporation | 410 Pierce Street, Ste 307 Houston TX 77002
750 Alfred Nobel Drive | Hercules, CA 94547 | Phone: (510) 964-0461 | Fax: (510) 964-9066 | Email: contact@eurekagenomics.com