Eureka Genomics Technologies

 

Eureka Genomics' core technology was developed at the University of Houston Bioinformatics lab over the past seven years. This research was supported by the NIH, the Department of Homeland Security Science and Technology Directorate, the Texas Learning and Computational Center and the W. M. Keck Foundation.

With our novel algorithms (PDF), data structures and genomic data collections, Eureka Genomics can:

1. Count each appearance of each 6-100+ nucleotides long subsequence in genome of any size in reasonable time and store it in specially designed data structures.

2. Do algebra on such data structures in seconds (bacterial genomes) or minutes (human genomes).

3. Count each appearance of each subsequence that may appear from each sequence in genome by any combination of 1, 2, 3, and 4+ mismatches, including any combination of insertions, deletions, and substitutions.

4. Identify average distance of each target genome from the 'background' such as human DNA + human SNPs + human ESTs.

5. Identify location of each potential signature in genome and average distance of each region of the genome from the 'background'.

6. Identify the minimal combination of background-blind (2+ mismatches away) signatures (probes or primer pairs) needed to identify any set of targets.

7. Identify the minimal combination of background-blind (2+ mismatches away) signatures (probes or primer pairs) needed to identify any subset (cluster) of targets.

Eureka Genomics' novel algorithms and data structures are especially effective for the analysis of large sets of short genomic subsequences. In contrast to heuristic based approaches, such as BLAST and GMAP, Eureka Genomics' proprietary computational technology is designed to perform rigorous analysis (no missing cases) of not only all subsequences present in any sequenced genome (of any size), but also, of all subsequences present in this genome with 1, or 2, or 3 or 4 mismatches (all possible combinations of insertions, deletions and substitutions in all possible positions).

The exceptional efficiency of the Eureka Genomics' software and its ability to manipulate huge amounts of genomic data opens a unique opportunity to perform sophisticated analysis of a large number of genomic sequences (animals, plants, insects and thousands of bacteria and viruses) in conjunction with the data produced by Next Generation Sequencing technology.

For licensing opportunities outside of the US, please contact us.

 

 

© 2011-2008 Eureka Genomics Corporation | 410 Pierce Street, Ste 307 Houston TX 77002
750 Alfred Nobel Drive | Hercules, CA 94547 | Phone: (510) 964-0461 | Fax: (510) 964-9705 | Email: contact@eurekagenomics.com