Bioinformatics technology
Eureka Genomics' core technology was developed at
the University of Houston Bioinformatics lab over the past seven years.
This research was supported by the NIH, the Department of Homeland
Security Science and Technology Directorate, the Texas Learning and
Computational Center and the W. M. Keck Foundation.
With our novel algorithms (PDF),
data structures and genomic data collections, Eureka Genomics can:
- Count each appearance of each 6-100+ nucleotides long subsequence in genome of any size in reasonable time and store it in specially designed data structures.
- Do algebra on such data structures in seconds (bacterial genomes) or minutes (human genomes).
- Count each appearance of each subsequence that may appear from each sequence in genome by any combination of 1, 2, 3, and 4+ mismatches, including any combination of insertions, deletions, and substitutions.
- Identify average distance of each target genome from the 'background' such as human DNA + human SNPs + human ESTs.
- Identify location of each potential signature in genome and average distance of each region of the genome from the 'background'.
- Identify the minimal combination of background-blind (2+ mismatches away) signatures (probes or primer pairs) needed to identify any set of targets.
- Identify the minimal combination of background-blind (2+ mismatches away) signatures (probes or primer pairs) needed to identify any subset (cluster) of targets.
Eureka Genomics' novel algorithms and data structures are especially
effective for the analysis of large sets of short genomic subsequences. In
contrast to heuristic based approaches, such as BLAST and GMAP, Eureka
Genomics' proprietary computational technology is designed to perform
rigorous analysis (no missing cases) of not only all subsequences present
in any sequenced genome (of any size), but also, of all subsequences
present in this genome with 1, or 2, or 3 or 4 mismatches (all possible
combinations of insertions, deletions and substitutions in all possible
positions).
The exceptional efficiency of the Eureka Genomics' software and its
ability to manipulate huge amounts of genomic data opens a unique
opportunity to perform sophisticated analysis of a large number of genomic
sequences (animals, plants, insects and thousands of bacteria and viruses)
in conjunction with the data produced by
Next Generation Sequencing
technology.
For licensing opportunities outside of the US, please
contact us.
Eureka Genomics Corporation: 410 Pierce Street, Ste. 307, Houston, TX 77002 | 750 Alfred Nobel Drive, Hercules, CA 94547
Phone: (510) 964-0461 | Fax: (510) 964-9705 | Email: contact@eurekagenomics.com