Interval Operations Benchmark: February 2026 Update
polars-bio 0.24.0 leads 7 of 8 interval operations at scale, with near-linear thread scaling and sub-second overlap on 307M pairs.
We power bioinformatics and omics analysis with Big Data technology, helping you keep control over sequencing data and process and analyze large datasets in research, diagnostics, and precision medicine.
We focus on NGS sequencing and on functional and population genetics.
We deliver end-to-end solutions to speed up your research and accelerate implementation.
We use Big Data technologies and Machine Learning techniques to drive better insights into your data.
We build solutions with the latest cloud technologies, drawing on our Software Engineering experience.
A new type of data analysis system for long-term storage, retrieval, analysis, and clinical use of genomic and other digital information.
In the current era of high-throughput DNA sequencing, the cost of a single examination makes it accessible for common diagnostics, but the analysis itself remains complicated and resource-intensive.
To address the pace of research and the requirements from diagnostics as precision medicine gains popularity, there is a need for a new kind of data analysis system.
Learn More About iGAP →
The number of sequenced individuals is growing exponentially. Simple file-based access is inefficient and limits how you can use the data.
To unlock the power of NGS-based diagnostics, data analytics performance cannot be a limitation, especially for multisample analysis.
Genomic data is the most personal data we all have. It should be processed and stored with the highest security standards — GDPR and HIPAA compliant.
With comprehensive knowledge of the latest pipelines and efficient use of technology, we provide the required analysis faster at the highest quality.
A mix of world-class specialists in genomics, bioinformatics, data science, and information technology.
Head of Research
Tomek has been working on research and commercial projects within Genomics and Bioinformatics for the past 10 years.
Head of Technology
His professional career has always focused on data, starting in Data Warehousing and BI and continuing in Big Data technologies.
Chief Architect
Maciej is an experienced DevOps and infrastructure engineer specializing in cloud platforms and CI/CD for genomics workloads.
Operations
Rafal has been working in IT for over 10 years, mostly in BI and Cloud, with a focus on business development and team building.
Want to work with cutting-edge technologies? Join us! info (at) zettagene.com
Check our recent work and findings.
Introducing polars-bio—a blazingly fast Python DataFrame library for genomics built on Apache DataFusion and Arrow. Experience up to 6.5x faster overlaps and 15.5x faster nearest queries.
This year we also taught students of the 'Omics Data Science' course how to run genomic analyses effectively in a distributed environment.
We work closely with the biggest sequencing centers in Poland, providing bioinformatics services to researchers and medical institutions.