genomic storage and compression
1 TopicGenomic Data Storage in Azure: Basic Compression for Mapped Sequencing Data
In this article, we show how compressing mapped (aligned) genomic sequencing data (BAM files) using the CRAM standard can reduce storage size (and cost) by around 63% at a compression cost of under a penny (< USD) per sample. We demonstrate our results on a set of 62 whole genome sequencing (WGS) samples from the 1000 Genomes Project. We also give detailed instructions on how to use this method on your own data.3.6KViews0likes2Comments