15p.zip
: Containing those same reads mapped to a reference genome.
: A co-compressed file containing both BAM and FASTQ data is typically only slightly larger than the BAM file alone . 15p.zip
Introduced in , the "Deep" method (triggered by the --deep command) is a novel approach to co-compression . How It Works : Containing those same reads mapped to a reference genome
: This effectively "hides" the FASTQ data within the BAM file's footprint, potentially halving the storage costs for genomic repositories. How It Works : This effectively "hides" the
While a standard .zip file uses the (a combination of LZ77 and Huffman coding) to find repeated patterns within a single stream of data, Genozip’s Deep method is domain-specific. It understands the structure of genomic data, allowing it to achieve compression ratios that general-purpose tools cannot match.
: Despite the aggressive reduction, the process is bit-for-bit lossless. Using the Genozip platform , users can reconstruct the original FASTQ and BAM files exactly as they were. Performance and Impact
: Decompression is handled automatically by standard Genozip commands like genounzip (to restore the full set) or genocat (to extract a specific file). Comparison: Deep vs. Standard ZIP
