Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

ApplyBQSR wastes too much disk space

0

4 comments

  • Avatar
    registered_user

    This might work: 

    samtools view -@ 6 -h -b --output-fmt-option level=6 -o compressed.bam uncompressed.bqsr.bam

    You should be able to leave out "--output-fmt-option level=6" and it should work just as well.

    0
    Comment actions Permalink
  • Avatar
    Pamela Bretscher

    Hi registered_user,

    Thank you for providing this solution that may be helpful for other users in the same situation! Was this successful for you at changing the bam compression?

    Kind regards,

    Pamela

    0
    Comment actions Permalink
  • Avatar
    registered_user

    I was able to get a ~25% filesize reduction with running the bam file through samtools view. However, trying to use "ApplyBQSR --java-options "-Dsamjdk.compression_level=6" results in error:

    Error: Could not find or load main class "-Dsamjdk.compression_level=6"

    Working on solving that currently so I don't need to run samtools view for all future files.

    0
    Comment actions Permalink
  • Avatar
    registered_user

    Solution:

    gatk --java-options "-Dsamjdk.compression_level=6" ApplyBQSR [options]
    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk