Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

How to compress VCF for IndexFeatureFile to avoid MalformedFeatureFile error

0

2 comments

  • Avatar
    Gökalp Çelik

    Hi Sofya Dubovskova

    Our tools and bunch of other HTS related tools do not use gzip directly. Instead they use a modified block compression algorithm called bgzip. HaplotypeCaller compresses using this therefore indexing works. You may be able to use bgzip tool that comes with samtools or bcftools or can be found in many mainstream linux distro repos. 

    Regards. 

    1
    Comment actions Permalink
  • Avatar
    Sofya Dubovskova

    Thank you!

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk