Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

Idetify biallelic loci in GATK 4,8

0

1 comment

  • Avatar
    Gökalp Çelik

    Hi RN Sarma

    Our recommendations are still the same. Instead of changing the default behavior of HaplotypeCaller we recommend running HaplotypeCaller by default and later filtering and selecting variants from the raw vcf. 

    Also the current command is quite open to resource errors due to having too many files open and requiring too much heap size to keep all bam files for tracking variants. We suggest you to call variants from each sample individually using the --ERC GVCF parameter and later combine GVCF files and genotype them together so that you can work within more reasonable compute resources. 

    Also instead of opening multiple topics for the same question, please try to continue from the same topic as it helps keeping all the ideas together and keeps following up issues better on our end and your end as well. 

    I am closing all other topics related to this subject from before. Please continue from here.

    Regards. 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk