Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

GATK4: RNAseq short variant discovery (SNPs + Indels)

0

3 comments

  • Avatar
    Bhanu Gandham

    Hi bsmith030465




    I agree this is confusing and we are looking into fixing this. So thank you for bringing this up.

    There are two things going on here:

    1. HaplotypeCaller is calling the site's genotype homozygous reference, but there is an alt allele. This should not happen. Can you please try to reproduce this error with the latest version of GATK and let us know if the issue persists?

    2. ValidateVariants is doing more strict validation than the VCF spec to try to catch logical errors in data processing. So we should probably add a flag to disable the extra strictness in ValidateVariants. We have created a issue ticket for it here: https://github.com/broadinstitute/gatk/issues/6553

    0
    Comment actions Permalink
  • Avatar
    bsmith030465

    Bhanu,

    I will try the newer version of haplotype caller.

    Meanwhile, is there a way that ValidateVariants makes a list of all variants that fail in a vcf file? Currently, it appears to exit as soon as one failed variant or error is encountered. Is there a flag I can set for this?

     

    thanks!

    0
    Comment actions Permalink
  • Avatar
    Bhanu Gandham

    Hi bsmith030465

     

    Try the --warn-on-errors argument. This will emit warnings on errors instead of terminating the run at the first instance

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk