Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

GATK 4.1.7.0 does not annotate ID using dbSNP build 153 VCF

0

3 comments

  • Avatar
    Bhanu Gandham

    Hi danilovkiri

     

    We are looking into fixing this. Will share a PR with you for it soon. Stay tuned!

    1
    Comment actions Permalink
  • Avatar
    danilovkiri

    Thank you Bhanu Gandham

    In case it might be helpful for other users, the quickest and the easiest workaround here it is to use

    bcftools norm --fasta-ref <path_to_fasta> -c wx --multiallelic - <input_vcf>

    in order to split all multiallelic sites in dbSNP VCF and normalize them. The resulting VCF can be used for annotation without any of the above-mentioned problems.

    0
    Comment actions Permalink
  • Avatar
    Bhanu Gandham

    Hi danilovkiri

     

    Thank you for sharing the workaround! And here is the PR we promised: https://github.com/broadinstitute/gatk/pull/6626

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk