Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

BaseRecalibrator error

0

2 comments

  • Avatar
    Niharika

    Hi Tanay,

    How did you solve this problem? I am facing same problem.

    0
    Comment actions Permalink
  • Avatar
    Louis Bergelson

    It looks like a mismatch in the reference you're using between different files.  The reference file has contigs named chr1, chr2, etc.  The known sites vcf's seems to be based on a different reference because it has contigs are named 1,2,3, etc.  Is it possible one of the files isn't actually based on hg38?  

    I notice the warning:

    18:29:40.675 WARN  IndexUtils - Feature file "file:///scratch/tbiswas/largefiles/hg38_dnSNP.vcf" appears to contain no sequence dictionary. Attempting to retrieve a sequence dictionary from the associated index file

     

    If you're sure that the files ARE matched and generated on the same reference build you might try adding a sequence dictionary with the appropriate contigs to your known sites vcf. 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk