Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

get the basics for Haplotypecaller

0

2 comments

  • Avatar
    James Emery

    Hello Maarten Nijenhuis. We have a number of resources available for coming up to speed on using our tools for VariantCalling. You can find some old recordings of our GATK workshop material that goes over our best practices here https://support.terra.bio/hc/en-us/articles/360029633732-GATK-workshop-at-BroadE-March-2019-. We also have a set of FAQ tutorials here: https://gatk.broadinstitute.org/hc/en-us/categories/36000230231

    Those links mostly goes over the basics for how our calling pipeline works rather than your specific use case organism. There are two aspects of what you describe that I suspect will cause problems, the first of which is that you are using amplicon sequencing which can cause lots of problems in our tools. You can search the forums for advice on this sort of calling but our general advice is to drop MarkDuplicates from your pipeline and run HaplotypeCaller with the extra argument:

    --dont-use-soft-clipped-bases true

    Furthermore you should also be able to address the ployidy by adjusting the ployidy setting in HaplotypeCaller with this argument:

    --ploidy 4
    0
    Comment actions Permalink
  • Avatar
    Maarten Nijenhuis

    Thank you James Emery. I will start by studying these links.

    Best,
    Maarten

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk