Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

Hard filtering using germline-resource

1 comment

  • Gökalp Çelik

    Hi Shashwat Sahay

    The germline resource gives Mutect2 a prior for deciding whether a site is likely to be of germline origin. The germline filter is explained in more detail in our documentation:

    https://github.com/broadinstitute/gatk/blob/master/docs/mutect/mutect.pdf 

    In short, the allele frequencies given in the germline resource act as a parameter in calculating the probability that a variant seen in the tumor is germline rather than somatic. A variant that is present in the germline resource with a very low allele frequency may therefore still be of somatic origin. The higher the allele frequency, the lower the probability that the variant is somatic and the more likely it is germline.
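To make the role of the population allele frequency concrete, here is a toy Bayesian sketch in Python. It is not Mutect2's actual germline model (that is described in the mutect.pdf linked above). The function name, the Hardy-Weinberg carrier prior, the somatic prior of 1e-6, and the default assumption that the reads fit both explanations equally well are all illustrative assumptions.

```python
def germline_posterior(population_af, somatic_prior=1e-6,
                       germline_likelihood=1.0, somatic_likelihood=1.0):
    """Toy posterior probability that a variant is germline.

    Hypothetical simplification, NOT Mutect2's real model:
    - the germline prior is the Hardy-Weinberg probability of carrying
      at least one copy of the allele, 1 - (1 - f)^2
    - somatic_prior is an assumed per-site prior for a somatic mutation
    - the two likelihoods stand for how well the reads fit each explanation
    """
    if not 0.0 <= population_af <= 1.0:
        raise ValueError("population_af must be between 0 and 1")
    germline_prior = 1.0 - (1.0 - population_af) ** 2
    germline_weight = germline_prior * germline_likelihood
    somatic_weight = somatic_prior * somatic_likelihood
    total = germline_weight + somatic_weight
    # With no support for either explanation, report 0 rather than divide by 0.
    return germline_weight / total if total > 0 else 0.0


# Posterior for a range of population allele frequencies (illustrative values).
for af in (0.0, 1e-7, 1e-5, 1e-3, 0.01, 0.3):
    print(f"AF={af:g}  P(germline)={germline_posterior(af):.4f}")
```

In this sketch a very rare allele leaves room for a somatic explanation, while a common allele pushes the posterior toward germline, which is the qualitative behavior described above.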

    Mutect2 also has a parameter that sets the population allele frequency threshold for germline resource variants to be tagged as germline:

    --max-population-af,-max-af <Double>
                                  Maximum population allele frequency in tumor-only mode.  Default value: 0.01.
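As a rough illustration of where this option goes, the Python sketch below assembles a tumor-only Mutect2 command line. The helper name and all file names are hypothetical placeholders. Only the flags themselves (`-R`, `-I`, `--germline-resource`, `--max-population-af`, `-O`) are Mutect2 arguments, and the value 0.005 is an arbitrary example, not a recommendation.

```python
import shlex


def build_mutect2_command(reference, tumor_bam, germline_resource,
                          output_vcf, max_population_af=0.01):
    """Assemble a tumor-only Mutect2 command as a list of arguments.

    Hypothetical helper for illustration; it only builds the command
    and does not run GATK.
    """
    if not 0.0 <= max_population_af <= 1.0:
        raise ValueError("max_population_af must be between 0 and 1")
    return [
        "gatk", "Mutect2",
        "-R", reference,
        "-I", tumor_bam,
        "--germline-resource", germline_resource,
        "--max-population-af", str(max_population_af),
        "-O", output_vcf,
    ]


# Placeholder file names; substitute your own reference, BAM and resource.
command = build_mutect2_command(
    reference="reference.fasta",
    tumor_bam="tumor.bam",
    germline_resource="germline_resource.vcf.gz",
    output_vcf="unfiltered.vcf.gz",
    max_population_af=0.005,
)
print(shlex.join(command))
```

The resulting list can be passed to `subprocess.run(command, check=True)` on a machine where `gatk` is installed.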

    I hope this helps.

    Regards. 
