Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

normal sample AF in Mutect2 paired mode

1

3 comments

  • Avatar
    David Benjamin

    The AFs emitted by Mutect2 are the means of the posterior distribution on cell fraction conditioned on a variant existing.  The AF of 0.032 you see, for example, means "if this variant is really present in the normal (not that Mutect2 thinks it is), then our best guess of its allele fraction in the normal sample's cells is 0.032".

    0
    Comment actions Permalink
  • Avatar
    Dmitrij Kazancev

    Hi, I'm glad I found this post because I had the same question regarding non-zero AF values.

    What would be the best method to filter out variants that have zero ALT reads? I'm thinking about counting ALT variants in reads and if there are zero assign "real" AF = 0. Maybe there are better methods?

    I'm using Mutect2 (GATK 4.5.0.0) to call somatic variants in whole exome tumor-normal pairs.

    The command in my scripts looks like this:

    docker run --rm -v $WDIR:$WDIR -v $REFDIR:$REFDIR -w $WDIR/bam \
        broadinstitute/gatk:$VERSION \
        gatk \
        Mutect2 \
        --native-pair-hmm-threads $THREADS \
        -R $REFG \
        --germline-resource "$REFDIR/gatk-BP-somatic-hg38/af-only-gnomad.hg38.vcf.gz" \
        -I $BAMFILE_NORMAL \
        -normal $SAMPLE_NORMAL \
       -I $BAMFILE_TUMOR \
        -O $WDIR/vcf/$ID.unfiltered.vcf

    Thanks for developing these tools!

    Dmitrij

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi Dmitrij Kazancev

    Germline AF calculation in Mutect2 has its own merits from population genetics and genotypic priors. But if you absolutely filter out any 0 ALT allele in normals you may use our VariantFiltration tool with to manually filter those sites.

    Regards. 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk