Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Why does a site flagged as germline by FilterMutectCalls show no alternate bases in the normal sample?



    Gökalp Çelik

    Hi jackie chan wang

    We have a better response from our team. 

    This is counterintuitive but correct. Because the population allele frequency at this site is so high, the likelihood ratio of 2^15 in favor of the allele being absent from the germline, rather than being a germline het, does not outweigh the prior odds (roughly a million to one) that overwhelmingly favor germline over somatic. This is a purely Bayesian calculation and has nothing to do with hard filtering using gnomAD.
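To make the arithmetic concrete, here is a rough worked version using only the two round figures quoted above (a likelihood ratio of 2^15 and prior odds of about 10^6). It is a sketch of the Bayesian reasoning, not the full Mutect2 filtering model, which includes further terms. One way a factor of 2^15 could arise, as an illustrative assumption not stated in the thread, is 15 reference-only reads in the normal, each half as likely under a het genotype.

```latex
\[
\underbrace{\frac{P(\text{germline}\mid D)}{P(\text{not germline}\mid D)}}_{\text{posterior odds}}
=
\underbrace{\frac{P(D\mid \text{germline het})}{P(D\mid \text{not germline})}}_{\approx\, 2^{-15}}
\times
\underbrace{\frac{P(\text{germline})}{P(\text{not germline})}}_{\approx\, 10^{6}}
\approx \frac{10^{6}}{32768} \approx 30.5
\]
\[
P(\text{germline}\mid D) \approx \frac{30.5}{1 + 30.5} \approx 0.97
\]
```

So even with the normal reads arguing strongly against a germline het, the posterior under these figures still favors germline by roughly 30 to 1.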

    We hope this answers your question. 

    jackie chan wang

    like this.

    a) GATK version used: v4.4.0.0
    b) Exact command used: 

    gatk Mutect2 \
    -R /data/index/GATKindex/Homo_sapiens_assembly38.fasta \
    -I /data/breastfq/tmp/lym.sorted.markdup.BQSR.bam \
    -I /data/breastfq/tmpsam/small.sorted.markdup.BQSR.bam \
    -normal lym \
    --germline-resource /data/index/GATKindex/MUTECT_index/af-only-gnomad.hg38.vcf.gz \
    -L /data/breastfq/bed/baits.bed \
    -O ./primary_somatic.vcf.gz \
    -bamout ./small_normal.bam

    nohup gatk GetPileupSummaries \
    -I /data/breastfq/tmpsam/small.sorted.markdup.BQSR.bam \
    -V /data/index/GATKindex/MUTECT_index/small_exac_common_3.hg38.vcf.gz \
    -L /data/index/GATKindex/MUTECT_index/small_exac_common_3.hg38.vcf.gz \
    -O ./smallsummary.table &


    gatk GetPileupSummaries \
    -I /data/breastfq/tmp/lym.sorted.markdup.BQSR.bam \
    -V /data/index/GATKindex/MUTECT_index/small_exac_common_3.hg38.vcf.gz \
    -L /data/index/GATKindex/MUTECT_index/small_exac_common_3.hg38.vcf.gz \
    -O ./lymsummary.table


    gatk CalculateContamination \
    -I /data/breastfq/smallsummary.table \
    -matched /data/breastfq/privcf/lymsummary.table \
    -tumor-segmentation ./segments.table \
    -O ./pair_calculatecontamination.table


    gatk FilterMutectCalls \
    -R /data/index/GATKindex/Homo_sapiens_assembly38.fasta \
    -V /data/breastfq/primary_somatic.vcf.gz \
    --contamination-table /data/breastfq/pair_calculatecontamination.table \
    --stats /data/breastfq/primary_somatic.vcf.gz.stats \
    --tumor-segmentation /data/breastfq/segments.table \
    -O ./somatic_oncefiltered.vcf.gz

    Gökalp Çelik

    Hi jackie chan wang

    Do you see the same filter applied at this position when you run without the gnomAD resource file? It is possible that the site was filtered because of the resource file rather than because of evidence in your normal BAM.

    Can you rerun your analysis without the gnomAD resource file?
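As a sketch of that re-run, the Mutect2 call from the original post could be repeated with the `--germline-resource` line removed. The output name `nogermres_somatic.vcf.gz` and the `RUN` switch are hypothetical, added here only to avoid overwriting earlier results and to allow a dry run; the `-bamout` option is left out for brevity. All other paths and arguments are copied from the posted command.

```shell
#!/bin/sh
# Original Mutect2 call with --germline-resource removed.
# The output file name is hypothetical, chosen so earlier results are not overwritten.
mutect2_cmd="gatk Mutect2 \
-R /data/index/GATKindex/Homo_sapiens_assembly38.fasta \
-I /data/breastfq/tmp/lym.sorted.markdup.BQSR.bam \
-I /data/breastfq/tmpsam/small.sorted.markdup.BQSR.bam \
-normal lym \
-L /data/breastfq/bed/baits.bed \
-O ./nogermres_somatic.vcf.gz"

# By default the command is only printed; set RUN=1 to execute it.
if [ "${RUN:-0}" = "1" ]; then
  eval "$mutect2_cmd"
else
  echo "$mutect2_cmd"
fi
```

The resulting VCF would then go through FilterMutectCalls as before, so that the FILTER value at the position in question can be compared between the two runs.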

    I hope this helps. 

