Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

The DP value of variants called in Mutect2 vcf much lower than depth in input bam file

0

5 comments

  • Avatar
    Gökalp Çelik

    Hi Bunmi Fariyike

    There will be a few questions to ask before we proceed. 

    1- How did you calculate the average on target depth for these samples? Did you use CollectHsMetrics?

    2- Have you tried with --max-reads-per-alignment-start 0 option with Mutect2 ? Mutect2 and HaplotypeCaller uses downsampling to ease the performance however for those who are absolutely in need for getting every single read to be processed we recommend using this option regardless of the data type. 

    Finally we do not recommend using bowtie2 for aligner with Mutect2 unless you have a very important reason to do so. Bowtie2 uses  a different scoring scheme for MAPQ which is not compatible with GetPileupSummaries tool that we have as an accessory for somatic calling. 

    Let us know the answers to these questions and we can proceed further. 

    Regards. 

    0
    Comment actions Permalink
  • Avatar
    Bunmi Fariyike

    Hi Gökalp Çelik

    Thanks for the quick reply and your advice. 

    1. For calculating the depth, we used the depth operator in samtools with a path to our covered .bed file and the .bam file after bowtie2 and then the same with the .bam file after mutect2

    2. I also feared downsampling and set the downsampling stride to 20 in a recent run of a few samples. It seems like depth grossly improved. Is this similar to what you propose? Should I employ this max-reads-per-alignment method instead?

    3. We started using bowtie2 at the recommendation of another lab who used it for their pipeline. I think their only rationale was that it was already installed on our computing cluster. If not bowtie2, what would you recommend? BW2? Some other alignment tool?

    Thanks again for all your wisdom!

     

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi again. 

    You can use --max-reads-per-alignment-start 0 parameter to disable downsampling in Mutect2. 

    Bowtie2 uses maximum mapping quality of 44 for local mode and 42 for end-to-end mapping. These values are not directly compatible with GetPileupSummaries tool which uses minimum MAPQ 50 therefore if you use the tool without modifying this parameter it will return an empty file. If this tool is not of your interest then it may be OK to proceed. Overall BWA MEM can map more reads compared to Bowtie2 and may recover more shorter matches. 

    I hope this helps.

    Regards. 

    0
    Comment actions Permalink
  • Avatar
    Bunmi Fariyike

    Gökalp Çelik Hey there again! Thank you again for this helpful information. One last question just to confirm: turning off downsampling doesn't have any deleterious implications for downstream analysis, right? The reads will still be high quality, not duplicates, etc?

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    No it does not affect the way other ReadFilters perform. It will only disable subsampling of reads. 

    Regards. 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk