Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

GATK Mutect2 counts the alternate base twice in overlapping paired reads

0

5 comments

  • Avatar
    Gökalp Çelik

    Hi JUNHUI LI

    Can you try running the same sample through GATK 4.6.0.0 and see if the issue persists?

    0
    Comment actions Permalink
  • Avatar
    JUNHUI LI

    Thanks, Gökalp Çelik.

    This issue is still there in GATK version 4.6.0.0. I would appreciate any recommendations you might have for working around this issue.

    --Junhui

     

     

     

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi JUNHUI LI

    The current behavior for 4.6.0.0 is the expected one and Mutect2 itself is smart in separating overlapping evidence due to PCR error vs real evidence from mutations in DNA. Therefore base calling qualities are adjusted during reassembly for those overlapping sites until it reaches a point where it falls into PCR error category. If you wish to get us more insights about this particular site or any other false positive site you can provide examples of please share us the entire variant context with the INFO fields so that we can take a look at what Mutect2 provided us. 

    I hope this helps.

    Regards. 

    0
    Comment actions Permalink
  • Avatar
    JUNHUI LI

    Hi Gökalp Çelik,

    Thanks. Here are two variants called with version 4.6.0.0

    chr1    42256836        .       C       T       .       .       AS_SB_TABLE=20,21|2,2;DP=45;ECNT=4;ECNTH=2;MBQ=20,20;MFRL=121,97;MMQ=60,60;MPOS=49;POPAF=3.83;TLOD=5.72 GT:AD:AF:DP:F1R2:F2R1:FAD:SB    0/1:41,4:0.100:45:15,0:5,2:26,2:20,21,2,2
    chr1    83879352        .       G       T       .       .       AS_SB_TABLE=9,9|2,1;DP=21;ECNT=3;ECNTH=2;MBQ=20,20;MFRL=160,172;MMQ=57,58;MPOS=44;POPAF=7.30;TLOD=5.67  GT:AD:AF:DP:F1R2:F2R1:FAD:SB    0/1:18,3:0.188:21:6,0:3,2:12,2:9,9,2,1

    version 4.0.12 only outputs the second variant as below,

    chr1    83879352        .       G       T       .       .       DP=15;ECNT=2;MBQ=32,33;MFRL=167,203;MMQ=58,58;MPOS=36;POPAF=7.30;TLOD=4.41      GT:AD:AF:DP:F1R2:F2R1:SAAF:SAPP 0/1:12,2:0.186:14:7,0:5,2:0.00,0.141,0.143:0.044,0.011,0.944

    I’m grateful for your support.

    Thanks,

    Junhui

     

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi JUNHUI LI

    Those values seem like they are not much and they will definitely get filtered out by FilterMutectCalls. Our suggestion would be to perform FilterMutectCalls first before deciding if there are any outstanding false positives that remain. 

    I hope this helps.

    Regards. 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk