Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

Mismatch between mutect2 VCF and IGV BAM stats (causing strand bias issue)

0

3 comments

  • Avatar
    Gökalp Çelik

    Hi Ben Thompson

    F1R2 and F2R1 are not the number of reads supporting either allele from different strands but rather pair orientations that support either ref and alt alleles. What you see in IGV is number individual piled reads for that site that support either allele. 

    I hope this helps. 

    0
    Comment actions Permalink
  • Avatar
    Ben Thompson

    Thank you Gökalp! That makes sense then I think I misunderstood the IGV view and got Strand Bias and Read Orientation Bias mixed up. I believe I successfully filtered out strand bias using Mutect2’s F1R2/F2R1 annotations, but when I checked the results in IGV, I mistakenly thought the remaining read orientation artifacts were still strand bias.

    Given this, would you recommend adding LearnOrientationModel into my pipeline so I can use --ob-priors in FilterMutectCalls? From what I understand, this is the best way to remove read orientation bias from my Mutect2 VCFs currently?

    I don’t have a Panel of Normals (PoN) or control samples, just MSA brain samples, so would the best approach be:

    1. Running LearnOrientationModel separately on each sample,
    2. Combining all .tar.gz model files into one orientation bias model file,
    3. Using that as the --ob-priors input when running FilterMutectCalls?

    Many thanks for your help!

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi again. 

    Read Orientation Model should be generated only per sample and should not be made into a combined model. For the usefulness of the tool you need to make sure that UMI collapsing should 

    - not create collapsed reads solely based on UMI similarity

    - care about read strandedness and orientation during collapsing.

    If these 2 parts are not provided orientation bias is not useful and should definitely be avoided. 

    I hope this helps. 

     

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk