Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Mutect2 bamout reports far more reads than the original BAM: is it reliable?

1 comment

  • David Benjamin

    Jana Marie Schwarz, the extra reads might be the artificial reads representing the locally assembled haplotypes. These generally all have a uniform length that spans the entire assembly window, rather than ending in the middle of it like an actual read. They are also marked with a fake read group tag (I think it's "HP"). If you sort by read group in IGV, they should stand out.
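
    Outside IGV, one rough way to check this is to count the bamout records per read group and see whether a group that is absent from the original BAM accounts for the extra reads. The sketch below is only an illustration under stated assumptions: it parses SAM text lines (for example the output of `samtools view` on the bamout file), the function name `count_reads_by_read_group` is made up here, and it does not assume any particular read group ID for the artificial haplotypes.

```python
from collections import Counter


def count_reads_by_read_group(sam_lines):
    """Count alignment records per RG tag in SAM-formatted text lines.

    Hypothetical helper for illustration; not part of GATK or samtools.
    """
    counts = Counter()
    for line in sam_lines:
        # Skip blank lines and SAM header lines (which start with '@').
        if not line.strip() or line.startswith("@"):
            continue
        fields = line.rstrip("\n").split("\t")
        read_group = "(no RG)"
        # The 11 mandatory SAM columns come first; optional tags follow.
        for tag in fields[11:]:
            if tag.startswith("RG:Z:"):
                read_group = tag[len("RG:Z:"):]
                break
        counts[read_group] += 1
    return counts
```

    Running the same count on the original BAM and comparing the two results should show which read group, if any, exists only in the bamout file. Reads in that group would be the candidates for artificial haplotype reads rather than real sequencing data.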
