Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

MarkDuplicates has different duplication metrics than EstimateLibraryComplexity

0

1 comment

  • Avatar
    Genevieve Brandt (she/her)

    Hi Ravi Mandla, we would expect some differences in these metrics because the tools do not work the same. MarkDuplicates uses alignment information to determine duplicates. EstimateLibraryComplexity determines duplicates from the bases of the reads, allowing for some error, ignoring the reference. Hopefully this helps clarify these differences for you!

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk