Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Picard MarkDuplicates Functionality

4 comments

  • Gökalp Çelik

    Hi Nagashree Rachote

    MarkDuplicates recalculates the duplicate status of every read and ignores any existing duplicate flags, so you don't have to clear them beforehand. If a read was already marked as a duplicate and MarkDuplicates again decides that it is one, it is simply re-marked, as if it had never been marked before.

    Regards. 

  • Dong Yiyi

    Hi all,

    I wonder whether reads marked by Picard's MarkDuplicates are automatically ignored by the analysis steps that follow. By this, I mean the case where the reads are only marked but not removed (-REMOVE_DUPLICATES false). I've heard it mentioned that once reads are marked, they will never be considered again by any software. Is this true? Are there any arguments that control whether these reads are skipped?

    Thanks!

  • Michael Gatzen

    Hi Dong Yiyi,

    It is true that, by default, MarkDuplicates only marks reads as duplicates and does not remove them (unless you set REMOVE_DUPLICATES to true). Every GATK tool has a duplicate read filter enabled by default and will therefore not consider reads marked as duplicates. Most other bioinformatics tools should behave the same way, since ignoring duplicates is certainly the right thing to do, but we cannot guarantee that every (third-party) software does.

    Hope this helps,
    Michael

  • Dong Yiyi

    Thank you, Michael!

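To make the two points in this thread concrete, here is a minimal Python sketch of how a duplicate mark is stored and used. It relies on one fact from the SAM specification: a duplicate is recorded as bit 0x400 (1024) of a read's FLAG field. Everything else is an illustrative assumption: the helper names `is_duplicate`, `remark`, and `drop_duplicates` are hypothetical, the FLAG values and duplicate decisions are toy data, and none of this is Picard's or GATK's actual implementation.

```python
# SAM FLAG bit for "PCR or optical duplicate" (0x400 = 1024), per the SAM specification.
DUPLICATE_FLAG = 0x400


def is_duplicate(flag: int) -> bool:
    """Return True if the duplicate bit is set in a SAM FLAG value."""
    return bool(flag & DUPLICATE_FLAG)


def remark(flag: int, duplicate_now: bool) -> int:
    """Set or clear the duplicate bit from the new decision alone.

    The previous state of the bit is discarded, mirroring the behaviour
    described in the thread (a toy model, not Picard's implementation).
    """
    cleared = flag & ~DUPLICATE_FLAG
    return (cleared | DUPLICATE_FLAG) if duplicate_now else cleared


def drop_duplicates(flags):
    """Hypothetical downstream filter: keep only reads not flagged as duplicates."""
    return [f for f in flags if not is_duplicate(f)]


# Toy FLAG values: 99 and 147 are a properly paired read and its mate;
# 1123 is 99 with the duplicate bit already set by an earlier run.
old_flags = [99, 147, 1123]
# Hypothetical fresh duplicate decisions for the three reads, in order.
decisions = [True, False, True]

new_flags = [remark(f, d) for f, d in zip(old_flags, decisions)]
print(new_flags)                   # flags after re-marking
print(drop_duplicates(new_flags))  # what a duplicate-aware tool would keep
```

The marked reads are still present in `new_flags`; they only disappear from the analysis when a tool chooses to apply a filter such as `drop_duplicates`. This is why a tool that does not check the duplicate bit would still use them.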
