Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

GATK mark duplicates error

0

2 comments

  • Avatar
    Kshama Aswath

    So adding to the above post:

    even if it is a "harmless" error and I can ignore it,  could it be a memory issue? I ran it default and did not specify any memory allotment. If that is the case, is there a guide on memory needed ( roughly) on different tools. I understand input file sizes varies and it is hard to say exactly how much to allot, but a ball park idea could hlep. Thankyou !

    0
    Comment actions Permalink
  • Avatar
    Genevieve Brandt (she/her)

    Hi Kshama Aswath, please post the entire error message so we can look into the problem. 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk