Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

gatk MarkDuplicatesSpark exits without error message

Answered
0

3 comments

  • Avatar
    Pamela Bretscher

    Hi Ramesh Ramasamy,

    You may be allocating too much of your total memory to the job given that you are specifying -Xmx116G with only 120G of total memory. Generally, it is recommended to allocate no more than 80-90% of your available memory to the job. Please let me know if this helps solve the problem.

    Kind regards,

    Pamela

    0
    Comment actions Permalink
  • Avatar
    Ramesh Ramasamy

    Hi Pamela Bretscher,

    Thank you. Yes, it does look like OOMKiller kicked in and killed the job (based on the kernel log: /var/log/kern.log). I allocated ~85% of the total memory to the job and it is running since this morning. The job runs okay so far! I will let you know if I again run into the same issue.

     

    Thanks,

    Ramesh

     

    Update: The job ran successfully. Thanks!

    0
    Comment actions Permalink
  • Avatar
    Pamela Bretscher

    Hi Ramesh Ramasamy,

    Thank you for letting me know, I'm glad to hear that it worked!

    Kind regards,

    Pamela

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk