Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

Issue with Snakemake Rule and Cluster Resource Allocation for MarkDuplicatesSpark

0

1 comment

  • Avatar
    Gökalp Çelik

    Hi Aravind Sundar

    Automatic garbage collection could be the cause of those additional threads which can all be reduced by increasing the amount of heap size and limiting the number of parallel GC threads in java parameters. Other than that you might need to experiment with your parameters to figure out the optimal settings for your case. Unfortunately we cannot help solve issues with snakemake therefore your mileage may vary. 

    Regards. 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk