Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

Mutect2 LearnReadOrientationModel Memory Error

0

5 comments

  • Avatar
    Genevieve Brandt (she/her)

    Arvind Ravi have you tried yet specifying a temporary directory with --tmp-dir?

    0
    Comment actions Permalink
  • Avatar
    Arvind Ravi

    Thanks Genevieve. I take it that allows the task to offload memory to the disk during execution?

    I've updated the call as follows but am still running into a memory issue in a 30G mem VM...

    Command:

    gatk --java-options "-Xmx21G -Xms21G -XX:-UseGCOverheadLimit" LearnReadOrientationModel \
    -I ${sep=" -I " orientation_bias_files} \
    -O "artifact-priors.tar.gz" \
    --tmp-dir ob_tmp

    Error:

    02:10:58.629 INFO LearnReadOrientationModel - Shutting down engine [October 2, 2020 2:10:58 AM GMT] org.broadinstitute.hellbender.tools.walkers.readorientation.LearnReadOrientationModel done. Elapsed time: 168.95 minutes. Runtime.totalMemory()=20043530240 Exception in thread "main" java.lang.OutOfMemoryError: Java heap space at sun.misc.FloatingDecimal.readJavaFormatString(FloatingDecimal.java:1875) at sun.misc.FloatingDecimal.parseDouble(FloatingDecimal.java:110) at java.lang.Double.parseDouble(Double.java:538) at htsjdk.samtools.util.FormatUtil.parseDouble(FormatUtil.java:141) at htsjdk.samtools.metrics.MetricsFile.read(MetricsFile.java:434) at org.broadinstitute.hellbender.tools.walkers.readorientation.LearnReadOrientationModel.readMetricsFile(LearnReadOrientationModel.java:296) at org.broadinstitute.hellbender.tools.walkers.readorientation.LearnReadOrientationModel.lambda$doWork$7(LearnReadOrientationModel.java:96) at org.broadinstitute.hellbender.tools.walkers.readorientation.LearnReadOrientationModel$$Lambda$53/805561728.apply(Unknown Source) at java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:193) at java.util.ArrayList$ArrayListSpliterator.forEachRemaining(ArrayList.java:1382) at java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:482) at java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:472) at java.util.stream.ReduceOps$ReduceOp.evaluateSequential(ReduceOps.java:708) at java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:234) at java.util.stream.ReferencePipeline.collect(ReferencePipeline.java:566) at org.broadinstitute.hellbender.tools.walkers.readorientation.LearnReadOrientationModel.doWork(LearnReadOrientationModel.java:97) at org.broadinstitute.hellbender.cmdline.CommandLineProgram.runTool(CommandLineProgram.java:140) at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMainPostParseArgs(CommandLineProgram.java:192) at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:211) at org.broadinstitute.hellbender.Main.runCommandLineProgram(Main.java:160) at org.broadinstitute.hellbender.Main.mainEntry(Main.java:203) at org.broadinstitute.hellbender.Main.main(Main.java:289) 2020/10/02 02:11:05 Starting delocalization.

    0
    Comment actions Permalink
  • Avatar
    Genevieve Brandt (she/her)

    Arvind Ravi it can help with memory issues if the temporary directory your machine is using does not have a lot of space, or is slow for reading and writing.

    Did you re-try your command on the 200 GB memory machine with 140 GB memory for java heap space? 

    How many files are in your input?

    0
    Comment actions Permalink
  • Avatar
    Arvind Ravi

    Still getting the same error with the larger machine mem settings:

    "Elapsed time: 85.25 minutes. Runtime.totalMemory()=133621612544"

     

    There are 50 input files (6X whole genomes) for the call.

    Any other suggestions welcome!

     

    0
    Comment actions Permalink
  • Avatar
    Genevieve Brandt (she/her)

    Arvind Ravi Are you running multiple samples with the same command? This should be run on a single sample.

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk