Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

CombineGVCFs, java heap space, and ploidy

Answered
0

6 comments

  • Avatar
    Pamela Bretscher

    Hi Alix Matthews,

    It is true that CombineGVCFs should be able to handle any ploidy, but there have been significant improvements to the tool since version 4.0.0.0. Could you try running this with the latest version of GATK?

    Kind regards,

    Pamela

    0
    Comment actions Permalink
  • Avatar
    Alix Matthews

    Thank you for your quick reply. I will request an update to GATK 4.2.0 on the server I am using and try again. I will let you all know if the problem resolves or doesn't resolve with the update. Thank you again.

    0
    Comment actions Permalink
  • Avatar
    Isadora Machado Ghilardi

    Hello! I`m using the version 4.4.0.0.

    I did the HaplotypeCaller in my 18 samples, and it worked. Now I`m using the CombineGVCFs to put the samples together in a cohort. However I go this error: 

    Exception in thread "main" java.lang.OutOfMemoryError: Java heap space
            at htsjdk.tribble.readers.TabixReader.readLong(TabixReader.java:195)
            at htsjdk.tribble.readers.TabixReader.readIndex(TabixReader.java:267)
            at htsjdk.tribble.readers.TabixReader.readIndex(TabixReader.java:287)
            at htsjdk.tribble.readers.TabixReader.<init>(TabixReader.java:165)
            at htsjdk.tribble.readers.TabixReader.<init>(TabixReader.java:129)
            at htsjdk.tribble.TabixFeatureReader.<init>(TabixFeatureReader.java:80)
            at htsjdk.tribble.AbstractFeatureReader.getFeatureReader(AbstractFeatureReader.java:117)
            at org.broadinstitute.hellbender.engine.FeatureDataSource.getTribbleFeatureReader(FeatureDataSource.java:433)
            at org.broadinstitute.hellbender.engine.FeatureDataSource.getFeatureReader(FeatureDataSource.java:377)
            at org.broadinstitute.hellbender.engine.FeatureDataSource.<init>(FeatureDataSource.java:319)
            at org.broadinstitute.hellbender.engine.FeatureDataSource.<init>(FeatureDataSource.java:291)
            at org.broadinstitute.hellbender.engine.FeatureManager.addToFeatureSources(FeatureManager.java:225)
            at org.broadinstitute.hellbender.engine.MultiVariantWalker.lambda$initializeDrivingVariants$0(MultiVariantWalker.java:86)
            at org.broadinstitute.hellbender.engine.MultiVariantWalker$$Lambda$196/0x00002ad3f85a89f0.accept(Unknown Source)
            at java.base/java.util.ArrayList$ArrayListSpliterator.forEachRemaining(ArrayList.java:1625)
            at java.base/java.util.stream.ReferencePipeline$Head.forEach(ReferencePipeline.java:762)
            at org.broadinstitute.hellbender.engine.MultiVariantWalker.initializeDrivingVariants(MultiVariantWalker.java:76)
            at org.broadinstitute.hellbender.engine.VariantWalkerBase.initializeFeatures(VariantWalkerBase.java:67)
            at org.broadinstitute.hellbender.engine.GATKTool.onStartup(GATKTool.java:726)
            at org.broadinstitute.hellbender.engine.MultiVariantWalker.onStartup(MultiVariantWalker.java:49)
            at org.broadinstitute.hellbender.cmdline.CommandLineProgram.runTool(CommandLineProgram.java:147)
            at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMainPostParseArgs(CommandLineProgram.java:198)
            at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:217)
            at org.broadinstitute.hellbender.Main.runCommandLineProgram(Main.java:160)
            at org.broadinstitute.hellbender.Main.mainEntry(Main.java:203)
            at org.broadinstitute.hellbender.Main.main(Main.java:289)

    0
    Comment actions Permalink
  • Avatar
    Isadora Machado Ghilardi

    Do I have to do the aggregation  step and then do the joint genotype using the GenotypeGVCFs or can I move to the GenotypeGVCFs setp?

     

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi Isadora Machado Ghilardi

    GenotypeGVCFs require a single VCF input for genotyping therefore GVCF files must be combined or imported to genomicsdb before genotyping. Can you increase the heap size by using the below parameter?

    --java-options "-Xmx8G"

    This parameter will set the heap size to 8 GB so if needed you can increase the number but do not assign more than 80~90% of all your total memory size as that will result in additional issues. 

     

    0
    Comment actions Permalink
  • Avatar
    Isadora Machado Ghilardi

    It worked! Thank you!! 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk