Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

CombineGVCF slowing down at certain region on chromosome 1

0

3 comments

  • Avatar
    wd

    Update on the problem above.

    As smaller batches or selecting only one chromosome (using intervals option (-L)) did not speed up CombineGVCF, I decided to try GenomicsDBimport.

    I successfully created a GenomicsDB of the 36 samples for chromosome 1, using the following command:

    gatk GenomicsDBImport \
    -V output/B7_2.g.vcf.gz \
    -V (34 samples)... \
    -V output/B8_7.g.vcf.gz \
    --genomicsdb-workspace-path RC_chr1 \
    --intervals chr_1

    and would like now use GenotypeVCF on this GenomicsDB  using the following command:

    gatk GenotypeGVCFs \
    -R Reference/uT.fasta \
    -V gendb://RC_chr1 \
    -O RC_chr1_36samples.vcf

    but not get the following error:

    22:57:31.195 INFO  GenotypeGVCFs - ------------------------------------------------------------

    22:57:31.196 INFO  GenotypeGVCFs - The Genome Analysis Toolkit (GATK) v4.1.6.0

    22:57:31.196 INFO  GenotypeGVCFs - For support and documentation go to https://software.broadinstitute.org/gatk/

    22:57:31.196 INFO  GenotypeGVCFs - Executing as xxxxx@galaxy on Linux v4.4.0-133-generic amd64

    22:57:31.196 INFO  GenotypeGVCFs - Java runtime: OpenJDK 64-Bit Server VM v1.8.0_192-b01

    22:57:31.196 INFO  GenotypeGVCFs - Start Date/Time: 26 May 2020 22:57:30 CEST

    22:57:31.196 INFO  GenotypeGVCFs - ------------------------------------------------------------

    22:57:31.197 INFO  GenotypeGVCFs - ------------------------------------------------------------

    22:57:31.197 INFO  GenotypeGVCFs - HTSJDK Version: 2.21.2

    22:57:31.197 INFO  GenotypeGVCFs - Picard Version: 2.21.9

    22:57:31.198 INFO  GenotypeGVCFs - HTSJDK Defaults.COMPRESSION_LEVEL : 2

    22:57:31.198 INFO  GenotypeGVCFs - HTSJDK Defaults.USE_ASYNC_IO_READ_FOR_SAMTOOLS : false

    22:57:31.198 INFO  GenotypeGVCFs - HTSJDK Defaults.USE_ASYNC_IO_WRITE_FOR_SAMTOOLS : true

    22:57:31.198 INFO  GenotypeGVCFs - HTSJDK Defaults.USE_ASYNC_IO_WRITE_FOR_TRIBBLE : false

    22:57:31.198 INFO  GenotypeGVCFs - Deflater: IntelDeflater

    22:57:31.198 INFO  GenotypeGVCFs - Inflater: IntelInflater

    22:57:31.198 INFO  GenotypeGVCFs - GCS max retries/reopens: 20

    22:57:31.198 INFO  GenotypeGVCFs - Requester pays: disabled

    22:57:31.198 INFO  GenotypeGVCFs - Initializing engine

    22:57:32.260 INFO  GenotypeGVCFs - Shutting down engine

    [26 May 2020 22:57:32 CEST] org.broadinstitute.hellbender.tools.walkers.GenotypeGVCFs done. Elapsed time: 0.03 minutes.

    Runtime.totalMemory()=2313682944

    ***********************************************************************

     

    A USER ERROR has occurred: Couldn't create GenomicsDBFeatureReader

     

    ***********************************************************************

    In addition, suppose I would succeed in creating the GenotypeVCF for chr1, how do I combine this vcf with those for other chromosomes?

    Any help is more than welcome!

    Thank you in advance.

    Regards

    Wannes

    PS: also tried the command below, based on the Gotcha's/Forum, but did not work either:

    "TILEDB_DISABLE_FILE_LOCKING=1`` gatk --java-options "-Xmx30g" GenotypeGVCFs\
    -R Reference/uT.fasta \
    -V gendb://RC_chr1 \
    -O RC_chr1_36samples.vcf

     

    0
    Comment actions Permalink
  • Avatar
    wd

    Update on the problem above:

    Added "-DGATK_STACKTRACE_ON_USER_EXCEPTION=true"  to --java-options

    Apparently "A USER ERROR has occurred: Couldn't create GenomicsDBFeatureReader"

    is caused by 

    "Caused by: java.io.IOException: GenomicsDB JNI Error: GenomicsDBConfigException : Syntax error in JSON file"

    Any suggestion is moer than welcome!

    ***********************************************************************

     

    A USER ERROR has occurred: Couldn't create GenomicsDBFeatureReader

     

    ***********************************************************************

    org.broadinstitute.hellbender.exceptions.UserException: Couldn't create GenomicsDBFeatureReader

    at org.broadinstitute.hellbender.engine.FeatureDataSource.getGenomicsDBFeatureReader(FeatureDataSource.java:410)

    at org.broadinstitute.hellbender.engine.FeatureDataSource.getFeatureReader(FeatureDataSource.java:326)

    at org.broadinstitute.hellbender.engine.FeatureDataSource.<init>(FeatureDataSource.java:282)

    at org.broadinstitute.hellbender.engine.VariantLocusWalker.initializeDrivingVariants(VariantLocusWalker.java:76)

    at org.broadinstitute.hellbender.engine.VariantWalkerBase.initializeFeatures(VariantWalkerBase.java:67)

    at org.broadinstitute.hellbender.engine.GATKTool.onStartup(GATKTool.java:706)

    at org.broadinstitute.hellbender.engine.VariantLocusWalker.onStartup(VariantLocusWalker.java:63)

    at org.broadinstitute.hellbender.cmdline.CommandLineProgram.runTool(CommandLineProgram.java:137)

    at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMainPostParseArgs(CommandLineProgram.java:191)

    at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:210)

    at org.broadinstitute.hellbender.Main.runCommandLineProgram(Main.java:163)

    at org.broadinstitute.hellbender.Main.mainEntry(Main.java:206)

    at org.broadinstitute.hellbender.Main.main(Main.java:292)

    Caused by: java.io.IOException: GenomicsDB JNI Error: GenomicsDBConfigException : Syntax error in JSON file /data/wannesd/RG_chr1/callset.json

    at org.genomicsdb.reader.GenomicsDBQueryStream.jniGenomicsDBInit(Native Method)

    at org.genomicsdb.reader.GenomicsDBQueryStream.<init>(GenomicsDBQueryStream.java:209)

    at org.genomicsdb.reader.GenomicsDBQueryStream.<init>(GenomicsDBQueryStream.java:182)

    at org.genomicsdb.reader.GenomicsDBQueryStream.<init>(GenomicsDBQueryStream.java:91)

    at org.genomicsdb.reader.GenomicsDBFeatureReader.generateHeadersForQuery(GenomicsDBFeatureReader.java:176)

    at org.genomicsdb.reader.GenomicsDBFeatureReader.<init>(GenomicsDBFeatureReader.java:80)

    at org.broadinstitute.hellbender.engine.FeatureDataSource.getGenomicsDBFeatureReader(FeatureDataSource.java:407)

    ... 12 more

    0
    Comment actions Permalink
  • Avatar
    Tiffany Miller

    Hi @wd, sorry for the delay. 

    It appears another user was having this issue and it was due to the sample names getting altered. Does this apply to your situation?

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk