Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GenomicsDBImport -L [string] (but for the whole genome): purpose to generate invariant sites

Answered

8 comments

  • Genevieve Brandt (she/her)

    Hi Pete Thorpe,

    GenomicsDBImport can take more than one interval. I noticed that you are using a very old version of GATK: 4.0.2.1. We are currently on 4.2.4.1 and there have been major changes to GenomicsDBImport, so this problem you are having could be an old bug. 

    Could you try with a newer version of GATK?

    Best,

    Genevieve

  • Pete Thorpe

    Thank you for your reply. I downloaded the Docker images on Nov 19 2021 (15:44), so something strange must have happened here.

    I have just downloaded the new image and this is the version: gatk-package-4.2.4.1-local.jar

    I will try this. 

    Thank you, Pete

  • Pete Thorpe

    VERSION FIX: 

    My download command (which pulled down an old version): singularity pull docker:docker broadinstitute/gatk

    What did pull down the correct version (Red Hat 6.9, running on CentOS 7):

    singularity pull docker:broadinstitute/gatk:latest

    Now trying on Rocky Linux 8. This worked:

    singularity pull docker://broadinstitute/gatk
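The pull commands above differ only in the registry URI and tag. One way to avoid surprises about which release gets pulled is to name the tag explicitly and then ask the image for its version. This is a minimal sketch, assuming Singularity 3.x or Apptainer (where the output file can be given positionally and images use the .sif extension). The tag 4.2.4.1 is the one mentioned in this thread; the helper name gatk_image_uri and the output file name are illustrative only.

```shell
# Release tag taken from this thread; swap in whichever release you need.
GATK_TAG="4.2.4.1"

# Build the registry URI that Singularity expects (note the "://").
# gatk_image_uri is a hypothetical helper, not part of GATK or Singularity.
gatk_image_uri() {
    printf 'docker://broadinstitute/gatk:%s\n' "$1"
}

# Print the commands instead of running them, so the sketch is safe to paste.
# Remove the leading "echo" on a machine with Singularity installed.
echo singularity pull "gatk_${GATK_TAG}.sif" "$(gatk_image_uri "$GATK_TAG")"
echo singularity exec "gatk_${GATK_TAG}.sif" gatk --version
```

Checking the reported version right after the pull would have caught the old 4.0.2.1 image before any analysis was run.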

  • Pete Thorpe

    I tried this with a bed file:

    gatk --java-options "-Xmx40g" GenomicsDBImport --reader-threads 20 \
    -V ../F2014_female_fall_DrosEU_match.g.vcf.gz -V ../MR19female2.g.vcf.gz \
    -V ../MR20female1.g.vcf.gz -V ../MR20male2.g.vcf.gz \
    -V ../F2015_female_fall_DrosEU_match.g.vcf.gz -V ../MR19male1.g.vcf.gz \
    -V ../MR20female2.g.vcf.gz -V ../MR_female.g.vcf.gz \
    -V ../MR19female1.g.vcf.gz -V ../MR19male2.g.vcf.gz \
    -V ../MR20male1.g.vcf.gz -V ../MR_male.g.vcf.gz \
    --genomicsdb-workspace-path allsamples -L ../dmel-all-chromosome-r6.43.bed

    A USER ERROR has occurred: More than one interval specified. The tool takes only one

    Can you please give me an example of how -L can take multiple intervals? (Could this also be added to the docs?)
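As a sketch of the multi-interval form asked about here: in recent GATK 4 releases, -L can be repeated once per interval, and an interval file with several entries should also be accepted through a single -L. The contig names below are assumptions (typical Drosophila melanogaster r6 chromosome arms) and must match the names in your own reference. Only two of the gVCFs from the command above are listed to keep the example short. The snippet uses bash arrays and only prints the assembled command.

```shell
# Assumed contig names; replace with the names from your reference .fai/.dict.
intervals=(2L 2R 3L)

# Two of the gVCFs from the thread, shortened for readability.
gvcfs=(../MR_female.g.vcf.gz ../MR_male.g.vcf.gz)

# Assemble the command as an array so each argument stays intact.
cmd=(gatk --java-options "-Xmx40g" GenomicsDBImport
     --genomicsdb-workspace-path allsamples)
for v in "${gvcfs[@]}"; do cmd+=(-V "$v"); done
for i in "${intervals[@]}"; do cmd+=(-L "$i"); done

# Print the assembled command; run it with "${cmd[@]}" once it looks right.
printf '%s ' "${cmd[@]}"; echo
```

Recent releases also document a --merge-input-intervals flag for importing many small intervals as one span; whether it suits a given dataset is worth checking against the tool documentation for the installed version.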

  • Pete Thorpe

    The docker image version now says: v4.0.2.1-32-g1e4a397-SNAPSHOT  ... I just downloaded this.

    Is Docker Hub up to date?

  • Genevieve Brandt (she/her)

    Yes, Docker Hub is up to date. Maybe there is an issue with which GATK version you have in your path.

    I start my docker image this way and it has always worked for me:

    docker run -it broadinstitute/gatk:4.2.4.1

    Here is an article we have regarding running GATK with Docker: https://gatk.broadinstitute.org/hc/en-us/articles/360035889991--How-to-Run-GATK-in-a-Docker-container

  • Pete Thorpe

    Thank you for your time. I have added a "version fix" to a comment above. Lesson learned for me: I just assumed it had pulled down the latest. (Sorry.)

  • Genevieve Brandt (she/her)

    Oh good, I'm glad to hear that you have it working now! Thanks for writing in so we could get GATK working for you!
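For the whole-genome case in the thread title, one possible approach is to derive one interval per contig from the reference's FASTA index and pass the resulting file to -L. This is a sketch under assumptions: demo.fa.fai is a stand-in index with made-up contig lengths, and whole_genome.list is an illustrative file name. The .fai columns (name, length, offset, bases per line, bytes per line) follow the samtools faidx format, and the output uses GATK's 1-based contig:start-end interval notation.

```shell
# Stand-in .fai index with made-up lengths; use your real reference index.
printf '2L\t1000\t4\t60\t61\n2R\t2000\t1025\t60\t61\n' > demo.fa.fai

# Write one whole-contig interval per line (contig:1-length).
awk -F'\t' '{ printf "%s:1-%s\n", $1, $2 }' demo.fa.fai > whole_genome.list

cat whole_genome.list
```

The resulting file could then be supplied as -L whole_genome.list in place of listing every contig on the command line.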

