Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

GenotypeGVCFs

Answered
0

1 comment

  • Avatar
    Anthony DiCi

    Hi Yumi Hashi,

    Thank you for writing to the GATK forum. I hope that we can help you sort this out.

    So when looking at your command line history, your second command seems inconsistent with the first command. When you ran the VcfToIntervalList command, it generated the interval file to the current directory, which is the parent of bwa_bam. In the GenomicsDBImport command, you specified the directory location as one step below the interval list you had just created with the first command.

    Try rerunning with the corrected commands and see if that works. If not, please let me know, and we can investigate further.

    Original:

    $gatk VcfToIntervalList 
     I=bwa_bam/gvcf/sample1_markdup.addRG.g.vcf 
     O=sample.interval_list

    $gatk GenomicsDBImport \
      --genomicsdb-workspace-path bwa_bam/GenomicDB \
      --intervals bwa_bam/sample.interval_list \
       --sample-name-map bwa_bam/gvcf_sample_name.txt


    Corrected:

    $gatk VcfToIntervalList \
     I=bwa_bam/gvcf/sample1_markdup.addRG.g.vcf \
     O=bwa_bam/sample.interval_list

    $gatk GenomicsDBImport \
      --genomicsdb-workspace-path bwa_bam/GenomicDB \
      --intervals bwa_bam/sample.interval_list \
       --sample-name-map bwa_bam/gvcf_sample_name.txt

    Best,
    Anthony

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk