Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

CombineGVCFs - KEY END found in VariantContext field INFO but this key isn't defined in the VCFHeader

0

6 comments

  • Avatar
    Pamela Bretscher

    Hi Ryan Sun,

    It looks like your VCF header may be missing a declaration of the "END" INFO field which is causing the error. Can you take a look at this similar post which shows the potential solution: https://www.biostars.org/p/371307/. Please let me know if this is helpful.

    Kind regards,

    Pamela

    0
    Comment actions Permalink
  • Avatar
    Ryan Sun

    Hi Pamela,

    Thanks for your response. I followed the suggestions of that post, and manually added END tag to VCF header. It worked but ran into other problems later.

    One person in Github gave me a reminder about this issue and I carefully checked best practice again, finding the CombineGVCFs must take gVCF (genomic VCF) files as input. Unfortunately, I used normal VCF files (forgot to add -ERC GVCF in HaplotypeCaller step). I tried again and it worked well this time.

    Anyway, I really appreciated your time and effort!

    Ryan

    0
    Comment actions Permalink
  • Avatar
    Pamela Bretscher

    Hi Ryan Sun,

    Okay, I'm glad to hear that was successful and that you were able to get HaplotypeCaller to work as well. Are there any issues that you are still experiencing that I can help with?

    Kind regards,

    Pamela

    0
    Comment actions Permalink
  • Avatar
    Niharika

    Hi Ryan Sun

    will you please tell me how did you add END tag in the VCF?

    Also I would like to know if you got an end key reported in vcf at the position where error is thrown? I can not see any END tag in vcf which is not declared in vcf header

     

    0
    Comment actions Permalink
  • Avatar
    kevin wang

     

    Key END found in VariantContext field INFO at Hic1:818 but this key isn't defined in the VCFHeader

    java.lang.IllegalStateException: Key END found in VariantContext field INFO at Hic1:818 but this key isn't defined in the VCFHeader.  We requir

    e all VCFs to have complete VCF headers by default.

            at htsjdk.variant.vcf.VCFEncoder.fieldIsMissingFromHeaderError(VCFEncoder.java:202)

            at htsjdk.variant.vcf.VCFEncoder.write(VCFEncoder.java:141)

            at htsjdk.variant.variantcontext.writer.VCFWriter.add(VCFWriter.java:248)

            at org.broadinstitute.hellbender.tools.walkers.CombineGVCFs.endPreviousStates(CombineGVCFs.java:407)

            at org.broadinstitute.hellbender.tools.walkers.CombineGVCFs.mergeWithNewVCs(CombineGVCFs.java:325)

            at org.broadinstitute.hellbender.tools.walkers.CombineGVCFs.apply(CombineGVCFs.java:164)

            at org.broadinstitute.hellbender.engine.MultiVariantWalkerGroupedOnStart.apply(MultiVariantWalkerGroupedOnStart.java:68)

            at org.broadinstitute.hellbender.engine.MultiVariantWalker.lambda$traverse$1(MultiVariantWalker.java:119)

            at java.util.stream.ForEachOps$ForEachOp$OfRef.accept(ForEachOps.java:184)

            at java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:193)

            at java.util.stream.ReferencePipeline$2$1.accept(ReferencePipeline.java:175)

            at java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:193)

            at java.util.Iterator.forEachRemaining(Iterator.java:116)

            at java.util.Spliterators$IteratorSpliterator.forEachRemaining(Spliterators.java:1801)

            at java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:481)

            at java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:471)

            at java.util.stream.ForEachOps$ForEachOp.evaluateSequential(ForEachOps.java:151)

            at java.util.stream.ForEachOps$ForEachOp$OfRef.evaluateSequential(ForEachOps.java:174)

            at java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:234)

            at java.util.stream.ReferencePipeline.forEach(ReferencePipeline.java:418)

            at org.broadinstitute.hellbender.engine.MultiVariantWalker.traverse(MultiVariantWalker.java:117)

            at org.broadinstitute.hellbender.engine.MultiVariantWalkerGroupedOnStart.traverse(MultiVariantWalkerGroupedOnStart.java:122)

            at org.broadinstitute.hellbender.engine.GATKTool.doWork(GATKTool.java:1039)

            at org.broadinstitute.hellbender.cmdline.CommandLineProgram.runTool(CommandLineProgram.java:139)

            at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMainPostParseArgs(CommandLineProgram.java:191)

            at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:210)

            at org.broadinstitute.hellbender.Main.runCommandLineProgram(Main.java:162)

            at org.broadinstitute.hellbender.Main.mainEntry(Main.java:205)

            at org.broadinstitute.hellbender.Main.main(Main.java:291)

    0
    Comment actions Permalink
  • Avatar
    Ryan Sun

    Hi Niharika,

    As I mentioned in my last comment "One person in Github gave me a reminder about this issue and I carefully checked best practice again, finding the CombineGVCFs must take gVCF (genomic VCF) files as input. Unfortunately, I used normal VCF files (forgot to add -ERC GVCF in HaplotypeCaller step). I tried again and it worked well this time." I actually regenerated new gVCF files to combine, raw VCF file from HaplotypeCaller will not work for CombineGVCFs. Hope this can help you.

    1
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk