Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

Funcotator maf output problems on Varscan2 vcf files

0

10 comments

  • Avatar
    Bhanu Gandham

    Hi,

    Sorry about the delay in getting back to you. Our lead funcotator developer is doing a workshop in Costa Rica to help community outreach.

    Here is the suggestion with regards to your question: The MAF specification expects a tumor/normal pair and in this instance, this requirement is manifesting as requiring a ref (normal) and alt (tumor) allelic depth.  This is not a funcotator issue, but rather a limitation on funcotator due to the specification of MAF files.  This does not occur with VCF output because VCF files do not have the same requirements as MAF files.  This cannot be worked around in the funcotator code, but adding in dummy values for the missing AD field in the data coming from Varscan2 will fix this issue and allow MAF files to be created.

    0
    Comment actions Permalink
  • Avatar
    Egmidio Medina

    Hello,

    I am also interested in running Funcotator on VarScan2 outputs. Are there any new updates regarding when/if Funcotator will allow VarScan2 inputs?

    Thank you.

    0
    Comment actions Permalink
  • Avatar
    Bhanu Gandham

    Hi Egmidio Medina

     

    Funcotator currently does not support MAF files. You could convert MAF to VCF files and then annotate with Funcotator.

    0
    Comment actions Permalink
  • Avatar
    Bhanu Gandham

    We do we have a ticket open for adding maf reading to Funcotator but this will take sometime. You can follow its progress here: https://github.com/broadinstitute/gatk/issues/3703

    0
    Comment actions Permalink
  • Avatar
    Siddharth Prakash

    Hi Bhanu,

    I am running Funcotator on Mutect2 outputs and encountered a similar problem. Funcotator starts normally but then abruptly exits with this error:

    06:26:13.703 INFO Funcotator - Creating a VCF file for output: file:/corral-secure/uth/Sex-Chromosome-Loss/bamFiles/14047.303211.annotated.vcf.gz
    06:26:13.736 INFO ProgressMeter - Starting traversal
    06:26:13.737 INFO ProgressMeter - Current Locus Elapsed Minutes Variants Processed Variants/Minute
    06:26:15.468 INFO VcfFuncotationFactory - dbSNP 9606_b150 cache hits/total: 0/0
    06:26:15.468 INFO VcfFuncotationFactory - gnomAD_exome 2.1 cache hits/total: 0/0
    06:26:15.472 INFO Funcotator - Shutting down engine
    [July 19, 2020 6:26:16 AM CDT] org.broadinstitute.hellbender.tools.funcotator.Funcotator done. Elapsed time: 0.32 minutes.
    Runtime.totalMemory()=1669332992
    java.lang.NullPointerException

    However, Funcotator creates MAF files without errors. What can I do? Should I convert the MAF output to VCF?

    0
    Comment actions Permalink
  • Avatar
    Genevieve Brandt (she/her)

    Hi Siddharth Prakash, could you please provide the GATK version you are using, your command, and the entire error log?

    0
    Comment actions Permalink
  • Avatar
    Siddharth Prakash

    GATK 4.1.8.0

    java -Xmx4G -jar ${GATK_DIR}/gatk-package-4.1.8.0-local.jar Funcotator -R ${REF_DIR}/hs37d5.fa -V ${BAM_DIR}/14047.303211.filtered.vcf.gz -O ${BAM_DIR}/14047.303211.annotated.vcf.gz --output-file-format VCF --data-sources-path ${REF_DIR}/funcotator_dataSources.v1.6.20190124s/ --ref-version hg19

    where

    GATK_DIR="/work/03437/sprakash/lonestar/apps/gatk"

    REF_DIR="/work/03437/sprakash/lonestar/references"

    BAM_DIR="/corral-secure/uth/Sex-Chromosome-Loss/bamFiles"

     

    10:24:36.331 INFO Funcotator - Creating a VCF file for output: file:/corral-secure/uth/Sex-Chromosome-Loss/bamFiles/14047.303211.annotated.vcf.gz
    10:24:36.363 INFO ProgressMeter - Starting traversal
    10:24:36.363 INFO ProgressMeter - Current Locus Elapsed Minutes Variants Processed Variants/Minute
    10:24:37.879 INFO VcfFuncotationFactory - dbSNP 9606_b150 cache hits/total: 0/0
    10:24:37.879 INFO VcfFuncotationFactory - gnomAD_exome 2.1 cache hits/total: 0/0
    10:24:37.882 INFO Funcotator - Shutting down engine
    [July 20, 2020 10:24:39 AM CDT] org.broadinstitute.hellbender.tools.funcotator.Funcotator done. Elapsed time: 0.31 minutes.
    Runtime.totalMemory()=1749024768
    java.lang.NullPointerException
    at htsjdk.tribble.index.tabix.TabixIndexCreator.finalizeFeature(TabixIndexCreator.java:106)
    at htsjdk.tribble.index.tabix.TabixIndexCreator.finalizeIndex(TabixIndexCreator.java:129)
    at htsjdk.variant.variantcontext.writer.IndexingVariantContextWriter.close(IndexingVariantContextWriter.java:177)
    at htsjdk.variant.variantcontext.writer.VCFWriter.close(VCFWriter.java:233)
    at org.broadinstitute.hellbender.tools.funcotator.vcfOutput.VcfOutputRenderer.close(VcfOutputRenderer.java:137)
    at org.broadinstitute.hellbender.tools.funcotator.Funcotator.closeTool(Funcotator.java:890)
    at org.broadinstitute.hellbender.engine.GATKTool.doWork(GATKTool.java:1053)
    at org.broadinstitute.hellbender.cmdline.CommandLineProgram.runTool(CommandLineProgram.java:140)
    at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMainPostParseArgs(CommandLineProgram.java:192)
    at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:211)
    at org.broadinstitute.hellbender.Main.runCommandLineProgram(Main.java:163)
    at org.broadinstitute.hellbender.Main.mainEntry(Main.java:206)
    at org.broadinstitute.hellbender.Main.main(Main.java:292)

    0
    Comment actions Permalink
  • Avatar
    Genevieve Brandt (she/her)

    Hi Siddharth Prakash, it looks like there is an error regarding your index file. Here is a similar example on the forum that may help you with this problem:

    https://gatk.broadinstitute.org/hc/en-us/community/posts/360062293851-Mutect2-issue-with-long-scaffolds-non-human-mouse-

    If you cannot solve it, please start a new post since it is not the same as the one above.

    0
    Comment actions Permalink
  • Avatar
    Siddharth Prakash

    Thanks Genevieve,

    Outputting as uncompressed .vcf worked.

     

    -Siddharth

    0
    Comment actions Permalink
  • Avatar
    Genevieve Brandt (she/her)

    Great! Thanks for the update.

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk