Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

VariantsToTable not extracting INFO 'sub-fields'

Answered
0

3 comments

  • Avatar
    Genevieve Brandt (she/her)

    Hi Francesco Mazzarotto, what you are attempting to do with VariantsToTable is not possible. We don't have a GATK tool that will do exactly what you want, but you could look into getting MAF output format from Funcotator and the results will be easier to manipulate into a table.

    0
    Comment actions Permalink
  • Avatar
    Shahryar Alavi

    But MAF output is somewhat different from VCF; and I think the VCF output format is better for germline variant annotation.

    With Funcotator we get an integrated (and minuter) "variant calling - annotation" workflow. But the problem is "vertical bar" separated INFOs are not easy for downstream text processing.

     

    I have two suggestions for the GATK Team:

    You may want to develop a new tool (like VariantsToTable) to separate each "sub-info" in the FUNCOTATION INFO, and put them into separate columns with corresponding headers when creating the tab-delimited table.

    Or add a feature to Funcotator to create multiple INFOs with FUNCOTATION prefix in their IDs; e.g.

    #INFO=<ID=FUNCOTATION_Gencode_34_hugoSymbol,...>
    #INFO=<ID=FUNCOTATION_Gencode_34_ncbiBuild,...>

    instead of

    #INFO=<ID=FUNCOTATION,...,Description="Funcotation fields are: Gencode_34_hugoSymbol|Gencode_34_ncbiBuild|...">

    Thanks

    0
    Comment actions Permalink
  • Avatar
    Genevieve Brandt (she/her)

    Hi Shahryar Alavi,

    I brought up your feature request to the GATK team and they agreed that it was a great idea and would be highly useful for Funcotator results. I created a feature request issue ticket on the GATK github so that our team can add the tool whenever resources allow: https://github.com/broadinstitute/gatk/issues/7556. Feel free to chime in with your thoughts in the comment section of that ticket.

    Best,

    Genevieve

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk