Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

GenomicsDBImport Follow

1 comment

  • Avatar
    Maciek Konopinski

    I'm affraid noone answers the questions posted here but I this may be interesting information for visitors of this scarce documentation page.
    If you are going to use vcfs automatically produced by the HaplotypeCaller in the previous step you need to index them. It is unfortunately not mentioned neither here nor in HaplotypeCaller page. It is easy though. Example command is:

    ```
    vcfs=($(find . -type f -name "*.g.vcf"))
    for vcf in ${vcfs[@]}
    do
    gatk IndexFeatureFile -I $vcf
    done
    ```
    I think it can be created at the HaplotypeCaller stage by `--create-output-variant-index`
     or `-OVI` options but I did not check it so I am not sure. i wish there is some comprehensive walk-through guide easy to find at this page...

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk