Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

FindBadGenomicKmersSpark (BETA) Follow

1 comment

  • Avatar
    Anamica Bedi de Silva

    I'm experiencing a long runtime (currently on hour 51) with this tool while running it on my institution's HPC. 

    My reference fasta is a 20Mb PacBio-derived genome. Contigs/chromosomes have been concatenated into one long sequence. 

    Are there any considerations that might help me with a succesful, shorter, run?

    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk