Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

GATK HaplotypeCaller: only first chromosome in output vcf

0

4 comments

  • Avatar
    Lily Peck

    Update >>

    I am running this as a while loop, looping through each chromosome / unplaced contig. However I would still like to run it once, on every chromosome and contig in the assembly and bam producing one vcf. Is this possible? 

    Thanks

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi Lily Peck

    It is possible that the compute environment you are working has limitations on how long each task should last therefore your jobs do not reach a completion and therefore prematurely terminated by the system. 

    You may call each chromosome separately into a different file and later use GatherVcfs tool to combine them into a single call file. 

    I hope this helps.

    Regards. 

    0
    Comment actions Permalink
  • Avatar
    Lily Peck

    Hello

    Thank you for getting back to me. The problem is not caused by the computing environment, the job is finishing (not terminating unexpectedly) and it only takes ~ 20 mins. 

    Is there a way to run all chromosomes at once? 

    Thanks

    Lily

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi again

    Is this part of the log that your command produces?

    Job 5237262 ended on:    n1891
    Job 5237262 ended on:    Wed Oct 16 16:48:22 PDT 2024

    From the looks of it your command did not even last more than 3 minutes in time. 

    This may still be due to compute environment limitations that are not evident to you directly. We strongly recommend you to consult your IT specialists to help you extend the time you need to run GATK. HaplotypeCaller does not have any such limitations to end prematurely. 

    I hope this helps.

    Regards. 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk