Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

Assistance Required with DetermineGermlineContigPloidy in GATK 4.6

0

5 comments

  • Avatar
    Gökalp Çelik

    Hi Shai Casif

    Before any hyperparameter adjustments have you performed any QC on your samples to see if you have any outliers in terms of coverage, evenness of coverage, AT/GC dropout etc? Also are you using human samples or another organism? Finally are you working on only primary contigs or do you have chrM, and other decoy/unplaced contigs in your analysis?

    Let us know about these items and we will be able to help you better. 

    Regards. 

    0
    Comment actions Permalink
  • Avatar
    Shai Casif

    Gökalp Çelik Thank you for your reply.

    1. I followed the pipline and performed QC using GATK’s CollectReadCounts tool to assess coverage across my samples, and AnnotateIntervals for explicit GC correction. Could you recommend the best way to check for these specific issues using GATK or other tools?

    2. I am working with human samples

    3. The ploidy_priors.tsv file I am using for step 3 includes only chr1 through chr22, as well as chrX and chrY. I came across a post suggesting that any other contigs will be skipped by the tool. Could you confirm if this is the expected behavior or if additional steps are required to handle other contigs?

    Any additional guidance or recommendations you can provide would be greatly appreciated.
    Shai

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi again. 

    Depending on the type of study you have we recommend using CollectWgsMetrics or CollectHsMetrics tools to check mean and median depth as well as evenness of coverage so that you can eliminate outliers from your analysis. 

    Also, can you share your ploidy priors file with us?

    0
    Comment actions Permalink
  • Avatar
    Shai Casif

    Hello Gökalp Çelik and thanks again for the response. 

    This is my ploidy priors file:

    CONTIG_NAME    PLOIDY_PRIOR_0    PLOIDY_PRIOR_1    PLOIDY_PRIOR_2    PLOIDY_PRIOR_3
    chr1    0    0.01    0.98    0.01
    chr2    0    0.01    0.98    0.01
    chr3    0    0.01    0.98    0.01
    chr4    0    0.01    0.98    0.01
    chr5    0    0.01    0.98    0.01
    chr6    0    0.01    0.98    0.01
    chr7    0    0.01    0.98    0.01
    chr8    0    0.01    0.98    0.01
    chr9    0    0.01    0.98    0.01
    chr10    0    0.01    0.98    0.01
    chr11    0    0.01    0.98    0.01
    chr12    0    0.01    0.98    0.01
    chr13    0    0.01    0.98    0.01
    chr14    0    0.01    0.98    0.01
    chr15    0    0.01    0.98    0.01
    chr16    0    0.01    0.98    0.01
    chr17    0    0.01    0.98    0.01
    chr18    0    0.01    0.98    0.01
    chr19    0    0.01    0.98    0.01
    chr20    0    0.01    0.98    0.01
    chr21    0    0.01    0.98    0.01
    chr22    0    0.01    0.98    0.01
    chrX    0.01    0.49    0.49    0.01
    chrY    0.495    0.495    0.01    0

     

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi Shai Casif

    Your ploidy priors file seems fine. Depending on how your samples are represented for the baseline these values can be adjusted. 

    About the other part of my question, Have you performed additional QC to see if any of your samples are outliers compared to the rest?

    Regards. 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk