Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

GenotypeGVCFs: How can I tell which sample failed in joint genotyping?

Answered

5 comments

  • Genevieve Brandt (she/her)

    Hi Vinod Kumar,

    You should get some warning or error if certain chromosomes fail the genotyping step. You can check the GenomicsDB to see if those chromosomes are present. If not, search for the error in the stack trace output of GenotypeGVCFs.

    Best,

    Genevieve

  • Vinod Kumar

    Hi Genevieve Brandt (she/her),

    Thanks for the reply. I am actually asking about a case where a few samples failed for one particular chromosome, while the same samples are fine for the rest of the chromosomes. When a whole chromosome fails or gives errors, that is easy to spot, but what should I do when only a few samples produce no results for a single chromosome? Can I find this information somewhere in the GenomicsDB workspace or in the GenotypeGVCFs results?

    Thanks,

    Vinod

     

  • Genevieve Brandt (she/her)

    Yes, this information should be available in the stack trace output. If you can find more information about why this happened (from an error or warning) then we can make sure it doesn't happen next time.

  • Vinod Kumar

    Hi Genevieve Brandt (she/her),

    I couldn't find a reason why only 16 out of 973 samples failed for one particular chromosome. The one odd thing is that the files in the temp directory for this chromosome were not deleted during GenomicsDBImport. However, everything looks okay in the error file. I am now updating the GenomicsDB again with these 16 samples, after first deleting them from the callset.json file. I will see whether that solves the problem.

    In GenotypeGVCFs, I can see that these samples are not in the stack trace output, but I have no idea whether they were successfully imported into the GenomicsDB. For now I am updating the old DB and will see the results.

    Thanks,

     

  • Genevieve Brandt (she/her)

    Hi Vinod Kumar,

    Sounds good, please let me know if it solves the problem. In the meantime, if you would like me to take a look, please share the GenomicsDBImport stack trace for one of the samples that failed.

    Best,

    Genevieve

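
The thread leaves open how to tell whether a given sample was registered in the GenomicsDB workspace at all. One rough way to check is to compare the list of samples you expected to import against the names recorded in the workspace's callset.json. The sketch below is only an illustration: the two JSON layouts it handles (a list of entries with a "sample_name" key, or a mapping keyed by sample name) are assumptions about the file format, and the function names are hypothetical. Note also that a sample being listed in callset.json does not prove its data was written for every chromosome.

```python
import json


def callset_sample_names(callset):
    """Return the sample names recorded in a parsed callset.json document."""
    entries = callset.get("callsets", [])
    if isinstance(entries, dict):
        # Assumed layout A: {"callsets": {"sampleA": {...}, ...}}
        return set(entries)
    # Assumed layout B: {"callsets": [{"sample_name": "sampleA", ...}, ...]}
    return {entry["sample_name"] for entry in entries}


def missing_from_workspace(expected_samples, callset):
    """List expected samples that callset.json does not mention."""
    return sorted(set(expected_samples) - callset_sample_names(callset))


# Small in-memory example; for a real workspace, load the file with
# json.load(open(path_to_callset_json)) instead.
example = json.loads(
    '{"callsets": [{"sample_name": "S1", "row_idx": 0},'
    ' {"sample_name": "S2", "row_idx": 1}]}'
)
print(missing_from_workspace(["S1", "S2", "S3"], example))
```

Any names this reports are samples that were never registered in the workspace, which is a different situation from samples that were registered but have no data for one chromosome.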

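
For the original question (which samples produced no results on one chromosome), a simple check on the GenotypeGVCFs output itself is to look for samples whose genotype is missing at every record on that chromosome. The sketch below parses plain-text VCF lines and is only an approximation of "failed": it assumes a failed sample shows up as all-missing genotypes (such as ./.), which can also happen for other reasons, for example no coverage. The function name is hypothetical.

```python
def samples_without_calls(vcf_lines, chrom):
    """Return samples whose genotype is missing at every record on `chrom`.

    If the VCF has no records on `chrom`, every sample is returned.
    """
    samples = []
    called = set()
    for line in vcf_lines:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # skip blank lines and meta-information lines
        fields = line.split("\t")
        if line.startswith("#CHROM"):
            samples = fields[9:]  # sample columns follow FORMAT
            continue
        if fields[0] != chrom:
            continue
        for name, call in zip(samples, fields[9:]):
            # When GT is present, it is the first FORMAT key.
            genotype = call.split(":", 1)[0]
            alleles = genotype.replace("|", "/").split("/")
            if any(allele != "." for allele in alleles):
                called.add(name)
    return [name for name in samples if name not in called]


# Tiny made-up VCF for illustration only.
lines = [
    "##fileformat=VCFv4.2",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tS1\tS2",
    "chr1\t100\t.\tA\tG\t50\t.\t.\tGT:DP\t0/1:12\t./.:0",
    "chr1\t200\t.\tC\tT\t60\t.\t.\tGT:DP\t1/1:9\t./.:0",
    "chr2\t300\t.\tG\tA\t70\t.\t.\tGT:DP\t0/0:8\t0/1:11",
]
print(samples_without_calls(lines, "chr1"))
```

For a real, bgzip-compressed VCF, the lines can be streamed with `gzip.open(path, "rt")` from the Python standard library instead of building a list in memory.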
