Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

Best Practices for Tumor-Only WES Data Analysis: Seeking Feedback on Variant Calling Pipeline

0

5 comments

  • Avatar
    Gökalp Çelik

    Hi georgepou

    You seem to be doing all that is possible with tumor-only calling approach yet we do not usually recommend doing it. If you don't have matched normals for your tumor samples there is really nothing much that you can do except that you can lower

    --initial-tumor-lod 

    parameter to retain ultra low frequency calls which could potentially be important depending on the sample type you are working on. 

    For the CNV approach we have our own Somatic CNV calling workflow however it also requires "Panel of Normals" for proper CNV segmentation. We do not have provisions for any other algorithms therefore with cnvkit your mileage may vary. 

    I hope this helps.

    Regards. 

    1
    Comment actions Permalink
  • Avatar
    georgepou

    I'll give --initial-tumor-lod a try.

    Thank you so much for your feedback; it’s always helpful to get confirmation, especially when working with uncommon workflows.

    0
    Comment actions Permalink
  • Avatar
    Abhishek Ajay

    Hi georgepou

    I have a similar setup of tumor-only wes samples that I am trying to run somatic calling on. However, I didn't see a mention on interval files in your post. Did you use a target exome interval files for your samples? If so, were they acquired through the sequencing company or GATK resource bundle? 

    I have found the following file but not sure if this is the correct file to be used for exome sequences.

    File: genomics-public-data/resources/broad/hg38/v0/wgs_calling_regions.hg38.interval_list

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi Abhishek Ajay

    We recommend using the interval files from the original manufacturer for your whole exome based work. 

    Regards. 

    0
    Comment actions Permalink
  • Avatar
    Abhishek Ajay

    Thank you for the response, Gökalp Çelik

     

     

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk