Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

INVALID_CIGAR error after ClipOverlap

0

3 comments

  • Avatar
    Gökalp Çelik

    Hi Hanan Altbach

    We cannot give any recommendations on BamUtil however let me ask you a few questions to track this issue.

    1- Do you observe this kind of error messages before BamUtil step?

    2- Is there a particular reason why you would like to Clip overlapping reads? Many of our tools can handle read overlaps properly and there is no need to clip those reads. 

    Regards 

    0
    Comment actions Permalink
  • Avatar
    Hanan Altbach

    Hi Gökalp Çelik. Thanks for your response!

    To answer the questions:

    1- The CIGAR issue is not observed before the BamUtil step, but the "ERROR: MATE_NOT_FOUND 421257" is. Running AddOrReplaceGroups resolved read group errors and warnings. 

    2- ClipOverlap was part of the tutorial I used to guide my pipeline (https://github.com/nt246/lcwgs-guide-tutorial). To be honest, skipping this step would have saved me a headache.

    Much appreciated

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi again.

    Looks like you can skip that step since many contemporary variant callers have proper algorithms to handle overlapping basepairs from paired reads and these methods are usually on by default i.e. HaplotypeCaller.

    Regards. 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk