Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

picard CompareSAMs

Answered
0

3 comments

  • Avatar
    Genevieve Brandt (she/her)

    Thank you for your post, Matt Snyder! I want to let you know we have received your question and will be moving it to the Community Discussions -> General Discussion topic, as the Other topic is for reporting bugs and issues with GATK.

    We'll get back to you if we have any updates or follow up questions. Please see our Support Policy for more details about how we prioritize responding to questions. 

    0
    Comment actions Permalink
  • Avatar
    Genevieve Brandt (she/her)

    Hi Matt Snyder

    I'm wondering if you have tried or are aware of the --LENIENT_HEADER option in CompareSAMs? This option allows for the headers to be different in certain fields.

    Let me know if that helps with your comparisons!

    Best,

    Genevieve

    0
    Comment actions Permalink
  • Avatar
    Matt Snyder

    Hi Genevieve Brandt (she/her),

    Sorry, I know this issue is ancient. I did use `--LENIENT_HEADER true` and just re-tried with Version:2.27.4-SNAPSHOT and got the same result. MAPPINGS_DIFFER, UNMAPPED_LEFT, UNMAPPED_RIGHT, MISSING_LEFT, MISSING_RIGHT, and DUPLICATE_MARKINGS_DIFFER are all zero, but ARE_EQUAL is "N". When I use the --LENIENT_HEADER argument, I no longer see "Number of program records differs" in the output log, but I still see "Read Group ID differs". Since the read group is present in the header as well as the body of the BAM, I wonder if this is causing the issue... So maybe this is really a feature request for a --LENIENT_READ_GROUP argument? Again, as I stated before, when we compare SAM/BAM files, I think we are most often concerned with the difference in the mappings, which is reflected in the output table, and often not as interested in read groups.

    Thanks!

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk