Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

Missing documentation on strict_strand

Answered
0

4 comments

  • Avatar
    Genevieve Brandt (she/her)

    Hi GE,

    Regarding the VCF header, we do not change the header based on whether or not a certain filter was applied with a tool. I can see how that may be confusing, but that is how GATK works currently.

    For the strict_strand filter, it is turned on when the argument --min-reads-per-strand is greater than 0. The integer sets the minimum number of reads required on both strands to support the alternate allele.

    This strict_strand filter is a hard filter and is removing variants when they are only supported by reads in one direction. You can lose a lot of sensitivity with this filter, especially if you do not have high coverage. The reason why your variant is filtered by weak_evidence with the strict_strand filter turned on is because you are losing sensitivity. Mutect's internal model for the tumor being active loses evidence, which changes the weak_evidence filter.

    The strict_strand filter is an advanced filter and we do not recommend it for general users.

    Hope this helps you understand the output,

    Genevieve 

    0
    Comment actions Permalink
  • Avatar
    GE

    Thanks. That makes sense. The only part I think should be evaluated in future GATK versions is: "Regarding the VCF header, we do not change the header based on whether or not a certain filter was applied with a tool. "

    This is because I think that situations of 'empty data' (i.e. the filter was not applied and that is why the variant doesn't have the filter) should be distinguished from 'data=0' situations (filter was applied but not found for the variant).

    0
    Comment actions Permalink
  • Avatar
    Genevieve Brandt (she/her)

    Yes, that is a confusing situation, but it is not what headers are meant to be used for. I discussed the header situation with the GATK team today and they noted that they are not planning to change this behavior. If other users chime in and also would like us to change this, however, we can continue the conversation.

    0
    Comment actions Permalink
  • Avatar
    GE

    Sure. Note that if others think this is useful, a compromise might be to add to the tag "NOT-USED", so that the description of the tag is still present, but the user also can see that it was not applied.

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk