Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data


Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Mutect2 filter: clustered_events definition

Answered

7 comments

  • David Benjamin

    "Clustered" means co-occurring in the same local assembly interval, usually about 200 bp. By the way, you should upgrade to the most recent release of GATK 4. The GATK 3 version of Mutect2 is obsolete.

  • whynot

    Thanks for the answer!

    May I ask one more question?

    I found new information about the clustered_events filter. When "clustered" variants are counted, there seems to be an additional condition:

    "variants with distance between >= 3 bp"

    I believe this condition holds because variants in my VCF file that are less than 3 bp apart were not caught by the clustered_events filter.

    Is that condition correct?

    If so, does "clustered" mean co-occurring in the same local assembly interval, at a distance of about 3-200 bp?

  • David Benjamin

    I think the intent was to avoid filtering MNPs.  We do it differently in GATK 4.  I can't emphasize enough how important it is to upgrade to the most recent GATK 4 version.

  • Dario

    Is there a table of filters containing detailed definitions of all of the possible ones in a document somewhere? The VCF header ones are too short for readers who weren't involved with the development of the software.

  • David Benjamin

    The most current Mutect2 documentation can be found in our GitHub repo here: https://github.com/broadinstitute/gatk/blob/master/docs/mutect/mutect.pdf

  • Дмитрий Голик

    David Benjamin, unfortunately it does not describe every possible filtering outcome.

    Also, how should I handle MNPs in GATK 4?

  • David Benjamin

    Hi Dmitry, is there a specific situation you are interested in?

    As for MNPs, use the `--max-mnp-distance` option. When it is zero, GATK treats them as independent SNPs.

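
To make the effect of `--max-mnp-distance` from the last reply concrete, here is a minimal Python sketch of distance-based grouping. It is an illustration only, not GATK's implementation. The function name `group_substitutions` is hypothetical, the distance is assumed to be the difference between genomic positions, and phasing is ignored entirely, which a real caller would have to account for.

```python
from typing import List


def group_substitutions(positions: List[int], max_mnp_distance: int) -> List[List[int]]:
    """Group substitution positions into candidate MNPs (illustrative only).

    Assumption: two substitutions join the same group when the gap between
    consecutive positions is <= max_mnp_distance. A value of 0 disables
    merging, so every substitution stays an independent SNP.
    """
    groups: List[List[int]] = []
    for pos in sorted(positions):
        can_merge = (
            bool(groups)
            and max_mnp_distance > 0
            and pos - groups[-1][-1] <= max_mnp_distance
        )
        if can_merge:
            groups[-1].append(pos)  # extend the current candidate MNP
        else:
            groups.append([pos])    # start a new group (a lone SNP so far)
    return groups


snv_positions = [100, 101, 105, 300]
print(group_substitutions(snv_positions, 0))  # [[100], [101], [105], [300]]
print(group_substitutions(snv_positions, 1))  # [[100, 101], [105], [300]]
```

With a distance of 0, every substitution is reported on its own, which matches the behavior described above. With a distance of 1, only the adjacent pair at 100 and 101 would be a candidate for merging in this sketch.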
