Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

performance of indel calling by MuTect2

Answered
1

5 comments

  • Avatar
    Mark Fleharty

    The sensitivity and false positive rate you get with Mutect2 will depend on a lot of variables.

    How deep is your sequencing, are you doing WGS or a panel?  Are your samples FFPE?  Are you using a tumor/normal pair, what is the allele fraction of your variants, etc. 

    Indels are more difficult than SNVs, so it is not unusual to have poorer performance for indels.

     

    0
    Comment actions Permalink
  • Avatar
    huan wang

    Our coverage for tumor normal is around 200X / 120X on WES data, using FFPE tumor with matching blood. AFs of false positive indels vary with in the range of 0.02 and 0.4 I believe. I would say over 50% of the indels which I manually reviewed in IGV turned out to be false positives due to poor mapping quality or other issues seen in IGV

    0
    Comment actions Permalink
  • Avatar
    Mark Fleharty

    Having 50% of indels be false positives is definitely a problem.  FFPE can cause problems, but typically don't cause mapping artifacts.

    I would expect that mapping artifacts would be filtered well due to your normals.

    So I'm honestly at a bit of a loss as to why you are having so many false positives.  Perhaps I could take a brief look at your data?

    0
    Comment actions Permalink
  • Avatar
    huan wang

    These data were generated using tumor/normal pairs but without the use of panel or normal. Perhaps that could be a cause? I'm happy to share one T/N pair with you. What would you recommend the best way of data sharing?

    0
    Comment actions Permalink
  • Avatar
    Mark Fleharty

    If your data is on Google Cloud, that is probably easiest.  Just give me permissions to access, and let me know the gs:// URL.

    Otherwise, dropbox, and other sharing services work.

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk