Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

discrepency between CollectReadCounts and CollectWgsMetrics

0

2 comments

  • Avatar
    Gökalp Çelik

    Hi Le Qi

    CollectReadCounts only counts the number of reads passing its filters that have their start sites within the interval so it is not directly comparable to mean or median coverages. 

    I hope this helps. 

    0
    Comment actions Permalink
  • Avatar
    Le Qi

    Hi Gökalp Çelik

    Thanks for the reply. I understand this is how CollectReadCounts works, so it's intended behavior then. The only reason I called it coverage was that it's the first step to generate medianCov.transposed.bed file in the EvidenceQC module. This is in turn used by several later modules. I'm having trouble figuring out if it's only the ratio or the absolute number of coverages across samples in this file that matters. 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk