Genome Analysis Toolkit

simon lee · December 14, 2021 04:06

If you are seeing an error, please provide(REQUIRED) :
a) GATK version used: 4.1.4.1
b) Exact command used:

 gatk CollectReadCounts \
--minimum-mapping-quality $MQmin \
-L $binnedIntervalsPadded \
-R $refGen -imr OVERLAPPING_ONLY \
-I ${bamList[${i}]} --format TSV \
-O $tsvFolderFinalPath/${tsvList[${i}]}

performed on targeted wgs data. Padding 0bp, intervals 300bp

The gatk documentation says that getreadcounts uses a default 10MQ cutoff:

However. I have extensively tested the filters on this step because the counts I am getting do not match the IGV visualization.

note: I know counts are defined as the number of start read sites in a window.

I have turned off every other filter except quality and I am getting a count of 1 in a window identified as a double deletion by gCNV pipeline. However, the count is actually 45, by manually counting read start sites in the given interval in IGV with a quality filter of 10.

I finally identified the quality filter as the cause and set to experiment with thresholds:

MQmin=0: count = 46

MQmin=5: count = 45

MQmin=10: count = 45

MQmin=30: count = 1

MQmin=36: count = 0

I feel silly now for wasting so much time trusting that the defaults were what they said they were, causing me to look everywhere but the quality filter for the cause of my discrepancy.

So firstly: There is an error here: either an error in the tutorial documentation saying the default is 10 when it is actually 30, or an error in the code's defaults, setting to 30 when it should be 10.

and Secondly (if the 30 default is correct): Why is the default GATK mapping quality filter so high for the purposes of cnv analysis, and should I be using a different threshold when identifying double deletions. The current default seems absurd but I would like some feedback from long time users of the tool.

best regards,

Simon

Genome Analysis Toolkit

Need Help?

Community Forum

CollectReadCounts tutorial incorrectly states default read quality threshold

1 comment

Welcome

Didn't find what you were looking for?

Quick Links

Recent GATK News

About the GATK community