Rank sum test of REF versus ALT base quality scores (BaseQRankSum)
Category Variant Annotations
Overview
Rank Sum Test of REF versus ALT base quality scoresThis variant-level annotation tests compares the base qualities of the data supporting the reference allele with those supporting the alternate allele. The ideal result is a value close to zero, which indicates there is little to no difference. A negative value indicates that the bases supporting the alternate allele have lower quality scores than those supporting the reference allele. Conversely, a positive value indicates that the bases supporting the alternate allele have higher quality scores than those supporting the reference allele. Finding a statistically significant difference either way suggests that the sequencing process may have been biased or affected by an artifact.
Statistical notes
The value output for this annotation is the u-based z-approximation from the Mann-Whitney-Wilcoxon Rank Sum Test for base qualities (bases supporting REF vs. bases supporting ALT). See the method document on statistical tests for a more detailed explanation of the ranksum test.
Caveat
The base quality rank sum test can not be calculated for sites without a mixture of reads showing both the reference and alternate alleles.
GATK version 4.2.0.0-SNAPSHOT built at Mon, 22 Feb 2021 13:44:49 -0800.
2 comments
We are developing a pipeline and we think that we can detect False Positive (FP) variations using this approach. We found VariantAnnotator and this parameter very useful but we don't understand the results of this BaseQualityRankSum test in some scenarios.
We have a training dataset so we know what variants are FP had what are True Positive (TP). Considering the following TP
Why do these TP variants get a very low BaseQRankSum??? I show in the table the average base quality of the reads supporting REF and the reads supporting ALT with the SD. The VAF is around 50% and the Depth is always >1000 reads so it should be enough reads in both groups of reads to compute a better Z score.
I have plot all my data to see the VAF against the BaseQRankSum
I am looking for an approach to reduce FP without misclassifying TP. With the results I have in the previous plot, this task is not possible.
However, if I plot VAF against the average quality base of reads supporting REF minus the average quality base of reads supporting ALT, we can see a better distribution of FP and TP
Z=-40
The two-tailed P value is less than 0.0001
By conventional criteria, this difference is considered to be extremely statistically significant. So, Why is telling me that the TP variants in the table are extremely statistically FP?
The code I used is
Interestingly the MQRankSum uses the same statistical test that the BaseQualityRankSumTest, and this statistical analysis seem to work fine
Are both Z-score calculated in the same way??
Please sign in to leave a comment.