Median distance of variant starts from ends of reads supporting each allele (MPOS)
Category Variant Annotations
Overview
Median distance of variant starts from ends of reads supporting each alt allele. The output is an array containing, for each alt allele, the median distance of the variant start from the closest read end over all reads that best match that allele. For example, for variant context with ref allele A and alt allele C the read position for alt-supporting read GGGGCTT is 2 because the A to C substitution is 2 bases from the right end of the read, which is less than its distance from the left end. For variant context with ref allele AG and alt allele A (deletion) the read position of alt-supporting read ATTTTT is 0. For variant context with ref allele A and alt allele AG (insertion) the read position of alt-supporting read TTTTAG is 1.The annotation considers only the read's bases themselves and not the position they map to with respect to the reference. For example, suppose a substitution is preceded by 80 matching bases and followed by 10 matching bases, a 10-base deletion, and 10 more matching bases. Its distance from the end of the read is 20 bases, not 30 bases, because the deleted bases belong to the reference, not the read. Similarly soft-clipped bases are counted in the distance.
This annotation is useful for filtering alignment artifacts.
GATK version 4.2.2.0-SNAPSHOT built at Thu, 19 Aug 2021 09:49:28 -0700.
0 comments
Please sign in to leave a comment.