When checking the UMI_METRICS output file from `UmiAwareMarkDuplicatesWithMateCigar`, I found that DUPLICATE_SETS_WITH_UMI > DUPLICATE_SETS_IGNORING_UMI.
What does this mean? I expected the number of real duplicates to be lower when using UMIs, since UMIs make it possible to distinguish PCR duplicates from distinct molecules.
DUPLICATE_SETS_IGNORING_UMI | Number of duplicate sets found before taking UMIs into account
DUPLICATE_SETS_WITH_UMI | Number of duplicate sets found after taking UMIs into account
I found the explanation above, but I don't fully understand it.
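One possible reading of the two definitions, shown as a toy sketch and not as Picard's actual implementation: if a "duplicate set" is a group of reads that are considered copies of the same molecule, then adding the UMI to the grouping key can split one position-based set into several smaller sets. Under that assumption the number of sets goes up while the number of reads counted as duplicates goes down. The reads below are made-up data, the function names are hypothetical, and UMIs are compared by exact match only (no handling of sequencing errors in the UMI).

```python
from collections import defaultdict

# Hypothetical toy reads: (alignment position, UMI). Not real data.
reads = [
    (100, "AAC"), (100, "AAC"),
    (100, "GTT"), (100, "GTT"), (100, "GTT"),
    (250, "CCA"), (250, "TGA"),
]

def group_sizes(reads, use_umi):
    """Return the size of each set when grouping by position, or by (position, UMI)."""
    groups = defaultdict(int)
    for pos, umi in reads:
        key = (pos, umi) if use_umi else pos
        groups[key] += 1
    return list(groups.values())

def summarize(sizes):
    """Count sets, and count duplicates as every read beyond the first in each set."""
    n_sets = len(sizes)
    n_duplicates = sum(n - 1 for n in sizes)
    return n_sets, n_duplicates

sets_ignoring_umi, dups_ignoring_umi = summarize(group_sizes(reads, use_umi=False))
sets_with_umi, dups_with_umi = summarize(group_sizes(reads, use_umi=True))

print("ignoring UMI: sets =", sets_ignoring_umi, "duplicates =", dups_ignoring_umi)
print("with UMI:     sets =", sets_with_umi, "duplicates =", dups_with_umi)
```

In this toy case, grouping by position alone gives 2 sets and 5 duplicate reads, while grouping by position and UMI gives 4 sets and 3 duplicate reads. So a larger set count with UMIs would be consistent with fewer reads being marked as duplicates, if the tool counts sets in a similar way.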