Question GetPileupSummaries, columns all 0
If you are seeing an error, please provide(REQUIRED) :
a) GATK version used: 4.1.9.0
b) Exact command used:
gatk GetPileupSummaries -I ../SRR10022835.md.bam -V $breast/chr17.af-only-gnomad.raw.sites.hg19.vcf.gz -L $breast/chr17.af-only-gnomad.raw.sites.hg19.vcf.gz -O SRR10022835.test.pileups.table
c) Entire error log:
NA
If not an error, choose a category for your question(REQUIRED):
c) Why do I see (......)?
Why when I run GetPileupSummaries on bam file that has duplicates marked with gnomadVCF are all columns ref_count, alt_count, other_alt_count zeros for all rows even when there is an alignment at the position? This is WXS. I also aligned reads with bowtie2 will defaults.
-
Hi Brian Wiley, could you give some more information about this issue? What is the expected output, could you also include the stack trace and an example of the incorrect output?
https://gatk.broadinstitute.org/hc/en-us/articles/360053424571-How-to-Write-a-Post
-
Hi Genevieve Brandt (she/her),
Here is an example of an output when I ran for above command:
BAM:
671723 147 chr17 1352 42 76M = 1272 -156 AAAAAAGTTTGGGGGGATTCCCCTAAGCCCGCCACCCGGAGACAGCGGATTTCCTTAGTTACTTACTATGCTCCTT
JJJJJJJJJJJJJJJJJJJJFJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJJFFFAA MD:Z:76 PG:Z:MarkDuplicates RG:Z:Sample.1 XG:i:0 NM:i:0 XM
:i:0 XN:i:0 XO:i:0 AS:i:0 YS:i:0 YT:Z:CP
13179887 1171 chr17 1352 42 76M = 1272 -156 AAAAAAGTTTGGGGGGATTCCCCTAAGCCCGCCACCCGGAGACAGCGGATTTCCTTAGTTACTTACTATG
CTCCTN <F-JJAFFJJJJJFA-FA7FJFJA-7AJJJJJJJJJFJJJJJJJJJAFJAAFJJJJJJJJJJJAJJFF7FJFFAA# MD:Z:75T0 PG:Z:MarkDuplicates RG:Z:Sample.1 XG:i
:0 NM:i:1 XM:i:1 XN:i:0 XO:i:0 AS:i:-1 YS:i:-1 YT:Z:CPThese reads start at 1352 and are 76 bps long. Should there be values of 2 in the pileups under ref_count since CIGARS are 76M and both overlap 1389 and 1397?
Pileup (why are columns 3-5 zero?)
contig position ref_count alt_count other_alt_count allele_frequency
chr17 1389 0 0 0 0.074
chr17 1397 0 0 0 0.04Maybe I am not sure exactly how GetPileupSummaries works, but I assumed it was the summary of ref match and alt counts that samtools mpileups does?
-
Thanks for that information, it is helpful in figuring this out.
Could you check your Bam file for any issues with ValidateSamFile? Here is a tutorial on how to troubleshoot the output.
Thanks,
Genevieve
-
Thanks. Just ran it and no errors found. I'll see what happens if I run on other datasets I guess later.
-
Ok, that will be helpful. Let me know what you find and we can keep troubleshooting.
-
Hi Brian Wiley,
I just wanted to follow up with some extra feedback about what could be causing this issue. This tool is a very straightforward tool with no known bugs, so most likely this is caused by no coverage at the regions you are looking at. If you have input reads that you are sure are in the range of your intervals, they may also be getting filtered out. There are some default read filters with this tool. You can check out the stack trace for more information about the read filters applied.
Hope this helps.
Best,
Genevieve
-
Hi Brian-
Did you ever resolve this issue? I'm running in to the same problem, and I have verified that there is coverage and that no reads are filtered.
Post is closed for comments.
7 comments