Remove Duplicate Samples from GenomeDB
I may be missing something, how does one perform "GenotypeGVCFs" on only a subset of individuals in a genomeDB? It can't be the case that I have to recreate the entire genomedb if I want to remove individuals from it before joint genotyping? I have recently discovered I have duplicates imported to my genomeDB and want to remove them. I have searched multiple times for an answer to this in the forum and tool documentation both for GenotypeGVCFs and GenomeDBImport (and everything else). However, I wouldn't put it past me to have missed something and want to check before I re-import all of my samples to a new database.
-
You have got it correct. AFAIK genomicsDB only allows samples to be updated, but doesn't support removal / replacement of samples.
A new genomicsDB would have to be created. Do check out the ReblockGVCF tool, it will make creating new dbs faster and saves on storage as well.
Please sign in to leave a comment.
1 comment