I have produced a VCF from the best practice pipe line ( bams to gvcf's, genomicDBImport, GenotypeGVCFs, GatherVcfs). I'm using the version gatk-22.214.171.124.
I was wondering if I am using the command --exclude-non-variants correctly, my command is the following:
gatk SelectVariants \
-R /home/genomic_refseq.fna \
-V /home/WGS_RAW.vcf \
--select-type-to-include SNP \
--exclude-non-variants true \
I have produced files with and without the command and I get the same number of SNPs and with the command the file still contains non variant / monomorphic sites in the resulting VCF file.
Am I doing this correctly? Or do I need to perform this command separately? I would like to attain SNPs only at variant sites.
Please sign in to leave a comment.