GRCh38_verily vs GRCh38
Hi,
Which reference should I use for my analyses?
The reference "GCA_000001405.15_GRCh38_no_alt_plus_hs38d1_analysis_set.fna.gz" (GRCh38_verily) from https://cloud.google.com/life-sciences/docs/resources/public-datasets/reference-genomes seems reasonable, but all the Terra/GATK pipelines refer to the "original" GRCh38.
Would it be a big faux pas to use "GRCh38_verily" instead of the "original" one? Will I run into problems in downstream analyses?
-
Hi Damian Loska
No, you will not have any problems. Verily's build is described here (https://cloud.google.com/life-sciences/docs/resources/public-datasets/reference-genomes), with all the differences explicitly stated. Note that it uses a chrN-style contig naming scheme (chr1, chr2, ...), so every external resource VCF you intend to use in your pipeline has to follow the same naming.
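If it helps, a quick way to catch a naming mismatch before running anything is to compare the contig names in the reference's FASTA index (.fai) with the ##contig lines in a resource VCF header. The sketch below is only an illustration: the function names are made up for this example, and the inputs are toy strings (the numeric columns in the .fai text are placeholders), not real files.

```python
import re


def fai_contigs(fai_text):
    """Return contig names from .fai text (the name is the first tab-separated column)."""
    return [line.split("\t")[0] for line in fai_text.splitlines() if line.strip()]


def vcf_header_contigs(vcf_header_text):
    """Return contig IDs declared in ##contig=<ID=...> VCF header lines."""
    names = []
    for line in vcf_header_text.splitlines():
        match = re.match(r"##contig=<ID=([^,>]+)", line)
        if match:
            names.append(match.group(1))
    return names


def missing_from_reference(ref_names, vcf_names):
    """Return VCF contig names that are absent from the reference, in VCF order."""
    ref = set(ref_names)
    return [name for name in vcf_names if name not in ref]


# Toy inputs (hypothetical): a chrN-style reference index and a VCF header
# that mixes "1" and "chr2" naming.
toy_fai = "chr1\t100\t6\t60\t61\nchr2\t200\t120\t60\t61\n"
toy_vcf_header = (
    "##fileformat=VCFv4.2\n"
    "##contig=<ID=1,length=100>\n"
    "##contig=<ID=chr2,length=200>\n"
)

mismatched = missing_from_reference(
    fai_contigs(toy_fai), vcf_header_contigs(toy_vcf_header)
)
print(mismatched)
```

Any name this reports would need to be renamed in the VCF (for example with bcftools annotate --rename-chrs) before the file is used together with the chrN-style reference.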