How do I know the maximum number of samples to use in a panel of normals (PoN)?
We'd like to use the latest 1000 Genomes data in a PoN, since it was prepared with the same method we are going to use (NovaSeq 6000, TruSeq PCR-free). This dataset has over 2000 samples we could use, but is it a good idea to use that many? Are there diminishing returns beyond a certain number of samples, or can using too many even be detrimental?