GATK 4.2.6.0 and GATK 4.2.6.1 represent changes to GATK corresponding to the period from February 4, 2022 - April 13, 2021. While we always recommend using the newest version of GATK, in this particular case, it is critical that you upgrade.
There were some especially critical bugs that were discovered in GATK 4.2.5.0 that were discovered, and these have been addressed in the newest version.
Some days you're the pigeon, other days you find a bug in the code that makes everyone else the statue.
Spanner in the machine
When we released GATK 4.2.5.0, some changes were made to GenotypeGVCFs
and GenomicsDB
that resulted in NullPointerException
in some cases with many alternate alleles. Additionally, a feature of the QUAL calculation was disabled that would lead to false positives and low quality alleles in some multi-allelic sites.
As a result, for any workspaces or pipelines that have used GenotypeGVCFs
under GATK 4.2.5.0 (such as the WARP Joint Genotyping pipeline) we strongly recommend that you re-run the analyses under GATK 4.2.6.1.
It may be possible to filter out multiallelic alleles if your analysis has already been completed, but the safest best would definitely be to re-run the analysis, to ensure that the data was not affected by these bugs at all.
In addition, we fixed the "Bucket is a Requester Pays bucket but no user project provided" error that would occur when accessing Requester Pays buckets in Google Cloud Storage, even though the --gcs-project-for-requester-pays
argument was being specified. If you experienced this error in the past, it should be resolved now, however if you continue to have difficulties accessing Requester Pays Google Cloud Storage buckets, please let us know by filing an issue on GitHub.
What else is new in GATK 4.2.6.1
In additional to bug fixes, a slew of new tools and features have been added to GATK. The full release notes are available on the GATK GitHub, but here is a rundown of our main highlights in this version.
-
New RNA tools:
PostProcessReadsForRSEM
is for re-ordering and filtering reads before running RSEM (for estimating gene and isoform expression levels from RNA-seq data). Also, theTransferReadTags
tool can transfer the read tags from an unaligned BAM to its matching aligned BAM counterpart. These are tags would normally get lost when converting a SAM file to FASTQ and then back to SAM (ie. when clipping adapter bases before alignment. -
Two new tools for the Structural Variation calling pipeline: The
SVAnnotate
tool adds functional annotations to SVs called by theGATK-SV
pipeline, while the newPrintSVEvidence
tool can merge evidence from a cohort in theGATK-SV
pipeline. -
Fixes to
HaplotypeCaller
andMutect2
where force-calling alleles were lost upon trimming, as well as changes to support future Mutect2 releases. - Changes to Funcotator: added the ability to customize the severity ratings of
VariantClassifications
using the new--custom-variant-classification-order
argument. - Fixes to convergence issues in
VariantRecalibrator
in order to make the tool more robust with highly correlated annotations. - Changes to the GATK Engine: new
MultiFeatureWalker
traversal is available. Picard
dependencies have been updated to version2.27.1
. Picard documentation migration into GATK technical documentation is being finalized.
0 comments
Please sign in to leave a comment.