Error in GenotypeGVCF: Cannot read tile from file
REQUIRED for all errors and issues:
a) GATK version used:
4.3.0.0
b) Exact command used:
set=CB
ref=V_macrocarpon_Stevens_v1
gatk --java-options "-Xmx12g" GenotypeGVCFs \
-R ${ref}.fasta \
-V gendb://CB \
-O ${set}.vcf.gz
c) Entire program log:
I skipped through most of the traversal but I got he following error at the end. I've never had this error before and I can't anything out about it. Is it just a memory issue? I've tried with "-Xmx4g" and "-Xmx12g" and it stops at the same place
Using GATK jar /home/barlex/miniconda3/envs/snp/share/gatk4-4.3.0.0-0/gatk-package-4.3.0.0-local.jar
Running:
java -Dsamjdk.use_async_io_read_samtools=false -Dsamjdk.use_async_io_write_samtools=true -Dsamjdk.use_async_io_write_tribble=false -Dsamjdk.compression_level=2 -Xmx12g -jar /home/barlex/miniconda3/envs/snp/share/gatk4-4.3.0.0-0/gatk-package-4.3.0.0-local.jar GenotypeGVCFs -R V_macrocarpon_Stevens_v1.fasta -V gendb://CB -O CB.vcf.gz
13:41:10.771 INFO NativeLibraryLoader - Loading libgkl_compression.so from jar:file:/home/barlex/miniconda3/envs/snp/share/gatk4-4.3.0.0-0/gatk-package-4.3.0.0-local.jar!/com/intel/gkl/native/libgkl_compression.so
13:41:10.991 INFO GenotypeGVCFs - ------------------------------------------------------------
13:41:10.992 INFO GenotypeGVCFs - The Genome Analysis Toolkit (GATK) v4.3.0.0
13:41:10.992 INFO GenotypeGVCFs - For support and documentation go to https://software.broadinstitute.org/gatk/
13:41:10.992 INFO GenotypeGVCFs - Executing as barlex@DESKTOP-VDKQE0G on Linux v4.4.0-19041-Microsoft amd64
13:41:10.992 INFO GenotypeGVCFs - Java runtime: OpenJDK 64-Bit Server VM v1.8.0_382-b05
13:41:10.992 INFO GenotypeGVCFs - Start Date/Time: March 1, 2024 1:41:10 PM PST
13:41:10.992 INFO GenotypeGVCFs - ------------------------------------------------------------
13:41:10.992 INFO GenotypeGVCFs - ------------------------------------------------------------
13:41:10.993 INFO GenotypeGVCFs - HTSJDK Version: 3.0.1
13:41:10.993 INFO GenotypeGVCFs - Picard Version: 2.27.5
13:41:10.993 INFO GenotypeGVCFs - Built for Spark Version: 2.4.5
13:41:10.993 INFO GenotypeGVCFs - HTSJDK Defaults.COMPRESSION_LEVEL : 2
13:41:10.993 INFO GenotypeGVCFs - HTSJDK Defaults.USE_ASYNC_IO_READ_FOR_SAMTOOLS : false
13:41:10.994 INFO GenotypeGVCFs - HTSJDK Defaults.USE_ASYNC_IO_WRITE_FOR_SAMTOOLS : true
13:41:10.994 INFO GenotypeGVCFs - HTSJDK Defaults.USE_ASYNC_IO_WRITE_FOR_TRIBBLE : false
13:41:10.994 INFO GenotypeGVCFs - Deflater: IntelDeflater
13:41:10.994 INFO GenotypeGVCFs - Inflater: IntelInflater
13:41:10.994 INFO GenotypeGVCFs - GCS max retries/reopens: 20
13:41:10.994 INFO GenotypeGVCFs - Requester pays: disabled
13:41:10.994 INFO GenotypeGVCFs - Initializing engine
13:41:17.347 INFO GenomicsDBLibLoader - GenomicsDB native library version : 1.4.3-6069e4a
13:41:32.188 info NativeGenomicsDB - pid=30804 tid=30805 No valid combination operation found for INFO field InbreedingCoeff - the field will NOT be part of INFO fields in the generated VCF records
13:41:32.188 info NativeGenomicsDB - pid=30804 tid=30805 No valid combination operation found for INFO field MLEAC - the field will NOT be part of INFO fields in the generated VCF records
13:41:32.188 info NativeGenomicsDB - pid=30804 tid=30805 No valid combination operation found for INFO field MLEAF - the field will NOT be part of INFO fields in the generated VCF records
13:41:32.430 INFO GenotypeGVCFs - Done initializing engine
13:41:32.508 INFO ProgressMeter - Starting traversal
13:41:32.511 INFO ProgressMeter - Current Locus Elapsed Minutes Variants Processed Variants/Minute
13:41:55.132 INFO ProgressMeter - chr1:88507 0.4 1000 2652.4
15:27:37.716 INFO ProgressMeter - chr5:6821992 106.1 3392000 31973.8
[TileDB::ReadState] Error: Cannot read tile from file; Memory map error.
terminate called after throwing an instance of 'VariantStorageManagerException'
what(): VariantStorageManagerException exception : VariantArrayCellIterator increment failed
TileDB error message : [TileDB::ReadState] Error: Cannot read tile from file; Memory map error
-
Hi Shaun Clare
From your logs it looks like you are using WSL version 1 as your environment. WSL environment is not a perfect match for linux binaries and memory allocation errors may occur such as this one. We recommend using a native environment through docker or you may at least try converting your WSL environment to version 2 which is a native linux VM inside Hyper-V and sandboxes all linux binaries therefore no memory allocation issues occur.
I hope this helps.
-
Hey,
It took me a while to upgrade to WSL2 but it looks like it finished GenotypeGVCF this time. Thank you!!
Please sign in to leave a comment.
2 comments