AssertionError: Some samples do not have read depth metadata
I am unable to figure out how to solve for this error. Can anyone please help. Thanks.
REQUIRED for all errors and issues:
a) GATK version used:
The Genome Analysis Toolkit (GATK) v4.6.1.0
HTSJDK Version: 4.1.3
Picard Version: 3.3.0
b) Exact command used:
gatk GermlineCNVCaller --run-mode COHORT --contig-ploidy-calls ${wdir}/hdf5/PLOIDY_COHORT-calls/ \
--output ${wdir}/hdf5/ --output-prefix PLOIDY_COHORT_RUN \
-L /rsrch5/home/rsrch/name/GATK4/Homo_sapiens/GRCh38.filtered.intervals \
--interval-merging-rule OVERLAPPING_ONLY \
--annotated-intervals /rsrch5/home/rsrch/name/GATK4/Homo_sapiens/GRCh38.preprocessed.interval_list.anno.tsv \
${SAMPLE_INPUT} --verbosity DEBUG
c) Entire program log:
Traceback (most recent call last):
File "/tmp/cohort_denoising_calling.15835403171985346891.py", line 146, in <module>
shared_workspace = gcnvkernel.DenoisingCallingWorkspace(
File "/rsrch5/home/rsrch/name/miniforge3/envs/gatk4.6.1.0/lib/python3.10/site-packages/gcnvkernel-0.9-py3.10.egg/gcnvkernel/models/model_denoising_calling.py", line 389, in __init__
File "/rsrch5/home/rsrch/name/miniforge3/envs/gatk4.6.1.0/lib/python3.10/site-packages/gcnvkernel-0.9-py3.10.egg/gcnvkernel/models/model_denoising_calling.py", line 634, in _get_baseline_copy_number_and_read_depth
AssertionError: Some samples do not have read depth metadata
14:01:46.080 INFO GermlineCNVCaller - Shutting down engine
[November 19, 2024 at 2:01:46 PM CST] org.broadinstitute.hellbender.tools.copynumber.GermlineCNVCaller done. Elapsed time: 3.39 minutes.
Runtime.totalMemory()=29209133056
org.broadinstitute.hellbender.utils.python.PythonScriptExecutorException:
python exited with 1
Command Line: python /tmp/cohort_denoising_calling.15835403171985346891.py --ploidy_calls_path=/rsrch5/home/rsrch/name/project/hdf5/PLOIDY_COHORT-calls --output_calls_path=/rsrch5/home/rsrch/name/project/hdf5/PLOIDY_COHORT_RUN-calls --output_tracking_path=/rsrch5/home/rsrch/name/project/hdf5/PLOIDY_COHORT_RUN-tracking --random_seed=1984 --modeling_interval_list=/tmp/intervals11515188705086268749.tsv --output_model_path=/rsrch5/home/rsrch/name/project/hdf5/PLOIDY_COHORT_RUN-model --enable_explicit_gc_bias_modeling=True --read_count_tsv_files /tmp/sampleXXXXX.rc13402331463704911309.tsv /tmp/sampleXXXXX.rc9162085982634963414.tsv /tmp/sampleXXXXX.rc15632061791572929519.tsv /tmp/sampleXXXXX.rc12135784994783049012.tsv /tmp/sampleXXXXX.rc430747418122575965.tsv /tmp/sampleXXXXX.rc14100083836932384600.tsv /tmp/sampleXXXXX.rc4749128214540875296.tsv /tmp/sampleXXXXX.rc8318840780860732792.tsv /tmp/sampleXXXXX.rc11414521834826895451.tsv /tmp/sampleXXXXX.rc3219421133382692130.tsv /tmp/sampleXXXXX.rc4386926073403609910.tsv /tmp/sampleXXXXX.rc7368782073389059702.tsv /tmp/sampleXXXXX.rc7208545983456729911.tsv /tmp/sampleXXXXX.rc15558723279796366096.tsv /tmp/sampleXXXXX.rc2451027049655416054.tsv --psi_s_scale=1.000000e-02 --mapping_error_rate=1.000000e-02 --depth_correction_tau=1.000000e+04 --q_c_expectation_mode=hybrid --num_samples_copy_ratio_approx=200 --max_bias_factors=6 --psi_t_scale=1.000000e-02 --log_mean_bias_std=1.000000e-01 --init_ard_rel_unexplained_variance=1.000000e-01 --num_gc_bins=20 --gc_curve_sd=1.000000e+00 --active_class_padding_hybrid_mode=50000 --enable_bias_factors=True --disable_bias_factors_in_active_class=False --p_alt=5.000000e-04 --cnv_coherence_length=1.000000e+04 --max_copy_number=5 --p_active=0.100000 --class_coherence_length=10000.000000 --learning_rate=1.000000e-02 --adamax_beta1=9.000000e-01 --adamax_beta2=9.900000e-01 --log_emission_samples_per_round=50 --log_emission_sampling_rounds=10 --log_emission_sampling_median_rel_error=5.000000e-03 --max_advi_iter_first_epoch=5000 --max_advi_iter_subsequent_epochs=200 --min_training_epochs=10 --max_training_epochs=50 --initial_temperature=1.500000e+00 --num_thermal_advi_iters=2500 --convergence_snr_averaging_window=500 --convergence_snr_trigger_threshold=1.000000e-01 --convergence_snr_countdown_window=10 --max_calling_iters=10 --caller_update_convergence_threshold=1.000000e-03 --caller_internal_admixing_rate=7.500000e-01 --caller_external_admixing_rate=1.000000e+00 --disable_caller=false --disable_sampler=false --disable_annealing=false
at org.broadinstitute.hellbender.utils.python.PythonExecutorBase.getScriptException(PythonExecutorBase.java:75)
at org.broadinstitute.hellbender.tools.copynumber.GermlineCNVCaller.doWork(GermlineCNVCaller.java:380)
at org.broadinstitute.hellbender.cmdline.CommandLineProgram.runTool(CommandLineProgram.java:150)
at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMainPostParseArgs(CommandLineProgram.java:203)
at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:222)
at org.broadinstitute.hellbender.Main.runCommandLineProgram(Main.java:166)
at org.broadinstitute.hellbender.Main.mainEntry(Main.java:209)
at org.broadinstitute.hellbender.Main.main(Main.java:306)
-
Hi Beetle
AssertionError: Some samples do not have read depth metadata
This error message tells us that either some of the count files are not complete or empty or your INPUT line is malformed that it searches for wrong files or locations.
Can you check these two possibilities?
I hope this helps.
Regards.
-
Hello Gökalp Çelik,
Thanks for reply. I have checked count files, and input lines; everything looks fine to me. So i deleted the previous folder and re-run the same command and it worked this time. I have observed when I submit many multiple jobs at the same time some files give error but when i repeat the those script later on in small batches it works fine.
Anyways I was able to run gatk GermlineCNVCaller.
Thanks
Please sign in to leave a comment.
2 comments