Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

GermlineCNVCaller prefix string too short

0

9 comments

  • Avatar
    SkyWarrior

    This issue is fixed with one of the later versions. I suggest you use 4.2.3.0. 

    0
    Comment actions Permalink
  • Avatar
    Ahmed S. Chakroun

    Hi,

    Thanks a lot SkyWarrior. Indeed, it seems working with 4.2.3.0. However, the same command now it getting the following error:

    cohort_denoising_calling.16183341557554455300.py: error: unrecognized arguments: --num_samples_copy_ratio_approx=200

    which is odd because I don't have any --num_samples_copy_ratio_approx argument in my command.

    Please, let me know if I should create a new post.

    Regards.

    Ahmed

    0
    Comment actions Permalink
  • Avatar
    Ahmed S. Chakroun

    Hi,

    I am pretty sure, that's a bug with the python script being generated each time we use GermlineCNVCaller and here is the full error message I get:

    cohort_denoising_calling.1986139956957694084.py: error: unrecognized arguments: --num_samples_copy_ratio_approx=200
    13:54:03.192 INFO GermlineCNVCaller - Shutting down engine
    [December 6, 2021 at 1:54:03 PM CET] org.broadinstitute.hellbender.tools.copynumber.GermlineCNVCaller done. Elapsed time: 0.19 minutes.
    Runtime.totalMemory()=1275068416
    org.broadinstitute.hellbender.utils.python.PythonScriptExecutorException:
    python exited with 2
    Command Line: python /tmp/cohort_denoising_calling.1986139956957694084.py \
    --ploidy_calls_path=/home/ahmed/NGS/wes/gatk_cnv/GRCh38/Twist/ploidy-calls \
    --output_calls_path=/home/ahmed/NGS/wes/gatk_cnv/GRCh38/Twist/cohort16/cohort16-calls \
    --output_tracking_path=/home/ahmed/NGS/wes/gatk_cnv/GRCh38/Twist/cohort16/cohort16-tracking \
    --random_seed=1984 \
    --modeling_interval_list=/tmp/intervals10704902589400019780.tsv \
    --output_model_path=/home/ahmed/NGS/wes/gatk_cnv/GRCh38/Twist/cohort16/cohort16-model \
    --enable_explicit_gc_bias_modeling=True \
    --read_count_tsv_files /tmp/13-20.rc9050216882449081471.tsv /tmp/722.rc8266875526976418082.tsv /tmp/D19047.rc3271710619885345892.tsv /tmp/F24F1.rc4166074511500834871.tsv /tmp/NS.rc7655370649896520978.tsv /tmp/TBC039.rc1852505427358232937.tsv /tmp/VP.rc17807754401657016405.tsv /tmp/WES002.rc477781383630322624.tsv /tmp/WES02.rc18129316156732628621.tsv /tmp/17062-T1-.rc11172633656988231709.tsv /tmp/18001-M1-.rc12085939516166347679.tsv /tmp/516.rc17861001317409373195.tsv /tmp/533.rc7117324399860262573.tsv /tmp/NBH.rc17070722809383417351.tsv /tmp/ADN492.rc5813836201309586145.tsv /tmp/WES607.rc6033612252752964854.tsv \
    --psi_s_scale=1.000000e-04 \
    --mapping_error_rate=1.000000e-02 \
    --depth_correction_tau=1.000000e+04 \
    --q_c_expectation_mode=hybrid \
    --num_samples_copy_ratio_approx=200 \
    --max_bias_factors=5 \
    --psi_t_scale=1.000000e-03 \
    --log_mean_bias_std=1.000000e-01 \
    --init_ard_rel_unexplained_variance=1.000000e-01 \
    --num_gc_bins=20 \
    --gc_curve_sd=1.000000e+00 \
    --active_class_padding_hybrid_mode=50000 \
    --enable_bias_factors=True \
    --disable_bias_factors_in_active_class=False \
    --p_alt=1.000000e-06 \
    --cnv_coherence_length=1.000000e+04 \
    --max_copy_number=5 \
    --p_active=0.010000 \
    --class_coherence_length=10000.000000 \
    --learning_rate=1.000000e-02 \
    --adamax_beta1=9.000000e-01 \
    --adamax_beta2=9.900000e-01 \
    --log_emission_samples_per_round=50 \
    --log_emission_sampling_rounds=10 \
    --log_emission_sampling_median_rel_error=5.000000e-03 \
    --max_advi_iter_first_epoch=5000 \
    --max_advi_iter_subsequent_epochs=200 \
    --min_training_epochs=10 \
    --max_training_epochs=50 \
    --initial_temperature=1.500000e+00 \
    --num_thermal_advi_iters=2500 \
    --convergence_snr_averaging_window=500 \
    --convergence_snr_trigger_threshold=1.000000e-01 \
    --convergence_snr_countdown_window=10 \
    --max_calling_iters=10 \
    --caller_update_convergence_threshold=1.000000e-03 \
    --caller_internal_admixing_rate=7.500000e-01 \
    --caller_external_admixing_rate=1.000000e+00 \
    --disable_caller=false \
    --disable_sampler=false \
    --disable_annealing=false
    at org.broadinstitute.hellbender.utils.python.PythonExecutorBase.getScriptException(PythonExecutorBase.java:75)
    at org.broadinstitute.hellbender.tools.copynumber.GermlineCNVCaller.doWork(GermlineCNVCaller.java:351)
    at org.broadinstitute.hellbender.cmdline.CommandLineProgram.runTool(CommandLineProgram.java:140)
    at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMainPostParseArgs(CommandLineProgram.java:192)
    at org.broadinstitute.hellbender.cmdline.CommandLineProgram.instanceMain(CommandLineProgram.java:211)
    at org.broadinstitute.hellbender.Main.runCommandLineProgram(Main.java:160)
    at org.broadinstitute.hellbender.Main.mainEntry(Main.java:203)
    at org.broadinstitute.hellbender.Main.main(Main.java:289)

    Any help is welcome.

    Regards.

    Ahmed

    0
    Comment actions Permalink
  • Avatar
    Pamela Bretscher

    Hi Ahmed S. Chakroun,

    Did you update your conda/Docker environment when you updated your GATK version? 

    Kind regards,

    Pamela

    0
    Comment actions Permalink
  • Avatar
    Ahmed S. Chakroun

    Hi Pamela,

    Indeed, updating the conda environment solved the issue. Now GermlineCNVCaller works fine.

    Thank you very much for the valuable help.

    Regards.
    Ahmed

    0
    Comment actions Permalink
  • Avatar
    Pamela Bretscher

    Hi Ahmed S. Chakroun,

    Great! I'm glad to hear that solved the problem.

    Kind regards,

    Pamela

    0
    Comment actions Permalink
  • Avatar
    Benedicte N

    Hi,

    I have the same issue than you describe. Which version of GATK do you use for make it works ?

    Thank you.

     

    0
    Comment actions Permalink
  • Avatar
    Pamela Bretscher

    Hi Benedicte N,

    I would recommend using the latest version of GATK (4.2.3.0), which I believe is what the above user used to solve the problem.

    Kind regards,

    Pamela

    0
    Comment actions Permalink
  • Avatar
    Ahmed S. Chakroun

    Hi Benedicte N,

    Indeed, I confirm what Pamela said, I upgraded to the latest version and I have also updated the conda environment using the latest version .yml file.

    Good luck.

    Ahmed

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk