Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

Error in GATK-SV joint-calling terra pipeline, 07-FilterBatchSites step

0

9 comments

  • Avatar
    Gökalp Çelik

    Hi Dong Wang

    I guess this question has already been answered under github issues. It requires more samples to get a result for this step. 

    Regards. 

    0
    Comment actions Permalink
  • Avatar
    Dong Wang

    Yes, thanks for following up! I am currently re-running the pipeline with more samples. If anything arises, would the Terra support forum be a more efficient place to troubleshoot than here? 

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi again.

    You can definitely use here as well. GATK-SV team members will take your questions gladly. 

    Regards. 

    0
    Comment actions Permalink
  • Avatar
    Dong Wang

    Hello, I'm running into a new error and cross-posting here again. Running batches of 100 samples fixed my previous issue.

    Now at step 18-SVConcordance, I'm running into this error: htsjdk.tribble.TribbleException$InternalCodecException: The allele with index 2 is not defined in the REF/ALT columns in the record.
    The VCF causing this is the output from step 16-RefineComplexVariants, named cpx_refined.vcf.gz. What does this error indicate?

    Thanks for your help!

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi Dong Wang

    This could be bug at one of the tools that generate the target vcf. I will relay to GATK-SV team to check it out.

     

    0
    Comment actions Permalink
  • Avatar
    Dong Wang

    Hi Gökalp Çelik, sounds good, please let me know who/where I should follow up with!

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi Dong Wang

    Here is the response from the GATK-SV team. This is a known issue and it is fixed in a later version of docker image. They are recommending updating the 

    sv_pipeline_docker 

    variable to

    us.gcr.io/broad-dsde-methods/gatk-sv/sv-pipeline:2025-02-10-v1.0.2-72c15c6b

    and rerunning the 

    CleanVcf

    and onward. 

    Developers also mentioned that the live workspace has already been updated so you may need to check your workspace configuration against the working configuration of the live workspace for GATK-SV. 

    Regards. 

     

    0
    Comment actions Permalink
  • Avatar
    Dong Wang

    Hi Gökalp Çelik, I appreciate your quick response! I will retry the steps with the updated docker. It seems like the pipeline is being actively updated to address breaking bugs like this. For the future, should I run the pipeline with the default v1.0 code or use whatever is the most updated (v1.0.2 now for most workflows)? Thanks for your help. 

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi again. 

    Using the most updated code would be the best. 

    Regards.

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk