Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

Help with WDL workflow for mutec2?

Answered
0

9 comments

  • Avatar
    Beri

    In the WDL script the output block for the M2 task specifies two variables 
    String tumor_sample = read_string("tumor_name.txt")
    String normal_sample = read_string("normal_name.txt")

    The read_string() function should be reading the contents of those files, perhaps the files are empty? Try checking the log.stderr and log.stdout files to confirm M2 task ran without any problems. 

    0
    Comment actions Permalink
  • Avatar
    Beri

    Also, the repo you're pointing to is archived, it's best to use the latest version of the workflow found here.

    0
    Comment actions Permalink
  • Avatar
    Alex Lee

    Beri thanks. What is strange is that each subdirectory looks fine.  There are no errors. and both tumor_name.txt and normal has a text to the file names. Each subdirectory has a a output vcf. Stdout looks fine 

    Tool returned:
    SUCCESS
    Tool returned:
    SUCCESS
    Tool returned:
    SUCCESS

    Moreover the stderr no errors, the last entry was a command.  

    Running:
    java -Dsamjdk.use_async_io_read_samtools=false -Dsamjdk.use_async_io_write_samtools=true -Dsamjdk.use_async_io_write_tribble=false -Dsamjdk.compression_level=2 -Xmx3000m -jar /root/gatk.jar GetPileupSummaries -R /cromwell-executions/Mutect2/40fdbfbc-bc9c-4bdf-be88-3630342aaca1/call-M2/shard-40/inputs/-1275058761/Homo_sapiens_assembly19.fasta -I /cromwell-executions/Mutect2/40fdbfbc-bc9c-4bdf-be88-3630342aaca1/call-M2/shard-40/inputs/2119282986/HM5086F_A.b37.bam --interval-set-rule INTERSECTION -L /cromwell-executions/Mutect2/40fdbfbc-bc9c-4bdf-be88-3630342aaca1/call-M2/shard-40/inputs/1020858285/0040-scattered.interval_list -V /cromwell-executions/Mutect2/40fdbfbc-bc9c-4bdf-be88-3630342aaca1/call-M2/shard-40/inputs/-1275058761/small_exac_common_3.vcf -L /cromwell-executions/Mutect2/40fdbfbc-bc9c-4bdf-be88-3630342aaca1/call-M2/shard-40/inputs/-1275058761/small_exac_common_3.vcf -O normal-pileups.table

     

     

    0
    Comment actions Permalink
  • Avatar
    Beri

    So If there aren't any errors in the stderr and GATK says it was successful chances are it related to Cromwell/WDL. 

    The WDL documentation has some info mentioning possible issues when using read_string()

    If the entire contents of the file can not be read for any reason, the calling task or workflow will be considered to have failed. Examples of failure include but are not limted to not having access to the file, resource limitations (e.g. memory) when reading the file, and implementation imposed file size limits.

    0
    Comment actions Permalink
  • Avatar
    Alex Lee

    Beri thanks. What is bizaare is that even if I hard coded the file names, what just get rid of read_string() then there are other errors.  I've just pretty much given up on this since it would've been faster had I just build this from scratch! 

    0
    Comment actions Permalink
  • Avatar
    TA

    Hi,

    Was there any solution to this error? I am getting the same error when I tried to run this Mutect2 wdl using cromwell on my server - I imagine it has to do with runtime attributes you have to change when it isn't used on terra as this workflow is working when I use it on terra vs. on my institute's cluster (same samples/runtime attributes are giving me this error). Thank you!

     

    Toby

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi TA

    Can you post the actual error messages you get and also provide versions of tools that you use to run these workflows?

    We can help you better with that information since this topic is already 3 years old and there has been many updates to all tools mentioned here. 

    Regards. 

    0
    Comment actions Permalink
  • Avatar
    TA

    Hi,

     

    The error message is: 

    cromwell.backend.standard.StandardAsyncExecutionActor$$anon$2: Failed to evaluate job outputs:

    Bad output 'M2.tumor_sample': Failed to read_string("tumor_name.txt") (reason 1 of 1): Future timed out after [60 seconds]

    Bad output 'M2.normal_sample': Failed to read_string("normal_name.txt") (reason 1 of 1): Future timed out after [60 seconds]

    at cromwell.backend.standard.StandardAsyncExecutionActor.$anonfun$handleExecutionSuccess$1(StandardAsyncExecutionActor.scala:990)

    at scala.concurrent.impl.Promise$Transformation.run(Promise.scala:467)

    at akka.dispatch.TaskInvocation.run(AbstractDispatcher.scala:41)

    at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(ForkJoinExecutorConfigurator.scala:49)

    at akka.dispatch.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)

    at akka.dispatch.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)

    at akka.dispatch.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)

    at akka.dispatch.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)

    The tools I am using are all the tools from the most recent gatk docker container:

    us.gcr.io/broad-gatk/gatk@sha256:53b4590c3c8d2d5db7a02c07754a69ea134841f6a08a5a16b238bbf2d6ebd250

    Basically I am running best practices mutect2 wdl here:

    https://github.com/broadinstitute/gatk/blob/master/scripts/mutect2_wdl/mutect2.wdl

    And it is working up until the end of the mutect2 task. I think there is some issue with scatter-gather when it is implemented locally/on a cluster with slurm as opposed to terra with runtime parameters but not exactly sure what runtime parameters to change. We implemented cromwell on the server following the cromwell instructions here:

    https://cromwell.readthedocs.io/en/stable/backends/HPC/

     

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi TA

    Can you give us more details about what version of cromwell you are using? Also can you provide us the cromwell config file for this installation so that we can debug the issue?

    Regards. 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk