Cromwell metadata of huge workflows
Hi,
I have small question. According to Cromwell documentation I can post to this forum to get help. We run few times into issue when we execute very big workflows. By big, I mean in terms of number of tasks. As we run Cromwell for some time the DB is already a bit big and here is the issue. Getting metadata for such big workflows takes minutes if not longer - resulting in timeouts as the number of rows to get is around 5mln. Users are really unhappy when they need to wait that long to see the graph displayed in our UI. On the slack channel I got to know that a custom implementation of metadata actor might be the option for us. However, not sure how to approach this as there is no docs (or at least I could not find). I would be grateful if somebody could share the code for such an actor.
Best,
Rafal
-
Hi Rafal,
You are definitely welcome to post here and we encourage users to help each other find solutions. However, our GATK support team is not able to offer any Cromwell support, you can see our support policy here. You can also look at these resources for more information:
- Bioinformatics Stack Exchange
- Cromwell slack organization: cromwellhq.slack.com
- Cromwell Documentation
Best,
Genevieve
Please sign in to leave a comment.
1 comment