[galaxy-dev] Plans for workflow & parallelisation work?

2016-02-21 Thread Peter van Heusden
Hi there

I see from the PR landing in Galaxy and the comments on things like
issue #1701 (https://github.com/galaxyproject/galaxy/issues/1701) that
there's lots of work happening on the workflow side of Galaxy. This is an
area of interest at SANBI too, so we'd like to coordinate development
efforts as much as possible. To this end:

1) Are there forks to track so we can see what new code is landing?
2) Is there a roadmap for workflow work or perhaps can we have a Hangout to
talk about this?
3) Specifically in terms of workflows and parallelisation: are there any
plans to work on running workflows as opposed to just generating lots of
jobs? I know this is a major change to how Galaxy works - it would mean
something like submitting a workflow specification to a job runner that is
located on the cluster, and then returning the results of workflow
execution.
4) Currently parallelisation in Galaxy is supported using two mechanisms:
collections and dataset splitters/tasks. Are there plans on extending and
harmonising Galaxy's parallelisation capabilities?

Thanks,
Peter
___
Please keep all replies on the list by using "reply all"
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  https://lists.galaxyproject.org/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/

[galaxy-dev] Scaling tools: data available?

2016-02-21 Thread Peter van Heusden
Hi there

We're researching the resource (CPU time and memory) requirements for RNA
STAR at the moment. Specifically, we'd like to build up a database of input
size to resource usage so that we can use this to feed a dynamic
destination mapper (currently our STAR configuration uses a thumbsuck for
RAM and CPU requirements that is not optimal but works in many cases).

Has anyone collected these metrics before? And more generally, is anyone
collecting metrics for a range of bioinformatics tools? A database like
this could be very useful!

Thanks!
Peter
___
Please keep all replies on the list by using "reply all"
in your mail client.  To manage your subscriptions to this
and other Galaxy lists, please use the interface at:
  https://lists.galaxyproject.org/

To search Galaxy mailing lists use the unified search at:
  http://galaxyproject.org/search/mailinglists/