More efficient scheduling of large ensembles #107

cylc · 2012-09-04T02:58:12Z

This is just an idea for consideration. See also #108.

Currently if a "family trigger" is used in the graph to trigger a large ensemble, the family trigger is replaced with the equivalent expression involving all the ensemble members, and at run time every member is represented by its own dedicated task proxy. The efficiency of the scheduling algorithm depends on the size of the task pool, and large ensembles are the most likely cause of task pool bloat. If all ensemble members trigger at once, and if downstream tasks trigger off the entire ensemble, then the suite would be just as well served by a single task proxy representing the entire ensemble. The family proxy would have to take messages from each of its members, keep track of each of their states, know how to submit all of them at once, and so on. This would result in a massive performance boost for very large ensemble suites.

cylc · 2012-09-04T03:03:36Z

Complications:

if an individual member is singled out in a non-family dependency relationship, it could be removed from family control and given its own task proxy. Or perhaps the family proxy could also manage these relationships.
how to display these aggregate families in gcontrol etc. - perhaps clicking on a family would just bring up a summary of the member states.
what about cylc's internal queues, which work with individual tasks?
what about internal dependence among ensemble members?

cylc · 2012-09-05T02:02:40Z

Here's an idea - keep the individual member proxies but have them interact only amongst themselves in a separate pool (to handle any internal dependence) while they are represented in the main pool by a single family proxy.

hjoliver · 2016-06-15T15:33:51Z

It was suggested in @cylc/core meeting, that dependency matching could work with shared "dependency objects" rather than individual task proxies. Members of graphed families would automatically share the same dependency objects - would probably solve this issue and #1776 in one whack (if the same shared objects could be used to define graph edges). Bumping up to 'soon' on that basis...

hjoliver · 2016-06-19T14:13:05Z

The new idea (prev comment) is superior - closing this and moving that to a new Issue.

cylc mentioned this issue Sep 4, 2012

Ideas for more efficient scheduling of very large suites #108

Closed

benfitzpatrick mentioned this issue Jun 15, 2016

Explicit Sub-suite (suite within a task) support #171

Closed

hjoliver modified the milestones: soon, later Jun 15, 2016

hjoliver closed this as completed Jun 19, 2016

hjoliver added the superseded label Jun 19, 2016

hjoliver removed this from the soon milestone Jun 19, 2016

hjoliver mentioned this issue Jun 19, 2016

Use shared dependency objects. #1894

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

More efficient scheduling of large ensembles #107

More efficient scheduling of large ensembles #107

cylc commented Sep 4, 2012

cylc commented Sep 4, 2012

cylc commented Sep 5, 2012

hjoliver commented Jun 15, 2016 •

edited

Loading

hjoliver commented Jun 19, 2016

More efficient scheduling of large ensembles #107

More efficient scheduling of large ensembles #107

Comments

cylc commented Sep 4, 2012

cylc commented Sep 4, 2012

cylc commented Sep 5, 2012

hjoliver commented Jun 15, 2016 • edited Loading

hjoliver commented Jun 19, 2016

hjoliver commented Jun 15, 2016 •

edited

Loading