
fixing walk order to resolve priority in multi-sink pipelines #120

Merged
merged 3 commits into from
Nov 13, 2024

Conversation

amitschang
Member

@amitschang amitschang commented Nov 8, 2024

When adding multiple sink tasks along the pipeline, it was observed that the tasks further along do not end up getting priority to run. This could cause a pipeline to effectively run only the path to a single output closer to the start while a queue piles up on downstream tasks. This looks to be due to the order of the backward BFS walk, which reaches some nodes closer to the start (following the output nearer the start) sooner, so they sort first.

For instance, in the pipeline below, the green and yellow paths take priority. Due to batch sizes, the green path would remain unscheduled, but the yellow path, with single-row batches and single-CPU requests, would simply keep taking priority and the white tasks would never be submitted.

(pipeline diagram: green and yellow paths near the start, white tasks further downstream)

The fix here is to penalize tasks further from a sink along any path. The sink tasks themselves all share the same rank, so we can continue to write output quickly. Tasks at the same rank are ordered as before (by execution count).
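The rank described above could be computed with a reverse BFS seeded from all sinks at once. This is an illustrative sketch only; the function name and the predecessor-map graph shape are assumptions, not the PR's actual API:

```python
from collections import deque

def rank_by_distance_from_sinks(predecessors, sinks):
    """Assign each task its shortest distance (in edges) to any sink.

    `predecessors` maps task -> list of upstream tasks, so a BFS seeded
    with every sink walks the graph backward toward the sources.
    """
    rank = {sink: 0 for sink in sinks}  # all sinks share rank 0
    queue = deque(sinks)
    while queue:
        node = queue.popleft()
        for upstream in predecessors.get(node, []):
            if upstream not in rank:  # first visit = shortest distance
                rank[upstream] = rank[node] + 1
                queue.append(upstream)
    return rank
```

Scheduling order would then be something like `sorted(tasks, key=lambda t: (rank[t], exec_count[t]))`, which keeps all sinks at the front while breaking ties by execution count as before.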

Note: this comes with a small additional change: instead of submitting as many invocations of a particular task as fit at a time, we re-evaluate the sort order after each submission, walking the graph backward each time. This makes the pipeline "fairer" in a sense.


codecov bot commented Nov 8, 2024

Codecov Report

All modified and coverable lines are covered by tests ✅


task.pending.appendleft(self.task_submit(task.task, merged))
task.counter += 1
submitted = True
num_to_merge = deque_num_merge(task.data_in, batch_size)
Member Author

FYI: the change in this block simply removes the for loop, submitting (if eligible) only a single invocation of the task at a time, to enable the fairness re-evaluation. The `break` -> `return` change makes the diff look like more than whitespace.

Comment on lines +201 to +211
submitted = True
while submitted:
rank = 0
submitted = False
Member Author

Everything below this is unchanged. This loop implements the fairness: as long as anything has been submitted, we keep walking the graph from the top down with the new sort order. Once nothing is submitted, either there is no room or we need more source data.
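The shape of that outer loop can be sketched as below. All names here (`schedule_pass`, `try_submit`, the task shape) are hypothetical stand-ins for the scheduler's actual state in stream.py, not the real code:

```python
def schedule_pass(tasks, rank, try_submit):
    """One fairness cycle: walk tasks in priority order, submitting at
    most one invocation per task per round, and repeat while anything
    was submitted.

    try_submit(task) is assumed to return True when an invocation fit
    the available resources and was launched.
    """
    submitted = True
    while submitted:
        submitted = False
        # re-sort every round: execution counts change as work launches
        for task in sorted(tasks, key=lambda t: (rank[t.name], t.counter)):
            if try_submit(task):
                task.counter += 1
                submitted = True
    # loop exits when nothing was submitted: no room, or sources needed
```

Because each round submits at most one invocation per task before re-sorting, a low-rank task cannot monopolize the workers the way the old per-task for loop allowed.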

@amitschang amitschang marked this pull request as ready for review November 8, 2024 20:48
@amitschang amitschang requested a review from a team as a code owner November 8, 2024 20:48
@xiangchenjhu xiangchenjhu left a comment

The sorting logic in graph.py is well-structured, and I spent additional time reviewing the scheduling details and trade-offs in stream.py. While I may not have fully understood every detail of the scheduling process (though I grasp most of it), I have no concerns with the current changes. Thanks for the helpful explanations provided in the comments.
Approved

@xiangchenjhu xiangchenjhu self-requested a review November 13, 2024 20:11
@amitschang
Member Author

thanks!

@amitschang amitschang merged commit bc95a0b into main Nov 13, 2024
11 checks passed
@amitschang amitschang deleted the fix-task-walk-order branch November 13, 2024 20:19