
Document Scheduler and Worker state machine #6948

Merged
merged 14 commits into from
Aug 30, 2022

Conversation

crusaderky
Collaborator

@crusaderky crusaderky commented Aug 24, 2022

@crusaderky crusaderky self-assigned this Aug 24, 2022
@crusaderky crusaderky added the documentation Improve or add to documentation label Aug 24, 2022
@github-actions
Contributor

github-actions bot commented Aug 24, 2022

Unit Test Results

See test report for an extended history of previous test failures. This is useful for diagnosing flaky tests.

    15 files ±0      15 suites ±0      6h 17m 26s ⏱️ −11m 42s
 3 052 tests ±0    2 968 ✔️ +1     83 💤 ±0    1 ❌ −1
22 577 runs  ±0   21 603 ✔️ +2    973 💤 −1    1 ❌ −1

For more details on these failures, see this check.

Results for commit cdbcac5. ± Comparison against base commit 6a1b089.

♻️ This comment has been updated with latest results.

@fjetter
Member

fjetter commented Aug 25, 2022

@martindurant @jakirkham you might be interested

Collaborator

@gjoseph92 gjoseph92 left a comment


Excellent! Very clear and useful documentation; this would be great to read as a new developer interested in working on the Worker. All comments are just naming/grammar nits.

distributed/worker_state_machine.py
@@ -5,15 +5,12 @@ digraph{
];
released1 [label=released];
released2 [label=released];
new -> released1;
released1 -> waiting;
Collaborator


FYI, merge conflict with #6614 for this file

docs/source/images/worker-execute-state.dot
docs/source/scheduling-state.rst
docs/source/worker-state.rst
and only when the message reaches the worker will it be released there too.


Flow control
Collaborator


Reading this section of the docs makes me so happy with the design of the worker state machine. Having state transformation strongly separated out from IO and concurrency like this is so nice, and such a big improvement. Nice work!

@martindurant
Member

This is a great document to have available for reference.

I have a couple of high-level thoughts before getting into detail. None of these mean I am requesting changes in the implementation.

  • I do not see the need for the RESUMED state, why not just go to the target state?
  • Similarly for CONSTRAINED, which seems identical to READY, which is also constrained but on the thread pool
  • Reschedule appears to cause a task to be rescheduled, but forgotten? From https://distributed.dask.org/en/stable/api.html#distributed.Reschedule I understand that the point is to clear its state so that the scheduler can restart its life-cycle.

Some diagrams are disjoint, which makes them confusing to follow. For example, the big diagram at the top of Computing shows rescheduled->released->forgotten, but two diagrams later we see that ERROR and MEMORY have exactly the same paths.

I would change some names to make them clearer, if more verbose is allowed. Something like
WAITING -> WAITING_ON_DEPS_TO_COMPLETE
READY -> READY_TO_RUN
FETCH -> WAITING_TO_FETCH_DEPS
FLIGHT -> DEPS_IN_FLIGHT
RESCHEDULE -> RESCHEDULE_RAISED

I would add a clear and specific definition and consequence of every state. Some of this is in TaskStates, but I would add specific details about the data structures affected.
For example:

  • EXECUTING, the associated function is currently running on a thread on this worker and appears as a value of that thread ID in worker.active_threads
  • MEMORY, this key is contained in the worker.data dict, the value being the result returned from executing the task's function
  • FORGOTTEN, this key is no longer in worker.tasks (or any other structure?) and will soon be garbage collected
  • ERROR, an uncaught exception was raised either during the execution of a task's function, or during serialization/deserialization. The content of the exception and traceback are held in the task, so that they are relayed to the client and raised there when the client requests the corresponding future's result. Tasks that depend on this task will also get the status ERROR (?).
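The per-state data-structure invariants suggested above can be sketched with a toy model. `MiniWorker` and its methods are hypothetical names for illustration only; they mirror the attributes mentioned in the bullets (`tasks`, `data`, `active_threads`) but are not the distributed implementation:

```python
from dataclasses import dataclass, field

@dataclass
class MiniWorker:
    tasks: dict = field(default_factory=dict)           # key -> state name
    data: dict = field(default_factory=dict)            # key -> result, for MEMORY tasks
    active_threads: dict = field(default_factory=dict)  # thread id -> key, for EXECUTING tasks

    def start_executing(self, key: str, thread_id: int) -> None:
        # EXECUTING: the task's function is running on a thread of this worker
        self.tasks[key] = "executing"
        self.active_threads[thread_id] = key

    def finish(self, key: str, thread_id: int, result: object) -> None:
        # MEMORY: the value returned by the function is held in the data dict
        self.tasks[key] = "memory"
        self.data[key] = result
        del self.active_threads[thread_id]

    def forget(self, key: str) -> None:
        # FORGOTTEN: the key leaves every structure and can be garbage collected
        self.tasks.pop(key, None)
        self.data.pop(key, None)

w = MiniWorker()
w.start_executing("x", thread_id=1)
w.finish("x", thread_id=1, result=42)
w.forget("x")
assert "x" not in w.tasks and "x" not in w.data
```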

Why is the initial state of any task apparently RELEASED?

There is no mention anywhere of Actors, even though a lot of code is dedicated to them.

@gjoseph92
Collaborator

I also just realized I don't think you mention the TaskState.done attribute in here. I think it's worth noting since the name is confusing (it sounds like it refers to the task being in a terminal state, as opposed to the execute/fetch coroutine that was responsible for it being complete).

crusaderky and others added 10 commits August 30, 2022 10:49
Co-authored-by: Gabe Joseph <gjoseph92@gmail.com>
Co-authored-by: Gabe Joseph <gjoseph92@gmail.com>
Co-authored-by: Gabe Joseph <gjoseph92@gmail.com>
Co-authored-by: Gabe Joseph <gjoseph92@gmail.com>
Co-authored-by: Gabe Joseph <gjoseph92@gmail.com>
Co-authored-by: Gabe Joseph <gjoseph92@gmail.com>
@crusaderky
Collaborator Author

crusaderky commented Aug 30, 2022

I also just realized I don't think you mention the TaskState.done attribute in here. I think it's worth noting since the name is confusing (it sounds like it refers to the task being in a terminal state, as opposed to the execute/fetch coroutine that was responsible for it being complete).

I overhauled the docstring of the flag.

  • I do not see the need for the RESUMED state, why not just go to the target state?

Because you'd have a task in flight which is actually taking up a thread from the threadpool, and which will terminate with one of the subclasses of ExecuteDoneEvent instead of GatherDepDoneEvent; or you'd have a task in executing or long-running which is actually taking up network resources and will terminate with GatherDepDoneEvent instead of ExecuteDoneEvent.

Additionally, when such a task fails, you need to try doing what the scheduler originally asked for. This is actually the norm when a worker dies and the scheduler notices before its peers:

  1. worker dies
  2. scheduler notices and cancels fetches for all keys which existed exclusively on the dead worker
  3. scheduler sends a compute-task message to a random worker
  4. if the worker receiving the compute-task message just happens to be one that previously had the task in flight, it will need to wait for the TCP connection to fall over, thus sending GatherDepNetworkFailureEvent to the state machine, and then start the computation.

This use case is also documented in the sphinx documents linked above.
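The four-step scenario above can be condensed into a toy transition table. The function and state names here are simplified stand-ins (the real event classes live in `distributed.worker_state_machine`; `resumed(target=waiting)` is shorthand for the RESUMED state remembering the scheduler's intent):

```python
def on_compute_task(state: str) -> str:
    """Toy handler: the scheduler asks this worker to compute a key."""
    if state == "flight":
        # The gather coroutine can't be aborted mid-way: park the task in
        # RESUMED and remember what the scheduler actually wants.
        return "resumed(target=waiting)"
    return "waiting"

def on_gather_dep_network_failure(state: str) -> str:
    """Toy handler: the TCP connection to the dead peer finally falls over."""
    if state == "resumed(target=waiting)":
        return "waiting"  # now honour the scheduler's request and compute locally
    return "released"

state = on_compute_task("flight")             # step 3: compute-task arrives
state = on_gather_dep_network_failure(state)  # step 4: gather fails, computation can start
assert state == "waiting"
```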

  • Similarly for CONSTRAINED, which seems identical to READY, which is also constrained but on the thread pool

It's a lot more efficient to have two different pipelines for tasks with resource constraints and tasks without, so that a task without resources is never blocked by tasks that have them.
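A minimal sketch of the two-pipeline idea, assuming a priority heap for unconstrained tasks and a separate queue for resource-constrained ones (all variable names are hypothetical, not the WorkerState internals):

```python
import heapq

ready = []               # unconstrained pipeline: plain priority heap
constrained = []         # constrained pipeline: tasks waiting on named resources
available = {"GPU": 0}   # no GPU free right now

heapq.heappush(ready, (0, "cheap-task"))
constrained.append(("gpu-task", {"GPU": 1}))

runnable = []
# The unconstrained pipeline drains regardless of blocked constrained tasks:
while ready:
    runnable.append(heapq.heappop(ready)[1])
# The constrained pipeline only releases tasks whose resources are free:
for key, req in list(constrained):
    if all(available.get(r, 0) >= n for r, n in req.items()):
        constrained.remove((key, req))
        runnable.append(key)
```

With a single shared queue, `cheap-task` could end up stuck behind `gpu-task`; with two pipelines it runs immediately while `gpu-task` keeps waiting for a GPU.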

Reschedule causes the task to be immediately forgotten on the worker and released on the scheduler, which restarts its life-cycle.
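The behaviour described here can be illustrated with a self-contained stand-in; in real code you would raise `distributed.Reschedule` from inside a task, and the worker and scheduler would do the forgetting and releasing:

```python
# Stand-in exception so this sketch runs without distributed installed;
# real code: `from distributed import Reschedule`.
class Reschedule(Exception):
    """Raised by a task to ask the scheduler to restart its life-cycle."""

def flaky_task(load_is_high: bool) -> str:
    if load_is_high:
        # Worker forgets the task immediately; scheduler releases and retries it,
        # typically on a different worker.
        raise Reschedule
    return "done"
```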

Some diagrams are disjoint. It makes it confusing to follow. For example, the big diagram at the top of Computing shows rescheduled->released->forgotten, but two diagrams later we see that ERROR and MEMORY have exactly the same paths.

Yes, this is on purpose to highlight how reschedule immediately transitions to released and forgotten, while error/memory won't transition to released until the scheduler asks to. I updated the diagrams and the "Forgetting tasks" section.

I would change some names to make them clearer, if more verbose is allowed.

This would make things seriously hard to read considering how many times these labels appear throughout the code.
It would also make it necessary to use enums to avoid misspellings, further aggravating the code verbosity.

FETCH -> WAITING_TO_FETCH_DEPS
FLIGHT -> DEPS_IN_FLIGHT

No: when a task is in fetch or flight state, it is itself waiting to be fetched or in flight.
If a task is waiting for its dependencies to be gathered, it is in waiting state.
Additionally, a task may be in fetch or flight without being a dependency, as it may have been replicated by the Active Memory Manager.

I would add a clear and specific definition and consequence of every state. Some of this is in TaskStates, but I would add specific details about the data structures affected. For example: [...]

Added clarifications.

Why is the initial state of any task apparently RELEASED?

Historical reasons. There used to be two separate states: new at the beginning of a task's life and released at the end; they were later merged into one, even though neither name quite fits the other's role. In practice you can find a task in released state towards the end of its lifetime, whereas a brand new task will immediately transition to waiting or fetch within a single transitions() call.
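A toy version of that last point: a brand-new task never rests in released; the first transitions() pass moves it straight to its next state. `transitions` and `should_compute` are illustrative names only, not the real WorkerState API:

```python
def transitions(start: str, *, should_compute: bool) -> list[str]:
    """Toy transition chain: `released` is only a momentary starting point."""
    chain = [start]
    if start == "released":
        # A task this worker must compute goes to `waiting`;
        # a task it must gather from a peer goes to `fetch`.
        chain.append("waiting" if should_compute else "fetch")
    return chain

assert transitions("released", should_compute=True) == ["released", "waiting"]
```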

There is no mention anywhere of Actors, even though a lot of code is dedicated to them.

Actually, Actors have nothing whatsoever to do with the worker state machine - they're just tasks like any other. They are handled exclusively in Worker.

@crusaderky
Collaborator Author

All review comments have been addressed

Member

@fjetter fjetter left a comment


Good job!

@crusaderky crusaderky merged commit 817ead3 into dask:main Aug 30, 2022
@crusaderky crusaderky deleted the workerstate_doc branch August 30, 2022 16:11
gjoseph92 pushed a commit to gjoseph92/distributed that referenced this pull request Oct 31, 2022
Successfully merging this pull request may close these issues.

[DEV DOCS] Documentation of Scheduler and Worker state machine
4 participants