Add docs on task-specific buffering using multithreading #48542

IanButterworth · 2023-02-05T18:24:47Z

It's common to see people using threadid()-based buffers, for instance in a toy sum example

function sum_multi(a)
     buffers = zeros(eltype(a), Threads.nthreads())
     Threads.@threads for i in eachindex(a)
         buffers[Threads.threadid()] += a[i]
     end
     s = 0
     for b in buffers
         s += b
     end
     return s
 end

but after task migration #40715 (which is undocumented AFAICT) this practice is not race-safe.

This is an attempt to document some best practice.

I expected the task_local_storage api to be the right answer here, but I couldn't figure out how to reduce it after the @threads loop returned.

Help appreciated in making this correct

vtjnash

Thanks for tackling this

doc/src/manual/multi-threading.md

vtjnash · 2023-02-06T01:18:13Z

doc/src/manual/multi-threading.md

+500000500000
+```
+
+Note that we do not use buffers based on the `threadid()` i.e. `buffers = zeros(Threads.nthreads())` because tasks can


Also because nthreads is soft-deprecated (no warning, but may return incorrect answers now)

I didn't add a note on this. I don't understand whether the docs should or shouldn't recommend nthreads()

torfjelde · 2023-02-06T14:13:55Z

doc/src/manual/multi-threading.md

+To fix this, buffers that are specific to the task may be used to segment the sum into chunks that are race-free.
+
+```julia-repl
+julia> function sum_multi_good(a)


I'm not an expert on multi-threaded parallelism, so I might just be wrong.

But this doesn't seem like it'll be particularly performant as the values of the WeakKeyDict won't be contiguous in memory, and so the reduction at the needs to gather these from likely different parts of the heap => slow reduction. In fact, when I tried running something like this on an example of mine it was incredibly slow; probably partially due to what I just mentioned, and partially due to the slowness of WeakKeyDict (I believe you can also use an IdDict which at least should be faster that WeakKeyDict).

An alternative is Atomic as mentioned below, or, even better, you can do a Vector{Atomic{T}} of some buffer_length. In the latter scenario you can then just randomly pick an index for each Task, and act on the corresponding Atomic{T} atomically, e.g. atomic_add!(buffer[rand(1:buffer_size)], i). If the work within each of the tasks is fairly uniform, then this random picking of index to add to should result in fairly even congestion for the different buffers.

I'm sure there are much, much better ways of doing this, but the above is a quick-and-easy way to implement a much faster version of this kind of map-reduce approach using a buffer.

Moelf · 2023-05-30T21:05:12Z

just to move some of the comments here to provide some context as to why this PR is very much needed.

Problem with task local storage

it's not type stable out of the box
if what you're buffering involves large RAM or slow I/O and the result is used across task boundary, tasks may over spawn and overlap with each other (see this for example)

Our ecosystem contains many silently incorrect code now without this

The easiest way for these to be fixed is probably to provide a replacement for what used to work, thus this PR is pretty important.

vtjnash · 2023-05-31T02:37:28Z

At least most of those had the decency to avoid having inbounds annotations, which can quickly (and often) turns mild wrong code into drastic wrong answers

vtjnash · 2023-05-31T02:39:51Z

Those seem more impetus to continue looking into #48543 perhaps?

KristofferC · 2023-05-31T04:43:11Z

I think many of those mimiced the implementation of the old default rng.

doc/src/manual/multi-threading.md

IanButterworth · 2023-06-16T18:26:15Z

Perhaps this is ready for final review? @MasonProtter perhaps?

MasonProtter

Sorry for suggested that code and now I'm suggesting changes, but I guess it would be nice and concise here to just re-use the single threaded version of the function and run it on each of the spawned chunks, and also use it for reducing the returned data?

doc/src/manual/multi-threading.md

Co-Authored-By: Mason Protter <mason.protter@icloud.com>

IanButterworth · 2023-06-29T18:36:40Z

@MasonProtter I absorbed your suggestions. Can you give this a last review please. Thanks

MasonProtter

I like it!

Co-authored-by: Mason Protter <mason.protter@icloud.com>

Backported PRs: - [x] #47782  - [x] #48634  - [x] #49931  - [x] #50064  - [x] #50474  - [x] #50516  - [x] #50635  - [x] #49915  - [x] #50781  - [x] #50845  - [x] #49031  - [x] #50289  - [x] #50559  - [x] #49582  - [x] #50341  - [x] #50525  - [x] #50444  - [x] #50523  - [x] #50860  - [x] #50164  - [x] #50568  - [x] #50871  Need manual backport: - [ ] #48542  - [ ] #50591  Non-merged PRs with backport label: - [ ] #50842  - [ ] #50823  - [ ] #50663  - [ ] #49716  - [ ] #49713  - [ ] #49573  - [ ] #48726  - [ ] #48642  - [ ] #48183  - [ ] #48050  - [ ] #47615

Co-authored-by: Mason Protter <mason.protter@icloud.com> (cherry picked from commit 02f80c6)

Backported PRs: - [x] #48625  - [x] #48387  - [x] #48363  - [x] #48977  - [x] #50719  - [x] #50694  - [x] #50860  - [x] #50594  - [x] #50802  - [x] #50858  - [x] #50874  - [x] #50822  - [x] #50730  - [x] #50850  - [x] #50809  - [x] #50915  - [x] #50929  - [x] #50928  - [x] #50959  - [x] #50823  - [x] #48542  - [x] #50912  - [x] #51010  - [x] #50753  - [x] #51027  - [x] #51019  - [x] #51039  - [x] #51036  - [x] #51042  - [x] #51114  - [x] #50892  - [x] #51154  - [x] #51153  - [x] #51222  - [x] #51236  - [x] #51243  - [x] #51254  - [x] #51175  - [x] #51300  - [x] #51307  - [x] #51303  - [x] #51393 - [x] #51403 Need manual backport: - [x] #51009  - [x] #51053  - [x] #51013  - [x] #51305  Contains multiple commits, manual intervention needed: - [ ] #50663  - [ ] #51035  - [ ] #51092  - [x] #51247  - [x] #51294  Non-merged PRs with backport label: - [ ] #51132  - [x] #51029  - [ ] #50919  - [ ] #50824  - [x] #50385  - [ ] #49805

Co-authored-by: Mason Protter <mason.protter@icloud.com> (cherry picked from commit 02f80c6)

IanButterworth added docs This change adds or pertains to documentation multithreading Base.Threads and related functionality labels Feb 5, 2023

IanButterworth force-pushed the ib/threads_data_race_docs branch 3 times, most recently from cd1070f to d6d0234 Compare February 5, 2023 18:33

vchuravy requested a review from tkf February 5, 2023 18:46

devmotion mentioned this pull request Feb 5, 2023

Remove use of threadid TuringLang/DynamicPPL.jl#429

Open

IanButterworth mentioned this pull request Feb 5, 2023

return used tls from @threads for easier use of task_local_storage() #48543

Closed

vtjnash reviewed Feb 6, 2023

View reviewed changes

torfjelde reviewed Feb 6, 2023

View reviewed changes

ericphanson mentioned this pull request Feb 26, 2023

[docs] fix thread safefy issue in parallelism tutorial jump-dev/JuMP.jl#3240

Merged

Moelf mentioned this pull request May 31, 2023

change default threading scheduler to :static #50019

Closed

felixcremer mentioned this pull request Jun 2, 2023

use static thread scheduling JuliaDataCubes/YAXArrays.jl#260

Merged

IanButterworth mentioned this pull request Jun 3, 2023

add docs on task migration #50047

Merged

IanButterworth force-pushed the ib/threads_data_race_docs branch from d6d0234 to 2637437 Compare June 4, 2023 15:23

MasonProtter mentioned this pull request Jun 5, 2023

New blogpost: PSA: Thread-local state is no longer recommended; Common misconceptions about threadid() and nthreads() JuliaLang/www.julialang.org#1904

Merged

9 tasks

MasonProtter reviewed Jun 16, 2023

View reviewed changes

doc/src/manual/multi-threading.md Outdated Show resolved Hide resolved

IanButterworth force-pushed the ib/threads_data_race_docs branch from 1d330f7 to 1c0ebe6 Compare June 16, 2023 18:23

IanButterworth marked this pull request as ready for review June 16, 2023 18:23

IanButterworth changed the title ~~RFC: add docs on task-specific buffering using @threads~~ Add docs on task-specific buffering using multithreading Jun 16, 2023

MasonProtter reviewed Jun 17, 2023

View reviewed changes

doc/src/manual/multi-threading.md Outdated Show resolved Hide resolved

doc/src/manual/multi-threading.md Outdated Show resolved Hide resolved

doc/src/manual/multi-threading.md Outdated Show resolved Hide resolved

IanButterworth force-pushed the ib/threads_data_race_docs branch from 1c0ebe6 to 0c21503 Compare June 29, 2023 18:29

add docs on task-specific buffering using threads

7f8d474

Co-Authored-By: Mason Protter <mason.protter@icloud.com>

IanButterworth force-pushed the ib/threads_data_race_docs branch from 0c21503 to 7f8d474 Compare June 29, 2023 18:34

IanButterworth requested a review from MasonProtter June 29, 2023 18:36

MasonProtter approved these changes Jun 29, 2023

View reviewed changes

IanButterworth added backport 1.9 Change should be backported to release-1.9 backport 1.10 Change should be backported to the 1.10 release merge me PR is reviewed. Merge when all tests are passing labels Jun 29, 2023

Moelf approved these changes Jun 29, 2023

View reviewed changes

IanButterworth merged commit 02f80c6 into JuliaLang:master Jun 30, 2023

IanButterworth deleted the ib/threads_data_race_docs branch June 30, 2023 00:12

IanButterworth removed the merge me PR is reviewed. Merge when all tests are passing label Jun 30, 2023

IanButterworth added a commit that referenced this pull request Jun 30, 2023

Add docs on task-specific buffering using multithreading (#48542)

a82990e

Co-authored-by: Mason Protter <mason.protter@icloud.com>

KristofferC mentioned this pull request Jul 11, 2023

release-1.9: Backports for 1.9.3 #50507

Merged

35 tasks

IanButterworth mentioned this pull request Aug 19, 2023

[release-1.9] Backports for Julia 1.9 #50977

Merged

31 tasks

IanButterworth added a commit that referenced this pull request Aug 19, 2023

Add docs on task-specific buffering using multithreading (#48542)

90f1735

Co-authored-by: Mason Protter <mason.protter@icloud.com> (cherry picked from commit 02f80c6)

IanButterworth mentioned this pull request Aug 19, 2023

[release-1.10] Backports for Julia 1.10.0-x #50971

Merged

58 tasks

IanButterworth added a commit that referenced this pull request Aug 19, 2023

Add docs on task-specific buffering using multithreading (#48542)

1094763

Co-authored-by: Mason Protter <mason.protter@icloud.com> (cherry picked from commit 02f80c6)

KristofferC removed the backport 1.10 Change should be backported to the 1.10 release label Oct 2, 2023

nalimilan pushed a commit that referenced this pull request Nov 5, 2023

Add docs on task-specific buffering using multithreading (#48542)

e2c5dfb

Co-authored-by: Mason Protter <mason.protter@icloud.com> (cherry picked from commit 02f80c6)

mofeing mentioned this pull request Oct 27, 2024

Backport ebae716 to v1.10 #56354

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add docs on task-specific buffering using multithreading #48542

Add docs on task-specific buffering using multithreading #48542

IanButterworth commented Feb 5, 2023 •

edited

Loading

vtjnash left a comment

vtjnash Feb 6, 2023

IanButterworth Jun 16, 2023

torfjelde Feb 6, 2023 •

edited

Loading

Moelf commented May 30, 2023 •

edited

Loading

vtjnash commented May 31, 2023

vtjnash commented May 31, 2023

KristofferC commented May 31, 2023

IanButterworth commented Jun 16, 2023

MasonProtter left a comment

IanButterworth commented Jun 29, 2023

MasonProtter left a comment

Add docs on task-specific buffering using multithreading #48542

Add docs on task-specific buffering using multithreading #48542

Conversation

IanButterworth commented Feb 5, 2023 • edited Loading

vtjnash left a comment

Choose a reason for hiding this comment

vtjnash Feb 6, 2023

Choose a reason for hiding this comment

IanButterworth Jun 16, 2023

Choose a reason for hiding this comment

torfjelde Feb 6, 2023 • edited Loading

Choose a reason for hiding this comment

Moelf commented May 30, 2023 • edited Loading

Problem with task local storage

Our ecosystem contains many silently incorrect code now without this

vtjnash commented May 31, 2023

vtjnash commented May 31, 2023

KristofferC commented May 31, 2023

IanButterworth commented Jun 16, 2023

MasonProtter left a comment

Choose a reason for hiding this comment

IanButterworth commented Jun 29, 2023

MasonProtter left a comment

Choose a reason for hiding this comment

IanButterworth commented Feb 5, 2023 •

edited

Loading

torfjelde Feb 6, 2023 •

edited

Loading

Moelf commented May 30, 2023 •

edited

Loading