
Fast entanglement detection based on entanglement candidates (suspects) #154

Merged: 13 commits, May 23, 2022

Conversation

shwestrick (Collaborator)

Implements a heap-local candidate (a.k.a. suspect) set of objects that might contain a down-pointer. Each candidate has a bit in its header, which the write barrier marks when a down-pointer is created. This bit is then used to accelerate the read barrier for entanglement detection:

  • Read-barrier fast path: if the object is not a candidate, skip the entanglement check entirely.
  • Read-barrier slow path: if the object is a candidate, perform a full entanglement check, which requires a call into the runtime and an SP-maintenance query (i.e., a computation-graph query).

The fast path is supported by the compiler: in ssa2-to-rssa, we generate the fast-path code inline, avoiding a runtime call in the non-candidate case.
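The barrier logic above can be sketched as follows. This is a minimal illustration, not the actual MPL runtime: the header layout, the reserved bit, and all names (`CANDIDATE_BIT`, `markCandidate`, `readBarrierCheck`, `fullEntanglementCheck`) are assumptions, and the slow path is stubbed out to keep the sketch self-contained.

```c
#include <stdbool.h>
#include <stdint.h>

/* Assumption: one spare header bit is reserved as the candidate
 * (suspect) flag. Not the MPL runtime's actual layout. */
#define CANDIDATE_BIT (UINT64_C(1) << 63)

typedef struct { uint64_t header; } Object;

/* Write barrier: when a down-pointer into obj is created, mark obj as
 * a candidate so later reads know it might contain a down-pointer. */
void markCandidate(Object *obj) {
  obj->header |= CANDIDATE_BIT;
}

/* Stub standing in for the expensive slow path: a call into the
 * runtime that performs an SP-maintenance (computation-graph) query.
 * Always reports "not entangled" here, purely for illustration. */
bool fullEntanglementCheck(Object *obj) {
  (void)obj;
  return false;
}

/* Read barrier: the fast path is a single bit test (the part the
 * compiler emits inline); only candidates fall through to the full
 * entanglement check. */
bool readBarrierCheck(Object *obj) {
  if (!(obj->header & CANDIDATE_BIT))
    return false;                     /* fast path: skip the check */
  return fullEntanglementCheck(obj);  /* slow path: runtime call */
}
```

The point of the split is that disentangled programs rarely create down-pointers, so almost every read takes the one-instruction bit test and never enters the runtime.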

Whenever a heap becomes a leaf, we clear the candidates within that heap (at that point, those objects are guaranteed to no longer have down-pointers).
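The clearing step might look like the following sketch. The heap representation (a flat array of object pointers) and all names are assumptions for illustration; the actual runtime presumably walks its own heap structure.

```c
#include <stddef.h>
#include <stdint.h>

/* Same illustrative header layout as before (an assumption, not the
 * real MPL object header). */
#define CANDIDATE_BIT (UINT64_C(1) << 63)

typedef struct { uint64_t header; } Object;

/* When a heap becomes a leaf, none of its objects can still have
 * down-pointers, so every candidate bit in the heap can be cleared in
 * one pass. Subsequent reads of these objects then take the fast path
 * again. */
void clearHeapCandidates(Object *objects[], size_t n) {
  for (size_t i = 0; i < n; i++)
    objects[i]->header &= ~CANDIDATE_BIT;
}
```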

Performance Improvement

The fast path works incredibly well. Here are some results from our recent experiments, measuring the performance improvement due to the fast path.

[Screenshot: benchmark results measuring the performance improvement due to the fast path]

Running time improvements are as much as 4x at scale.

Notice also:

  • Number of graph queries (full entanglement checks) reduced to 0 in most cases.
  • Number of candidate marks is small.
    • Number of candidate marks = number of times an object was marked as a candidate by the write barrier.
    • This is an upper bound on the number of distinct candidate objects. (The same object could be marked multiple times.)
    • Takeaway: in disentangled programs, down-pointers are either rare or highly consolidated (e.g. a single object could be responsible for all down-pointers).

Overall performance

Due to the fast-path improvements, the overall cost of entanglement detection is now essentially zero. Space overhead appears negligible across the board. For time overhead, we measured approximately 1% on average across 23 benchmarks, with a maximum of 7%; the majority of benchmarks (18 of 23) incur less than 2%.

@shwestrick (Collaborator, Author)

Did a bit of testing for this merge today and everything looks good.
