Use a persistent vector instead of an `Rc<[..]>` #2057

jneem · 2024-09-27T15:47:06Z

This is an experiment regarding array performance. Our current array representation (as essentially a Rc<[RichTerm]>) is problematic because it makes many common operations unnecessarily quadratic. This PR replaces it with a rpds::Vector<RichTerm> in reverse order, and the initial benchmarks look promising.

In more detail, the current arrays have a few performance characteristics that we might like to keep:

constant-time random access
constant-time slicing
some amount of memory sharing
The main problem is that there's no constant-time "cons" operation, and the concatenation operator xs @ ys is O(xs.len() + ys.len()). This makes many functional-style list functions (like the stdlib implementations of reverse and filter) quadratic in the length of their input.

The Vector implementation in the rpds crate is a "persistent vector" aka "bitmapped vector trie", which offers persistence/sharing, fast random access, and fast appends. We can do the same slicing trick that we're current using for Rc<[RichTerm]> to also add fast slicing. Thanks to fast appends, we can do concatenation xs @ ys in O(ys.len()) time (provided that there are no contracts that need to be applied to xs; I'm also ignoring logarithmic terms). This is backwards from the more common concatenation pattern in functional languages, so we store arrays backwards in order to get time O(xs.len()) instead. (We could achieve the minimum of the two by storing an array as two Vectors, a backwards one followed by a forwards one.)

There are a few ways in which rpds::Vector isn't a perfect fit:

it insists on wrapping the vector elements in Arc or Rc, which we don't want because RichTerm already has a shared pointer
it doesn't support many of the optimizations we had for arrays with a single reference
there's no fast iteration over slices (it's linear in the number of elements you skip at the beginning)

Despite these, this PR gives a 35% improvement in random normal, and a few other improvements between 10 and 20%. I'd like to also try benchmarking im and/or imbl.

github-actions · 2024-09-27T15:55:56Z

Bencher Report

Branch	2057/merge
Testbed	ubuntu-latest

⚠️ WARNING: The following Measure does not have a Threshold. Without a Threshold, no Alerts will ever be generated!
Latency
Click here to create a new Threshold
For more information, see the Threshold documentation.
To only post results if a Threshold exists, set the --ci-only-thresholds CLI flag.

Click to view all benchmark results

Benchmark	Latency	nanoseconds (ns)
fibonacci 10	📈 view plot ⚠️ NO THRESHOLD	492,610.00
foldl arrays 50	📈 view plot ⚠️ NO THRESHOLD	1,742,300.00
foldl arrays 500	📈 view plot ⚠️ NO THRESHOLD	6,623,400.00
foldr strings 50	📈 view plot ⚠️ NO THRESHOLD	7,128,100.00
foldr strings 500	📈 view plot ⚠️ NO THRESHOLD	61,214,000.00
generate normal 250	📈 view plot ⚠️ NO THRESHOLD	45,508,000.00
generate normal 50	📈 view plot ⚠️ NO THRESHOLD	2,025,400.00
generate normal unchecked 1000	📈 view plot ⚠️ NO THRESHOLD	3,432,500.00
generate normal unchecked 200	📈 view plot ⚠️ NO THRESHOLD	759,960.00
pidigits 100	📈 view plot ⚠️ NO THRESHOLD	3,170,700.00
pipe normal 20	📈 view plot ⚠️ NO THRESHOLD	1,514,300.00
pipe normal 200	📈 view plot ⚠️ NO THRESHOLD	9,980,000.00
product 30	📈 view plot ⚠️ NO THRESHOLD	834,630.00
scalar 10	📈 view plot ⚠️ NO THRESHOLD	1,545,100.00
sum 30	📈 view plot ⚠️ NO THRESHOLD	826,770.00

🐰 View full continuous benchmarking report in Bencher

yannham · 2024-09-27T15:59:19Z

I'll see what it gives on the private benchmark

jneem · 2024-09-30T05:30:42Z

I tried out imbl, but the performance is worse than rpds.

jneem · 2024-10-10T05:01:00Z

The current version uses a custom re-implementation of persistent vectors, and it seems to be a performance win across the board.

jneem · 2024-10-15T02:59:57Z

Github CI agrees with my local benchmarking: this gives modest gains in general, and big gains whenever quadratic array behavior is the bottleneck.

I don't see a nice UI for comparing results to master, but here is the report for this PR and here is the report for master.

yannham · 2024-10-15T12:11:32Z

@jneem Have you tried on the private bench?

jneem · 2024-10-15T13:30:38Z

Yes, I forgot to mention that. Performance is basically identical on all three sizes.

yannham

Could be nice to add a description to the vector crate.

I'm not too intimate with bitmapped vector tries, so I can't say I'm 100% sure that the whole implementation is flawless, but the general approach looks sane, the testing is also solid, and it's been thoroughly benchmarked.

core/src/eval/operation.rs

yannham · 2024-10-15T13:34:55Z

core/src/transform/free_vars.rs

+                let new_ts = ts
+                    .iter()
+                    .cloned()
+                    .map(|mut t| {
+                        t.collect_free_vars(free_vars);
+                        t
+                    })
+                    .collect();
+                *ts = new_ts;


This is my pet peeve, but I feel like this should be an imperative for, as we're just walking a structure and applying a mutation. Or is it that you just don't want to bother implementing iter_mut()?

Ok, I went ahead and did mutable iteration. It has a bit more copy-paste from the other iterators, unfortunately, but maybe that will be more motivation to figure out a generic version...

yannham · 2024-10-15T13:37:24Z

vector/src/lib.rs

+//! [`Vector`] is a persistent vector (also known as a "bitmapped vector trie")
+//! with cheap clones and efficient copy-on-write modifications. [`Slice`]
+//! backs the implementation of arrays in Nickel. It's basically a [`Vector`]
+//! with support for slicing.


Have you followed a particular paper or source to implement them? Or took inspiration from another crate? If yes, it could be good to link it there.

Mostly just https://hypirion.com/musings/understanding-persistent-vector-pt-1, which I linked a little later. I looked at rpds to see what choices they were making, but I didn't really imitate their implementation otherwise.

vector/src/lib.rs

vector/src/slice.rs

vector/src/vector.rs

yannham · 2024-10-15T14:02:58Z

vector/src/vector.rs

+    }
+}
+
+/// [`Vector`] is a persistent vector (also known as a "bitmapped vector trie").


Nitpick: I feel like this should be the module's documentation, and not Vector.

yannham · 2024-10-16T08:14:17Z

By the way, I had another side question: could this representation take advantage of the in-place modification when a value is 1-RC ? I think the answer is yes, from what I remember reading Clojure's persistent array blog post, but just to make sure.

jneem · 2024-10-16T08:40:48Z

could this representation take advantage of the in-place modification when a value is 1-RC ?

Yep, that should be the case already. We use Rc::make_mut for all the modifications, so it should have the in-place behavior whenever possible (including for subtrees -- if the root tree is shared but some subtree is uniquely owned, the root block will be copied but the uniquely owned part will be mutated). I'll add it to the module docs.

yannham · 2024-10-16T12:30:43Z

The last commit fails some test on Windows only it seems (but I think there is property-based testing, so the Windows part might be a red herring and it's just that some random path leading to the panic):

thread 'array_mutations' panicked at vector\tests\arbtest.rs:85:27:
attempt to calculate the remainder with a divisor of zero

github-actions bot temporarily deployed to pull request September 27, 2024 15:49 Inactive

github-actions bot temporarily deployed to pull request September 27, 2024 16:25 Inactive

github-actions bot temporarily deployed to pull request September 27, 2024 16:50 Inactive

jneem force-pushed the array-perf branch from ec4a3a0 to fb69c86 Compare October 10, 2024 03:52

jneem added 7 commits October 10, 2024 15:01

experiment with rpds vectors

d3a669c

warnings

01282b2

clippy

4539bcd

Try out funcarray

b851d00

Optimizations, maybe

3b970ae

Remove some unnecessary clones

f2c727f

Rebase cleanup

2a40f16

jneem force-pushed the array-perf branch from 1d63967 to 2a40f16 Compare October 10, 2024 08:05

jneem added 9 commits October 10, 2024 17:46

Try not reversing

4c6bf05

Update doc

1c36a0f

Remove unnecessary files

99b81b5

Don't allocate for an empty vector

9d49f15

Documentation, and more consistent param orders

ee8c4bb

Start API docs for Vector

1ba1e11

Finish API docs for vector

860d629

Correct some slice API docs

a4d2aea

More docs

71cace4

jneem marked this pull request as ready for review October 15, 2024 03:00

jneem requested a review from yannham October 15, 2024 03:00

jneem changed the title ~~experiment with rpds vectors~~ Use a persistent vector instead of an Rc<[..]> Oct 15, 2024

yannham approved these changes Oct 15, 2024

View reviewed changes

Add mutable iteration

d459c4b

Use the mutable iterators, and more docs

24270dc

jneem enabled auto-merge October 16, 2024 09:03

jneem added this pull request to the merge queue Oct 16, 2024

Merged via the queue into master with commit 3506821 Oct 16, 2024
4 of 5 checks passed

jneem deleted the array-perf branch October 16, 2024 09:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use a persistent vector instead of an `Rc<[..]>` #2057

Use a persistent vector instead of an `Rc<[..]>` #2057

jneem commented Sep 27, 2024

github-actions bot commented Sep 27, 2024 •

edited

Loading

yannham commented Sep 27, 2024

jneem commented Sep 30, 2024

jneem commented Oct 10, 2024

jneem commented Oct 15, 2024

yannham commented Oct 15, 2024

jneem commented Oct 15, 2024

yannham left a comment

yannham Oct 15, 2024

jneem Oct 16, 2024

yannham Oct 15, 2024

jneem Oct 16, 2024

yannham Oct 15, 2024

yannham commented Oct 16, 2024

jneem commented Oct 16, 2024

yannham commented Oct 16, 2024

Use a persistent vector instead of an Rc<[..]> #2057

Use a persistent vector instead of an Rc<[..]> #2057

Conversation

jneem commented Sep 27, 2024

github-actions bot commented Sep 27, 2024 • edited Loading

Bencher Report

yannham commented Sep 27, 2024

jneem commented Sep 30, 2024

jneem commented Oct 10, 2024

jneem commented Oct 15, 2024

yannham commented Oct 15, 2024

jneem commented Oct 15, 2024

yannham left a comment

Choose a reason for hiding this comment

yannham Oct 15, 2024

Choose a reason for hiding this comment

jneem Oct 16, 2024

Choose a reason for hiding this comment

yannham Oct 15, 2024

Choose a reason for hiding this comment

jneem Oct 16, 2024

Choose a reason for hiding this comment

yannham Oct 15, 2024

Choose a reason for hiding this comment

yannham commented Oct 16, 2024

jneem commented Oct 16, 2024

yannham commented Oct 16, 2024

Use a persistent vector instead of an `Rc<[..]>` #2057

Use a persistent vector instead of an `Rc<[..]>` #2057

github-actions bot commented Sep 27, 2024 •

edited

Loading