Add intrusive iterators to BTree. #19796

cgaebel · 2014-12-13T01:15:51Z

Unfortunately, tree structures are intrinsically slower to
iterate over externally than internally. This can be
demonstrated in benchmarks. In fact, it's so bad at external
iteration that calling .find on each element in succession
is currently slightly faster.

This patch implements a faster intrusive way to iterate over
BTrees. This is about 5x faster, but you lose all iterator
composition infrastructure. This is a tradeoff that is
acceptable in some applications.

Relevant benchmarks:

test btree::map::bench::intrusive_iter_1000                ... bench:      2658 ns/iter (+/- 602)
test btree::map::bench::intrusive_iter_100000              ... bench:    346353 ns/iter (+/- 189565)
test btree::map::bench::intrusive_iter_20                  ... bench:        55 ns/iter (+/- 16)
test btree::map::bench::iter_1000                          ... bench:     15892 ns/iter (+/- 3717)
test btree::map::bench::iter_100000                        ... bench:   1383714 ns/iter (+/- 444706)
test btree::map::bench::iter_20                            ... bench:       366 ns/iter (+/- 104)

r? @gankro @huonw @gereeter

@aturon how does this fit into 1.0 stabilization plans. Is
marking this as #[experimental] enough?

@huonw

Unfortunately, tree structures are intrinsically slower to iterate over externally than internally. This can be demonstrated in benchmarks. In fact, it's so bad at external iteration that calling `.find` on each element in succession is currently slightly faster. This patch implements a faster intrusive way to iterate over BTrees. This is about 5x faster, but you lose all iterator composition infrastructure. This is a tradeoff that is acceptable in some applications. Relevant benchmarks: ``` test btree::map::bench::intrusive_iter_1000 ... bench: 2658 ns/iter (+/- 602) test btree::map::bench::intrusive_iter_100000 ... bench: 346353 ns/iter (+/- 189565) test btree::map::bench::intrusive_iter_20 ... bench: 55 ns/iter (+/- 16) test btree::map::bench::iter_1000 ... bench: 15892 ns/iter (+/- 3717) test btree::map::bench::iter_100000 ... bench: 1383714 ns/iter (+/- 444706) test btree::map::bench::iter_20 ... bench: 366 ns/iter (+/- 104) ``` r? @gankro @huonw @aturon how does this fit into 1.0 stabilization plans. Is marking this as #[experimental] enough?

Gankra · 2014-12-13T01:43:24Z

src/libcollections/btree/map.rs

+            }
+        }
+
+        intrusive_into_iter_impl(self.root, &mut f);


Do you believe that the recursive version will be faster/more efficient than an explicit stack?

I do, but only because the benchmarks as written agree.

Gankra · 2014-12-14T00:51:46Z

Just like to say that this is really unfortunate to have to provide for performance. 😞

bluss · 2014-12-15T01:11:41Z

What's the B value and how do the results compare with a bit larger fan out?

My immediate thought was to micro optimizations, like replacing Zip with an iterator that knows to do the bounds check once, but I realize now that the Zip part is common to both alternatives in this comparison. Is this a hint that a simpler b-tree external iterator can be written (maybe without range support?)

Gankra · 2014-12-15T02:19:44Z

I believe @pczarn was investigating this.

One "easy" one would be to not support DoubleEnded.

Another is we could probably ditch the RingBufs for Vecs, since pop_front should only happen if you're mixing next and next_back, which should already be slow. Also they can physically only get to like max ~40 depth or something.

Gankra · 2014-12-22T16:08:46Z

I'd like to flesh these ideas out out-of-tree for now. In particular https://github.com/reem/rust-traverse and https://github.com/Gankro/collect-rs/ are exploring this space right now.

Only a performance issue, not a back-compat hazard.

rust-highfive assigned Gankra Dec 13, 2014

cgaebel mentioned this pull request Dec 13, 2014

Make BTree's internals safer and do more checks at compile time instead of run time #19782

Merged

cgaebel force-pushed the btree-intrusive-iter branch from 605c97b to def348d Compare December 13, 2014 01:17

Gankra reviewed Dec 13, 2014
View reviewed changes

gereeter mentioned this pull request Dec 14, 2014

Add ManuallyDrop lang item for precise control of destructors #19822

Closed

Gankra closed this Dec 22, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add intrusive iterators to BTree. #19796

Add intrusive iterators to BTree. #19796

cgaebel commented Dec 13, 2014

Gankra Dec 13, 2014

cgaebel Dec 13, 2014

Gankra commented Dec 14, 2014

bluss commented Dec 15, 2014

Gankra commented Dec 15, 2014

Gankra commented Dec 22, 2014

Add intrusive iterators to BTree. #19796

Add intrusive iterators to BTree. #19796

Conversation

cgaebel commented Dec 13, 2014

Gankra Dec 13, 2014

Choose a reason for hiding this comment

cgaebel Dec 13, 2014

Choose a reason for hiding this comment

Gankra commented Dec 14, 2014

bluss commented Dec 15, 2014

Gankra commented Dec 15, 2014

Gankra commented Dec 22, 2014