Allow for better optimizations of iterators for zero-sized types #25434

dotdash · 2015-05-15T13:31:01Z

Using regular pointer arithmetic to iterate collections of zero-sized types
doesn't work, because we'd get the same pointer all the time. Our
current solution is to convert the pointer to an integer, add an offset
and then convert back, but this inhibits certain optimizations.

What we should do instead is to convert the pointer to one that points
to an i8*, and then use a LLVM GEP instructions without the inbounds
flag to perform the pointer arithmetic. This allows to generate pointers
that point outside allocated objects without causing UB (as long as you
don't dereference them), and it wraps around using two's complement,
i.e. it behaves exactly like the wrapping_* operations we're currently
using, with the added benefit of LLVM being able to better optimize the
resulting IR.

Using regular pointer arithmetic to iterate collections of zero-sized types doesn't work, because we'd get the same pointer all the time. Our current solution is to convert the pointer to an integer, add an offset and then convert back, but this inhibits certain optimizations. What we should do instead is to convert the pointer to one that points to an i8*, and then use a LLVM GEP instructions without the inbounds flag to perform the pointer arithmetic. This allows to generate pointers that point outside allocated objects without causing UB (as long as you don't dereference them), and it wraps around using two's complement, i.e. it behaves exactly like the wrapping_* operations we're currently using, with the added benefit of LLVM being able to better optimize the resulting IR.

rust-highfive · 2015-05-15T13:31:12Z

r? @nikomatsakis

(rust_highfive has picked a reviewer for you, use r? to override)

alexcrichton · 2015-05-15T16:15:21Z

Is it undefined behavior in LLVM to use offset to get out of bounds, or just to dereference the value? If it's only actually using the resulting value, could we manufacture a base pointer for all iterators over 0-sized types?

dotdash · 2015-05-15T22:13:13Z

Yes, offset uses an inbounds GEP, and for those even just generating a pointer that does not point inside or immediately behind an allocated object results in a poison value. That's why the new intrinsic is required.

Not sure what you mean by the base pointer thing.

alexcrichton · 2015-05-16T17:42:04Z

We talked on IRC about this a bit, and my concerns were alleviated, thanks @dotdash!

@bors: r+ eeeb2cc

Using regular pointer arithmetic to iterate collections of zero-sized types doesn't work, because we'd get the same pointer all the time. Our current solution is to convert the pointer to an integer, add an offset and then convert back, but this inhibits certain optimizations. What we should do instead is to convert the pointer to one that points to an i8\*, and then use a LLVM GEP instructions without the inbounds flag to perform the pointer arithmetic. This allows to generate pointers that point outside allocated objects without causing UB (as long as you don't dereference them), and it wraps around using two's complement, i.e. it behaves exactly like the wrapping_* operations we're currently using, with the added benefit of LLVM being able to better optimize the resulting IR.

bors · 2015-05-16T19:17:31Z

⌛ Testing commit eeeb2cc with merge d332aea...

bors · 2015-05-16T20:53:40Z

☀️ Test successful - auto-linux-32-nopt-t, auto-linux-32-opt, auto-linux-64-nopt-t, auto-linux-64-opt, auto-linux-64-x-android-t, auto-mac-32-opt, auto-mac-64-nopt-t, auto-mac-64-opt, auto-win-32-nopt-t, auto-win-32-opt, auto-win-64-nopt-t, auto-win-64-opt

rust-highfive assigned nikomatsakis May 15, 2015

bors merged commit eeeb2cc into rust-lang:master May 16, 2015

dotdash deleted the gep branch July 27, 2015 08:49

eddyb mentioned this pull request Nov 10, 2017

Safe Rust code miscompilation due to a bug in LLVM's Global Value Numbering #45839

Closed

alexcrichton mentioned this pull request Jul 16, 2018

slices: fix ZST slice iterators making up pointers; debug_assert alignment in from_raw_parts #52206

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow for better optimizations of iterators for zero-sized types #25434

Allow for better optimizations of iterators for zero-sized types #25434

dotdash commented May 15, 2015

rust-highfive commented May 15, 2015

alexcrichton commented May 15, 2015

dotdash commented May 15, 2015

alexcrichton commented May 16, 2015

bors commented May 16, 2015

bors commented May 16, 2015

Allow for better optimizations of iterators for zero-sized types #25434

Allow for better optimizations of iterators for zero-sized types #25434

Conversation

dotdash commented May 15, 2015

rust-highfive commented May 15, 2015

alexcrichton commented May 15, 2015

dotdash commented May 15, 2015

alexcrichton commented May 16, 2015

bors commented May 16, 2015

bors commented May 16, 2015