Don't make atomic loads and stores volatile #30962

Amanieu · 2016-01-16T20:18:31Z

Rust currently emits atomic loads and stores with the LLVM volatile qualifier. This is unnecessary and prevents LLVM from performing optimization on these atomic operations.

rust-highfive · 2016-01-16T20:18:44Z

r? @alexcrichton

(rust_highfive has picked a reviewer for you, use r? to override)

nagisa · 2016-01-17T01:10:57Z

I do not think we can make this change anymore. This has potential (and probably will) silently, and horribly, break certain programs (e.g. those that rely on atomics as a stable way to do volatile reads/stores). We should make new intrinsics+functions for non-volatile operations instead.

(I think we also had an issue/rfc somewhere to add volatile/non-volatile differentiation for atomic operations).

Amanieu · 2016-01-17T01:56:03Z

Does any code actually rely on this? The volatile semantics of atomic types were never documented anywhere and any code that would need it for MMIO is using nightly which has volatile read/write intrinsics.

Note that the volatile semantics only applied to atomic load and store, but not compare_and_swap or read-modify-write operations like fetch_add. This is very inconsistent if someone is relying on atomics providing volatile semantics.

Making atomic types have volatile semantics on some operations is a terrible default behavior to have, especially since it hurts compiler optimizations.

nagisa · 2016-01-17T11:52:59Z

Does any code actually rely on this?

The only way to really find out is to break a stable release and wait for complaints,

The volatile semantics of atomic types were never documented anywhere

I’ve seen atomic store/load recommended by, I think, @huonw (?) as a stable replacement for volatile loads and stores, at least once.

Note that the volatile semantics only applied to atomic load and store, but not compare_and_swap or read-modify-write operations like fetch_add. This is very inconsistent if someone is relying on atomics providing volatile semantics.

You needn’t volatile CAS/RMW at all if all you’re doing is writing and reading bytes out of a hardware port. Basically, one might rely on the volatility behaviour and not atomicity behaviour and ignoring all of the more complex atomic operations for their use cases.

Making atomic types have volatile semantics on some operations is a terrible default behavior to have, especially since it hurts compiler optimizations.

I do not disagree, I’m simply pointing out the potential hazard of making these ops non-volatile, now that their volatility, albeit undocumented, is stable.

Zoxc · 2016-01-17T12:10:20Z

Any code relying on atomics to be volatile is very misguided and is likely broken in other ways too. It is pretty well known that volatile is orthogonal to atomic operations.

huonw · 2016-01-17T12:20:14Z

@huonw (?)

Nope, although I did recommend against it in the thread on reddit, so you may've just mixed up the user names. :)

On that note, this change seems like the right choice to me. cc @rust-lang/libs and especially @aturon.

aturon · 2016-01-17T17:44:16Z

This seems reasonable to me, but I have to admit I don't fully grasp the subtleties around volatile accesses in LLVM. I think @rust-lang/compiler may be interested as well.

Gankra · 2016-01-17T17:55:25Z

Definition of LLVM's volatile, for reference: http://llvm.org/docs/LangRef.html#volatile-memory-accesses

Certain memory accesses, such as load‘s, store‘s, and llvm.memcpy‘s may be marked volatile. The optimizers must not change the number of volatile operations or change their order of execution relative to other volatile operations. The optimizers may change the order of volatile operations relative to non-volatile operations. This is not Java’s “volatile” and has no cross-thread synchronization behavior.

IR-level volatile loads and stores cannot safely be optimized into llvm.memcpy or llvm.memmove intrinsics even when those intrinsics are flagged volatile. Likewise, the backend should never split or merge target-legal volatile load/store instructions.

Rationale
Platforms may rely on volatile loads and stores of natively supported data width to be executed as single instruction. For example, in C this holds for an l-value of volatile primitive type with native hardware support, but not necessarily for aggregate types. The frontend upholds these expectations, which are intentionally unspecified in the IR. The rules above ensure that IR transformations do not violate the frontend’s contract with the language.

Gankra · 2016-01-17T17:57:19Z

I have a feeling this is why Arc couldn't be trivially pwned by a mem::forget loop. A Sufficiently Smart compiler should be able to optimize an Arc mem::forget loop into a single atomic add (I think), but it never did.

alexcrichton · 2016-01-17T19:25:43Z

To me this seems "correct" in a void (e.g. atomics should not be volatile by default), and along those lines I think we should pursue what we need to do to make this change. Unfortunately crater would not be great at evaluating a change such as this, but this is also why we have a nightly and beta period (e.g. the change will bake for ~12 weeks).

I would personally be in favor of merging close to after we branch the next beta to give this maximal time to bake, and we should be ready to back it out if necessary, but like @Amanieu I would expect very little breakage.

alexcrichton · 2016-01-17T19:26:15Z

An alternative, if it does cause breakage, would be to simultaneously add {load,store}_volatile which can be recommended if code breaks.

whitequark · 2016-01-19T02:15:45Z

Note that sequentially consistent atomic operations provide guarantees that are a strict superset of volatile. Assuming no downstream code uses the more finely grained acquire/release semantics, this PR would cause no functional change.

Amanieu · 2016-01-19T02:21:27Z

Note that sequentially consistent atomic operations provide guarantees that are a strict superset of volatile. Assuming no downstream code uses the more finely grained acquire/release semantics, this PR would cause no functional change.

That's not actually true. For example a compiler is allowed to optimize this:

x.fetch_add(2, SeqCst);
x.fetch_add(2, SeqCst);

into

x.fetch_add(4, SeqCst);

only if the operation is not volatile. Volatile semantics are only really useful for memory-mapped I/O where the compiler must preserve the exact sequence of loads and store in the program without merging or eliminating them.

whitequark · 2016-01-19T02:25:14Z

Hm, you're right, "no functional change" was too strong of an assertion. Still, I think a look at downstream code would be in order.

nikomatsakis · 2016-01-19T10:03:38Z

This is a tricky case. Given that the docs did not promise volatile
semantics, I think we are within our rights to make this change, but it
seems like it could affect some users -- it might be worth trying to
advertise the change to the "O/S" and "embedded" communities, for example.
(I'm not sure how best to reach those users.)

On Mon, Jan 18, 2016 at 9:25 PM, whitequark notifications@github.com
wrote:

Hm, you're right, "no functional change" was too strong of an assertion.
Still, I think a look at downstream code would be in order.

—
Reply to this email directly or view it on GitHub
#30962 (comment).

huonw · 2016-01-19T11:01:30Z

In practice, I expect there to be no functional change at the moment: compilers are generally pretty hands-off for atomics. For example, rustc doesn't optimise @Amanieu's example to a single instruction, and neither do any of gcc, clang or icc (for the C++11 equivalent below); it's always two lock xadd's.

#include<atomic>

void foo(std::atomic<long>& x) {
    x.fetch_add(2, std::memory_order_seq_cst);
    x.fetch_add(2, std::memory_order_seq_cst);
}

This is true even after pushing the ordering all the way down to relaxed.

And, e.g., LLVM's performance tips say:

Be wary of ordered and atomic memory operations. They are hard to optimize and may not be well optimized by the current optimizer. Depending on your source language, you may consider using fences instead.

(That's not to say there aren't examples that are optimised, but I haven't seen anything non-trivial: even x.store(1); x.store(2); isn't optimised to just x.store(2).)

brson · 2016-01-20T00:15:00Z

I also think we can do this as a bugfix, but we should advertise it.

nikomatsakis · 2016-01-20T01:17:58Z

Given @huonw's comment, seems like there's no reason not to make the change
(but yes, advertise it).

On Tue, Jan 19, 2016 at 7:15 PM, Brian Anderson notifications@github.com
wrote:

I also think we can do this as a bugfix, but we should advertise it.

—
Reply to this email directly or view it on GitHub
#30962 (comment).

sfackler · 2016-01-21T04:38:13Z

Nominating for discussion by the libs and lang teams. Sounds like people are on board with this, but it's probably a good idea to make sure everyone's are aware of what's going on.

Gankra · 2016-01-21T22:04:58Z

DIdn't the libs team already agree that this is fine?

aturon · 2016-01-30T01:00:43Z

The lang team is on board with this change. The main remaining question is whether to try to offer this functionality by some other means before landing the change (as suggested by @alexcrichton).

briansmith · 2016-02-02T21:58:09Z

This is a tricky case. Given that the docs did not promise volatile semantics, I think we are within our rights to make this change

FWIW, I agree with the above.

The main remaining question is whether to try to offer this functionality by some other means before landing the change (as suggested by @alexcrichton).

I don't think it is worth the effort. It is early days for Rust. People need to write code based on what is guaranteed on the language spec/docs, not based on what the compiler accidentally does. This change isn't going to break anybody who wrote correct code in the first place.

alexcrichton · 2016-02-04T00:39:46Z

The libs team discussed this during triage today and the conclusion was that everyone seems on board, and the only question is whether we need to also add {load,store}_volatile like I proposed above. At this time that doesn't seem urgent or pressing, so we decided to move forward with the PR.

@bors: r+ 01112d1

Thanks @Amanieu!

bors · 2016-02-04T02:46:45Z

⌛ Testing commit 01112d1 with merge 1096e7a...

Rust currently emits atomic loads and stores with the LLVM `volatile` qualifier. This is unnecessary and prevents LLVM from performing optimization on these atomic operations.

bors · 2016-02-04T06:07:25Z

☀️ Test successful - auto-linux-32-nopt-t, auto-linux-32-opt, auto-linux-64-debug-opt, auto-linux-64-nopt-t, auto-linux-64-opt, auto-linux-64-x-android-t, auto-linux-cross-opt, auto-linux-musl-64-opt, auto-mac-32-opt, auto-mac-64-nopt-t, auto-mac-64-opt, auto-mac-ios-opt, auto-win-gnu-32-nopt-t, auto-win-gnu-32-opt, auto-win-gnu-64-nopt-t, auto-win-gnu-64-opt, auto-win-msvc-32-opt, auto-win-msvc-64-opt

Don't make atomic loads and stores volatile

01112d1

rust-highfive assigned alexcrichton Jan 16, 2016

brson added the relnotes Marks issues that should be documented in the release notes of the next release. label Jan 20, 2016

sfackler added I-nominated T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Jan 21, 2016

sfackler added T-lang Relevant to the language team, which will review and decide on the PR/issue. and removed T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Jan 21, 2016

aturon removed the I-nominated label Jan 30, 2016

bors merged commit 01112d1 into rust-lang:master Feb 4, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't make atomic loads and stores volatile #30962

Don't make atomic loads and stores volatile #30962

Amanieu commented Jan 16, 2016

rust-highfive commented Jan 16, 2016

nagisa commented Jan 17, 2016

Amanieu commented Jan 17, 2016

nagisa commented Jan 17, 2016

Zoxc commented Jan 17, 2016

huonw commented Jan 17, 2016

aturon commented Jan 17, 2016

Gankra commented Jan 17, 2016

Gankra commented Jan 17, 2016

alexcrichton commented Jan 17, 2016

alexcrichton commented Jan 17, 2016

whitequark commented Jan 19, 2016

Amanieu commented Jan 19, 2016

whitequark commented Jan 19, 2016

nikomatsakis commented Jan 19, 2016

huonw commented Jan 19, 2016

brson commented Jan 20, 2016

nikomatsakis commented Jan 20, 2016

sfackler commented Jan 21, 2016

Gankra commented Jan 21, 2016

aturon commented Jan 30, 2016

briansmith commented Feb 2, 2016

alexcrichton commented Feb 4, 2016

bors commented Feb 4, 2016

bors commented Feb 4, 2016

Don't make atomic loads and stores volatile #30962

Don't make atomic loads and stores volatile #30962

Conversation

Amanieu commented Jan 16, 2016

rust-highfive commented Jan 16, 2016

nagisa commented Jan 17, 2016

Amanieu commented Jan 17, 2016

nagisa commented Jan 17, 2016

Zoxc commented Jan 17, 2016

huonw commented Jan 17, 2016

aturon commented Jan 17, 2016

Gankra commented Jan 17, 2016

Gankra commented Jan 17, 2016

alexcrichton commented Jan 17, 2016

alexcrichton commented Jan 17, 2016

whitequark commented Jan 19, 2016

Amanieu commented Jan 19, 2016

whitequark commented Jan 19, 2016

nikomatsakis commented Jan 19, 2016

huonw commented Jan 19, 2016

brson commented Jan 20, 2016

nikomatsakis commented Jan 20, 2016

sfackler commented Jan 21, 2016

Gankra commented Jan 21, 2016

aturon commented Jan 30, 2016

briansmith commented Feb 2, 2016

alexcrichton commented Feb 4, 2016

bors commented Feb 4, 2016

bors commented Feb 4, 2016