[rustc_ty_utils] Add the LLVM `noalias` parameter attribute to `drop_in_place` in certain cases. #103614

pcwalton · 2022-10-27T03:45:59Z

LLVM can make use of the noalias parameter attribute on the parameter to drop_in_place in areas like argument promotion. Because the Rust compiler fully controls the code for drop_in_place, it can soundly deduce parameter attributes on it. In the case of a value that has a programmer-defined Drop implementation, we know that the first thing drop_in_place will do is pass a pointer to the object to Drop::drop. Drop::drop takes &mut, so it must be guaranteed that there are no pointers to the object upon entering that function. Therefore, it should be safe to mark noalias there.

With this patch, we mark noalias only when the type is a value with a programmer-defined Drop implementation. This is possibly overly conservative, but I thought that proceeding cautiously was best in this instance.
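To make the reasoning above concrete, here is a minimal sketch of the situation the description has in mind: a type with a programmer-defined `Drop` implementation being dropped through a raw pointer. The type `Noisy`, the counter `NOISY_DROPS`, and the helper function are hypothetical names invented for illustration; only `ptr::drop_in_place`, `ManuallyDrop`, and the `Drop` trait are real standard library items.

```rust
use std::mem::ManuallyDrop;
use std::ptr;
use std::sync::atomic::{AtomicUsize, Ordering};

// Hypothetical counter used only to observe that `Drop::drop` ran.
static NOISY_DROPS: AtomicUsize = AtomicUsize::new(0);

// Hypothetical type with a programmer-defined `Drop` implementation.
struct Noisy {
    x: i32,
}

impl Drop for Noisy {
    // `self` is `&mut Noisy`, so for the duration of this call nothing
    // else may access the value through another pointer.
    fn drop(&mut self) {
        self.x = 0;
        NOISY_DROPS.fetch_add(1, Ordering::SeqCst);
    }
}

// Drops a `Noisy` through a raw pointer and reports how many times
// `Drop::drop` ran as a result.
fn drops_from_manual_drop_in_place() -> usize {
    let before = NOISY_DROPS.load(Ordering::SeqCst);
    // `ManuallyDrop` prevents the automatic drop at end of scope,
    // so the value is dropped exactly once, by the call below.
    let mut slot = ManuallyDrop::new(Noisy { x: 7 });
    let p: *mut Noisy = &mut *slot;
    // SAFETY: `p` is non-null, aligned, points to a live `Noisy`,
    // and the value is not used again afterwards.
    unsafe { ptr::drop_in_place(p) };
    NOISY_DROPS.load(Ordering::SeqCst) - before
}
```

The raw pointer `p` is handed to `drop_in_place`, which in turn hands a `&mut Noisy` to `Drop::drop`; that `&mut` is what motivates treating the parameter as `noalias`.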

rustbot · 2022-10-27T03:46:06Z

r? @jackh726

(rustbot has picked a reviewer for you, use r? to override)

pcwalton · 2022-10-27T18:44:09Z

r? @oli-obk

(Feel free to reassign review to someone else) :)

compiler/rustc_ty_utils/src/abi.rs

oli-obk · 2022-10-31T13:19:02Z

Please also add comments on drop_in_place in libstd to make sure anyone touching it is aware they should also touch this logic

pcwalton · 2022-11-03T09:23:23Z

Per discussions on Zulip, I'm going to change this to a draft while we wait on Miri results to figure out what the impact of this change will be.

…in_place` in certain cases. LLVM can make use of the `noalias` parameter attribute on the parameter to `drop_in_place` in areas like argument promotion. Because the Rust compiler fully controls the code for `drop_in_place`, it can soundly deduce parameter attributes on it. In the case of a value that has a programmer-defined Drop implementation, we know that the first thing `drop_in_place` will do is pass a pointer to the object to `Drop::drop`. `Drop::drop` takes `&mut`, so it must be guaranteed that there are no pointers to the object upon entering that function. Therefore, it should be safe to mark `noalias` there. With this patch, we mark `noalias` only when the type is a value with a programmer-defined Drop implementation. This is possibly overly conservative, but I thought that proceeding cautiously was best in this instance.

… unconditionally. We've done measurements with Miri and have determined that `noalias` shouldn't break code. The requirements that allow us to add `dereferenceable` and `align` have long been documented in the standard library documentation.

rustbot · 2022-11-18T00:14:42Z

Hey! It looks like you've submitted a new PR for the library teams!

If this PR contains changes to any rust-lang/rust public library APIs then please comment with @rustbot label +T-libs-api -T-libs to tag it appropriately. If this PR contains changes to any unstable APIs please edit the PR description to add a link to the relevant API Change Proposal or create one if you haven't already. If you're unsure where your change falls no worries, just leave it as is and the reviewer will take a look and make a decision to forward on if necessary.

Examples of T-libs-api changes:

Stabilizing library features
Introducing insta-stable changes such as new implementations of existing stable traits on existing stable types
Introducing new or changing existing unstable library APIs (excluding permanently unstable features / features without a tracking issue)
Changing public documentation in ways that create new stability guarantees
Changing observable runtime behavior of library APIs

pcwalton · 2022-11-18T00:15:03Z

r? @oli-obk

cc @RalfJung @saethlin

rustbot · 2022-11-18T00:15:05Z

Could not assign reviewer from: oli-obk.
User(s) oli-obk are either the PR author or are already assigned, and there are no other candidates.
Use r? to specify someone else to assign.

saethlin · 2022-11-18T00:33:38Z

library/core/src/ptr/mod.rs

 ///
-/// * The value `to_drop` points to must be valid for dropping, which may mean it must uphold
-///   additional invariants - this is type-dependent.
+/// * While `drop_in_place` is executing, the only way to access parts of


@RalfJung Would it be better to say

Suggested change

-/// * While `drop_in_place` is executing, the only way to access parts of
+/// * As soon as `drop_in_place` begins executing, the only way to access parts of

What is the difference you are getting at here?

Once `drop_in_place` returns, old references can be used again, at least as far as the aliasing model goes.
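The distinction between "while executing" and "as soon as it begins executing" can be illustrated with an ordinary function that takes `&mut`. This is only a sketch of the general aliasing pattern, not of `drop_in_place` itself; `bump` and `reuse_raw_pointer_after_call` are hypothetical names.

```rust
// Hypothetical callee: while it runs, `v` is the only valid way to
// access the pointed-to integer.
fn bump(v: &mut i32) {
    *v += 1;
}

// Uses a raw pointer before and after a call that reborrows it as `&mut`.
fn reuse_raw_pointer_after_call() -> i32 {
    let mut x = 1;
    let p: *mut i32 = &mut x;
    unsafe {
        // During the call, the exclusive access belongs to the `&mut`
        // argument created from `p`.
        bump(&mut *p);
        // After the call has returned, the older raw pointer `p` may be
        // used again.
        *p += 10;
        *p
    }
}
```

The restriction on other pointers lasts only for the duration of the call, which is why the "while ... is executing" wording describes the requirement more precisely.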

pcwalton · 2022-11-18T05:07:58Z

This patch reduces the number of stack-to-stack copies from 2.50% to 2.36%, a relative decrease of roughly 6% (0.14 / 2.50 ≈ 5.6%).

RalfJung · 2022-11-18T07:34:12Z

The Miri version of this is #103957. I'd prefer if we could land that first.

thomcc · 2022-11-18T07:57:41Z

Just out of idle curiosity¹...

@bors try @rust-timer queue

¹ I don't expect tons, but who knows how many of those stack/stack copies are in important places 👀

bors · 2022-11-18T07:57:49Z

⌛ Trying commit 53f21aa with merge 7250e7fb50ffb04be8243ed3f0d9baa1caa63d4a...

pcwalton · 2022-11-18T08:40:35Z

> The Miri version of this is #103957. I'd prefer if we could land that first.

As long as it doesn't take too long. That PR has been sitting idle for 5 days. I don't want this to bitrot.

RalfJung · 2022-11-18T09:21:20Z

Bitrot risk for this PR seems low and 5 days is not very long. I don't think this is an urgent change either.

RalfJung · 2022-11-18T09:23:40Z

library/core/src/ptr/mod.rs

+/// Immediately upon executing, `drop_in_place` takes out a mutable borrow on the
+/// pointed-to-value. Effectively, this function is implemented like so:
+///
+/// ```
+/// # struct Foo { x: i32 }
+/// unsafe fn drop_in_place(to_drop: *mut Foo) {
+///     let mut value = &mut *to_drop;
+///     // ... drop the fields of `value` ...
+/// }
+/// ```


What we actually do is stronger than that: we have an &mut function argument. That makes a difference because for noalias, scope matters.

So if you want to write this in code, I'd suggest something like

/// ```
/// # struct Foo { x: i32 }
/// unsafe fn drop_in_place(to_drop: *mut Foo) {
///     drop_in_place_inner(&mut *to_drop);
///     unsafe fn drop_in_place_inner(to_drop: &mut Foo) {
///         // ... drop the fields of `value` ...
///     }
/// }
/// ```

RalfJung · 2022-11-18T09:25:40Z

library/core/src/ptr/mod.rs

+/// * `to_drop` must be properly aligned, even if T has size 0.
+///
+/// * `to_drop` must be nonnull, even if T has size 0.


Suggested change

-/// * `to_drop` must be properly aligned, even if T has size 0.
-///
-/// * `to_drop` must be nonnull, even if T has size 0.
+/// * `to_drop` must be properly aligned, even if `T` has size 0.
+///
+/// * `to_drop` must be nonnull, even if `T` has size 0.
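As a sketch of what the two requirements above mean for a zero-sized type: even though no memory is read or written, the pointer still has to be non-null and aligned. The type `Token`, the counter `TOKEN_DROPS`, and the helper function are hypothetical names for illustration; `NonNull::dangling` is the standard library's way to obtain a non-null, well-aligned pointer without an allocation.

```rust
use std::mem;
use std::ptr::{self, NonNull};
use std::sync::atomic::{AtomicUsize, Ordering};

// Hypothetical counter used only to observe that `Drop::drop` ran.
static TOKEN_DROPS: AtomicUsize = AtomicUsize::new(0);

// Hypothetical zero-sized type with a `Drop` implementation.
struct Token;

impl Drop for Token {
    fn drop(&mut self) {
        TOKEN_DROPS.fetch_add(1, Ordering::SeqCst);
    }
}

// Drops a zero-sized value through a dangling (but non-null and aligned)
// pointer and reports how many times `Drop::drop` ran.
fn drop_zst_through_dangling_pointer() -> usize {
    let before = TOKEN_DROPS.load(Ordering::SeqCst);
    // Give up ownership of one `Token` without running its destructor.
    mem::forget(Token);
    let p: *mut Token = NonNull::<Token>::dangling().as_ptr();
    // The two documented requirements, checked explicitly.
    assert!(!p.is_null());
    assert_eq!(p as usize % mem::align_of::<Token>(), 0);
    // SAFETY (as I understand the documented rules): `p` is non-null and
    // aligned, and a zero-sized access needs no backing memory.
    unsafe { ptr::drop_in_place(p) };
    TOKEN_DROPS.load(Ordering::SeqCst) - before
}
```

A null pointer would violate the stated requirement even here, which is what lets the compiler attach attributes like `align` without special-casing zero-sized types.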

bors · 2022-11-18T10:19:52Z

☀️ Try build successful - checks-actions
Build commit: 7250e7fb50ffb04be8243ed3f0d9baa1caa63d4a (7250e7fb50ffb04be8243ed3f0d9baa1caa63d4a)

rust-timer · 2022-11-18T12:46:25Z

Finished benchmarking commit (7250e7fb50ffb04be8243ed3f0d9baa1caa63d4a): comparison URL.

Overall result: ❌✅ regressions and improvements - ACTION NEEDED

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please fix the regressions and do another perf run. If the next run shows neutral or positive results, the label will be automatically removed.

@bors rollup=never
@rustbot label: +S-waiting-on-review -S-waiting-on-perf +perf-regression

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | 0.5% | [0.4%, 0.7%] | 7 |
| Regressions ❌ (secondary) | 3.2% | [0.5%, 9.5%] | 7 |
| Improvements ✅ (primary) | -0.6% | [-0.6%, -0.6%] | 1 |
| Improvements ✅ (secondary) | - | - | 0 |
| All ❌✅ (primary) | 0.4% | [-0.6%, 0.7%] | 8 |

Max RSS (memory usage)

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | - | - | 0 |
| Improvements ✅ (primary) | -0.9% | [-0.9%, -0.9%] | 1 |
| Improvements ✅ (secondary) | - | - | 0 |
| All ❌✅ (primary) | -0.9% | [-0.9%, -0.9%] | 1 |

Cycles

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | 0.9% | [0.7%, 1.1%] | 7 |
| Regressions ❌ (secondary) | 4.3% | [1.2%, 9.2%] | 5 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | - | - | 0 |
| All ❌✅ (primary) | 0.9% | [0.7%, 1.1%] | 7 |

RalfJung · 2022-11-18T12:52:04Z

Some of that regression is probably caused by emitting more attributes.

thomcc · 2022-11-18T17:15:15Z

Yeah, the classic problem of measuring performance changes by measuring the compiler itself.

pcwalton · 2022-11-18T20:36:25Z

If folks are concerned about the performance regression, I can disable this in incremental compilation mode. It doesn't sit that well with me though—it's essentially just improving compilation performance by artificially making LLVM able to optimize less.

pcwalton · 2022-11-18T20:38:05Z

Oh, also keep in mind that I'm testing against LLVM 16, while the benchmark presumably isn't.

thomcc · 2022-11-18T20:40:53Z

> If folks are concerned about the performance regression, I can disable this in incremental compilation mode

I can't speak to anybody else, but I'm not concerned. Or, my concern is just that the value of this kind of thing can't really be measured using our existing perf suite, but that's an existing issue that shouldn't impact this PR (IMO).

oli-obk · 2022-12-13T08:06:55Z

compiler/rustc_ty_utils/src/abi.rs

+    let is_drop_in_place = match (cx.tcx.lang_items().drop_in_place_fn(), fn_def_id) {
+        (Some(drop_in_place_fn), Some(fn_def_id)) => drop_in_place_fn == fn_def_id,
+        _ => false,
+    };


Suggested change

-    let is_drop_in_place = match (cx.tcx.lang_items().drop_in_place_fn(), fn_def_id) {
-        (Some(drop_in_place_fn), Some(fn_def_id)) => drop_in_place_fn == fn_def_id,
-        _ => false,
-    };
+    let is_drop_in_place = cx.tcx.lang_items().drop_in_place_fn() == Some(fn_def_id);
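One detail worth spelling out: the original `match` destructures `fn_def_id` with `Some(..)`, which suggests it is an `Option` at this point. Assuming that is the case, comparing against `Some(fn_def_id)` would not type-check as written, and a plain `==` between the two options would also report a match when both are `None`. The sketch below uses a hypothetical stand-in `DefId` type and hypothetical function names to show a one-line form that keeps the behavior of the `match`; it is not the compiler's actual code.

```rust
// Hypothetical stand-in for the compiler's `DefId`.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
struct DefId(u32);

// Behavior of the original `match` from the diff.
fn is_drop_in_place_match(lang_item: Option<DefId>, fn_def_id: Option<DefId>) -> bool {
    match (lang_item, fn_def_id) {
        (Some(drop_in_place_fn), Some(fn_def_id)) => drop_in_place_fn == fn_def_id,
        _ => false,
    }
}

// One-line form: the `is_some()` guard rules out the `None == None` case.
fn is_drop_in_place_short(lang_item: Option<DefId>, fn_def_id: Option<DefId>) -> bool {
    fn_def_id.is_some() && fn_def_id == lang_item
}
```

Both functions agree on all four combinations of `Some` and `None`, so the shorter form can replace the `match` without changing which functions are recognized.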

oli-obk · 2022-12-13T08:08:21Z

@rustbot author

perf regression is entirely in LLVM, so since this causes actual runtime improvements, and the regressions are small in primary tests, this lgtm

RalfJung · 2022-12-22T21:10:06Z

#103957 finally landed, so from a Miri / MIR semantics perspective this is good to go now.

There are a bunch of review comments above though.

JohnCSimon · 2023-04-30T04:09:55Z

> #103957 finally landed, so from a Miri / MIR semantics perspective this is good to go now.
>
> There are a bunch of review comments above though.

@pcwalton Can you please post your status on this PR? It has sat idle for months.

[rustc_ty_utils] Treat `drop_in_place`'s *mut argument like &mut when adding LLVM attributes

This resurrects PR rust-lang#103614, which has sat idle for a while. This could probably use a new perf run, since we're on a new LLVM version now.

r? `@oli-obk` cc `@RalfJung`

---

LLVM can make use of the `noalias` parameter attribute on the parameter to `drop_in_place` in areas like argument promotion. Because the Rust compiler fully controls the code for `drop_in_place`, it can soundly deduce parameter attributes on it. In rust-lang#103957, Miri was changed to retag `drop_in_place`'s argument as if it was `&mut`, matching this change.

pnkfelix · 2023-05-23T15:07:55Z

This ended up landing via PR #111807 if I understand correctly.

rustbot assigned jackh726 Oct 27, 2022

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Oct 27, 2022

rustbot assigned oli-obk and unassigned jackh726 Oct 27, 2022

oli-obk reviewed Oct 31, 2022

pcwalton marked this pull request as draft November 3, 2022 09:22

pcwalton added 4 commits November 17, 2022 14:00

Fix noalias box test

ecfb332

Update documentation for drop_in_place()

02cfabe

pcwalton force-pushed the drop-in-place-noalias branch from 3d402b0 to 02cfabe Compare November 18, 2022 00:14

pcwalton marked this pull request as ready for review November 18, 2022 00:14

saethlin reviewed Nov 18, 2022

Add missing "unsafe" to fix doctest

53f21aa

rustbot added the S-waiting-on-perf Status: Waiting on a perf run to be completed. label Nov 18, 2022

RalfJung reviewed Nov 18, 2022

rustbot added perf-regression Performance regression. and removed S-waiting-on-perf Status: Waiting on a perf run to be completed. labels Nov 18, 2022

oli-obk reviewed Dec 13, 2022

rustbot added S-waiting-on-author Status: This is awaiting some action (such as code changes or more information) from the author. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Dec 13, 2022

RalfJung mentioned this pull request Dec 21, 2022

Retag as FnEntry on drop_in_place #103957

Merged

erikdesjardins mentioned this pull request May 20, 2023

[rustc_ty_utils] Treat drop_in_place's *mut argument like &mut when adding LLVM attributes #111807

Merged

pnkfelix closed this May 23, 2023
