Add `assume`s to slice length calls #122926

scottmcm · 2024-03-23T05:45:15Z

Since `.len()` on slices is safe, let's see how this impacts things vs what we could do with #121965 and LLVM 19
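As a rough illustration of what an `assume` on a slice length expresses, here is a library-level sketch. It is not what this PR does (the PR emits the assumption during codegen, in `rustc_codegen_ssa`). The helper name `len_with_assume` is hypothetical, and `std::hint::unreachable_unchecked` stands in for LLVM's `assume` intrinsic.

```rust
use std::mem::size_of;

/// Hypothetical helper: returns `s.len()` while telling the optimizer the
/// upper bound that every valid slice reference already satisfies.
fn len_with_assume<T>(s: &[T]) -> usize {
    let len = s.len();
    // Zero-sized element types have no byte-size limit, so skip them.
    if size_of::<T>() != 0 {
        let max_len = isize::MAX as usize / size_of::<T>();
        if len > max_len {
            // SAFETY: a slice reference never spans more than `isize::MAX`
            // bytes, so this branch cannot be reached for a valid `&[T]`.
            unsafe { std::hint::unreachable_unchecked() }
        }
    }
    len
}
```

The function is safe to call because the bound is an invariant of `&[T]` itself, which is the point of the "`.len()` on slices is safe" remark above.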

rustbot · 2024-03-23T05:45:23Z

rustbot has assigned @wesleywiser.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

scottmcm · 2024-03-23T06:34:02Z

tests/codegen/slice-length-limits.rs

+    // CHECK: %[[J:.+]] = phi [[USIZE]]
+    // CHECK: %[[I:.+]] = phi [[USIZE]]
+    // CHECK-NOT: phi
+    // CHECK: add nuw nsw [[USIZE]] %[[I]], %[[J]]


Demo that this is neither nuw nor nsw today: https://rust.godbolt.org/z/ne1bMPqv9

%_16.us = add i64 %iter2.sroa.0.09.us, %iter.sroa.0.011.us, !dbg !56
tail call void @do_something(i64 noundef %_16.us), !dbg !58
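The source behind the demo is not shown in this thread, so the following is only a guess at the loop shape: two nested index loops whose indices are added. The function names and the `u16` element type are assumptions. The second helper spells out the arithmetic behind the `nuw nsw` expectation: the largest possible `i + j` when both slices are as long as the byte-size limit allows.

```rust
/// Hypothetical reconstruction: collect `i + j` for every pair of indices.
/// The demo calls an external `do_something(i + j)`; a Vec is used here so
/// the sketch is self-contained.
fn index_sums(a: &[u16], b: &[u16]) -> Vec<usize> {
    let mut sums = Vec::with_capacity(a.len() * b.len());
    for i in 0..a.len() {
        for j in 0..b.len() {
            sums.push(i + j);
        }
    }
    sums
}

/// Largest value `i + j` could take for elements of `elem_size` bytes
/// (`elem_size` must be non-zero), or `None` if it would wrap a `usize`.
fn max_index_sum(elem_size: usize) -> Option<usize> {
    // A slice spans at most `isize::MAX` bytes, which bounds its length.
    let max_len = isize::MAX as usize / elem_size;
    // The largest valid index is one less than the length.
    (max_len - 1).checked_add(max_len - 1)
}
```

Under these assumptions, `max_index_sum(2)` stays at or below `isize::MAX`, so the addition can wrap neither as unsigned nor as signed. For one-byte elements the sum still fits in a `usize` but can exceed `isize::MAX`, so only the unsigned guarantee would hold there.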

scottmcm · 2024-03-23T06:42:29Z

compiler/rustc_codegen_ssa/src/mir/rvalue.rs

+            if let Some(elem_bytes) = std::num::NonZeroU64::new(elem_ty.size.bytes()) {
+                let isize_max = (1_u64 << (bx.sess().target.pointer_width - 1)) - 1;
+                let len_max = isize_max / elem_bytes;
+                let limit = bx.icmp(IntPredicate::IntULE, length, bx.const_usize(len_max));


I first tried doing this with assume(sge(len * elem_bytes, 0)), but got worse results -- I think LLVM is confused by the SGE limit compared with other positive things, even though it actually simplifies ule(x, isize::MAX) into sgt(x, -1).
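The bound computed in the diff above can be reproduced outside the compiler. This is a minimal sketch that mirrors the arithmetic only. The function names are hypothetical, `pointer_width` is assumed to be 16, 32, or 64, and the two comparison helpers model the `ule(x, isize::MAX)` and `sgt(x, -1)` forms on a 64-bit target.

```rust
use std::num::NonZeroU64;

/// Largest slice length whose total byte size fits in the target's
/// `isize::MAX`. Returns `None` for zero-sized elements (no limit applies).
fn max_slice_len(pointer_width: u32, elem_size_bytes: u64) -> Option<u64> {
    let elem_bytes = NonZeroU64::new(elem_size_bytes)?;
    // isize::MAX for the target: all bits set except the sign bit.
    let isize_max = (1_u64 << (pointer_width - 1)) - 1;
    // Dividing by a NonZeroU64 cannot panic.
    Some(isize_max / elem_bytes)
}

/// Unsigned form of the check: `x <= isize::MAX`.
fn ule_isize_max(x: u64) -> bool {
    x <= i64::MAX as u64
}

/// Signed form of the same check: reinterpret the bits and test `x > -1`.
fn sgt_minus_one(x: u64) -> bool {
    (x as i64) > -1
}
```

Both comparison helpers test whether the top bit is clear, which is why LLVM can rewrite one into the other.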

scottmcm · 2024-03-23T06:47:49Z

@bors try @rust-timer queue

bors · 2024-03-23T06:49:00Z

⌛ Trying commit eed1ccd with merge b6cd5cf...

bors · 2024-03-23T08:28:54Z

☀️ Try build successful - checks-actions
Build commit: b6cd5cf (b6cd5cfaca9486a70466c5c6743d6494d4d04104)

rust-timer · 2024-03-23T09:43:34Z

Finished benchmarking commit (b6cd5cf): comparison URL.

Overall result: ❌✅ regressions and improvements - ACTION NEEDED

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @rustbot label: +perf-regression-triaged along with sufficient written justification. If you cannot justify the regressions please fix the regressions and do another perf run. If the next run shows neutral or positive results, the label will be automatically removed.

@bors rollup=never
@rustbot label: -S-waiting-on-perf +perf-regression

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | 0.6% | [0.2%, 2.5%] | 111 |
| Regressions ❌ (secondary) | 0.5% | [0.2%, 2.5%] | 40 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | -0.5% | [-0.7%, -0.3%] | 12 |
| All ❌✅ (primary) | 0.6% | [0.2%, 2.5%] | 111 |

Max RSS (memory usage)

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | 3.3% | [3.3%, 3.3%] | 1 |
| Regressions ❌ (secondary) | - | - | 0 |
| Improvements ✅ (primary) | -4.3% | [-4.3%, -4.3%] | 1 |
| Improvements ✅ (secondary) | - | - | 0 |
| All ❌✅ (primary) | -0.5% | [-4.3%, 3.3%] | 2 |

Cycles

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | 1.4% | [0.8%, 2.6%] | 26 |
| Regressions ❌ (secondary) | 2.1% | [1.6%, 2.6%] | 2 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | -1.0% | [-1.0%, -1.0%] | 1 |
| All ❌✅ (primary) | 1.4% | [0.8%, 2.6%] | 26 |

Binary size

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | 0.0% | [0.0%, 0.0%] | 4 |
| Regressions ❌ (secondary) | 0.0% | [0.0%, 0.0%] | 11 |
| Improvements ✅ (primary) | -0.2% | [-0.3%, -0.2%] | 8 |
| Improvements ✅ (secondary) | -0.6% | [-0.6%, -0.6%] | 3 |
| All ❌✅ (primary) | -0.1% | [-0.3%, 0.0%] | 12 |

Bootstrap: 669.588s -> 679.345s (1.46%)
Artifact size: 315.05 MiB -> 312.86 MiB (-0.70%)

saethlin · 2024-03-24T00:56:52Z

Wow, it looks like that knocked a fair chunk of code out of librustc_driver.so.
The compile-time cost looks a bit steep, but this is clearly doing something.

the8472 · 2024-03-24T12:33:25Z

See also #116542, which will add range metadata and niches to the slice length. If I can get it to work.

scottmcm · 2024-04-04T05:40:32Z

I think I'll close it in this form, though -- the perf costs are pretty high.

Hopefully #121965 lets this just be parameter metadata to get more benefits at lower cost.

rustbot assigned wesleywiser Mar 23, 2024

rustbot added the S-waiting-on-review and T-compiler labels Mar 23, 2024

scottmcm force-pushed the assume-lengths branch from 91a7e3f to 772322d Compare March 23, 2024 06:27

scottmcm force-pushed the assume-lengths branch from 772322d to eed1ccd Compare March 23, 2024 06:37

rustbot added the S-waiting-on-perf label Mar 23, 2024

rustbot added the perf-regression label and removed the S-waiting-on-perf label Mar 23, 2024

scottmcm mentioned this pull request Mar 24, 2024 in: Elaborate on the invariants for references-to-slices #121965 (Open)

scottmcm closed this Apr 4, 2024

scottmcm mentioned this pull request Apr 6, 2024 in: nuw nsw not deduced for add 1 inbounds of range-restricted length llvm/llvm-project#87854 (Closed)

scottmcm mentioned this pull request Aug 1, 2024 in: Add range attribute to scalar function results and arguments #128371 (Merged)
