`assume` value ranges in `transmute` #109993

scottmcm · 2023-04-06T07:39:41Z

Fixes #109958

rustbot · 2023-04-06T07:39:47Z

r? @lcnr

(rustbot has picked a reviewer for you, use r? to override)

scottmcm · 2023-04-06T09:06:30Z

@bors try @rust-timer queue

bors · 2023-04-06T09:06:39Z

⌛ Trying commit c7d3c9f10a4c1b95a290936fc1acd585e42f1f5e with merge b359f2b4f1c26bbaf475b4f8cdaa87a57a4f3d82...

bors · 2023-04-06T10:50:36Z

☀️ Try build successful - checks-actions
Build commit: b359f2b4f1c26bbaf475b4f8cdaa87a57a4f3d82 (b359f2b4f1c26bbaf475b4f8cdaa87a57a4f3d82)

rust-timer · 2023-04-06T12:27:24Z

Finished benchmarking commit (b359f2b4f1c26bbaf475b4f8cdaa87a57a4f3d82): comparison URL.

Overall result: no relevant changes - no action needed

Benchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf.

@bors rollup=never
@rustbot label: -S-waiting-on-perf -perf-regression

Instruction count

This benchmark run did not return any relevant results for this metric.

Max RSS (memory usage)

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | 1.6% | [1.6%, 1.6%] | 1 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | -2.0% | [-2.7%, -1.3%] | 2 |
| All ❌✅ (primary) | - | - | 0 |

Cycles

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | - | - | 0 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | -3.9% | [-4.0%, -3.8%] | 3 |
| All ❌✅ (primary) | - | - | 0 |

oli-obk · 2023-04-06T18:45:41Z

For `as` casts we did this in MIR. Can this be done here, too? Ideally sharing the code?

lcnr · 2023-04-10T06:43:37Z

r? @oli-obk

scottmcm · 2023-04-10T18:23:46Z

@oli-obk I think it depends if it's supposed to work on generics or not.

If something like #106281 (comment) happens, then at MIR level we wouldn't reliably know what the type actually is to add these -- MIR could just see type parameters.

oli-obk · 2023-04-11T15:26:08Z

Duh... yea, the enum to int casts are not possible on generics, so we never had an issue with that there.
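
To illustrate the point about generics, here is a minimal sketch (not taken from the PR). The `Level` enum and the function names are hypothetical. An enum-to-int `as` cast only type-checks on a concrete enum, so a MIR-level pass always knows the layout it is dealing with in that case.

```rust
// Hypothetical enum, used only for illustration.
#[derive(Clone, Copy)]
#[repr(u8)]
pub enum Level {
    Low = 1,
    Mid = 2,
    High = 3,
}

// Fine: the source type is concrete, so the valid discriminant range
// (1..=3 here) is known wherever this cast appears.
pub fn level_as_u8(l: Level) -> u8 {
    l as u8
}

// The generic version does not compile (it is rejected as a
// non-primitive cast), so the `as` case never has to handle a bare `T`:
//
// pub fn generic_as_u8<T>(t: T) -> u8 { t as u8 }
```

A transmute has no such restriction in principle, which is why the concern about MIR only seeing type parameters applies to it and not to `as`.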

compiler/rustc_codegen_ssa/src/mir/rvalue.rs

Fixes rust-lang#109958

scottmcm · 2023-04-20T06:22:48Z

Finally got back to this.

I've added a bunch of codegen tests to demonstrate that this works as expected, as well as some clarification comments.

@rustbot ready

oli-obk · 2023-04-20T06:44:33Z

@bors r+

bors · 2023-04-20T06:44:35Z

📌 Commit baf98e7 has been approved by oli-obk

It is now in the queue for this repository.

bors · 2023-04-20T10:46:17Z

⌛ Testing commit baf98e7 with merge 7e23d18...

bors · 2023-04-20T13:03:24Z

☀️ Test successful - checks-actions
Approved by: oli-obk
Pushing 7e23d18 to master...

rust-timer · 2023-04-20T14:21:19Z

Finished benchmarking commit (7e23d18): comparison URL.

Overall result: ✅ improvements - no action needed

@rustbot label: -perf-regression

Instruction count

This is a highly reliable metric that was used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | - | - | 0 |
| Regressions ❌ (secondary) | - | - | 0 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | -1.8% | [-1.8%, -1.8%] | 1 |
| All ❌✅ (primary) | - | - | 0 |

Max RSS (memory usage)

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

| | mean | range | count |
|---|---|---|---|
| Regressions ❌ (primary) | 3.5% | [3.5%, 3.5%] | 1 |
| Regressions ❌ (secondary) | - | - | 0 |
| Improvements ✅ (primary) | - | - | 0 |
| Improvements ✅ (secondary) | -4.1% | [-4.1%, -4.1%] | 1 |
| All ❌✅ (primary) | 3.5% | [3.5%, 3.5%] | 1 |

Cycles

This benchmark run did not return any relevant results for this metric.

Stop turning transmutes into discriminant reads in mir-opt

Partially reverts rust-lang#109612, as after rust-lang#109993 these aren't actually equivalent any more, and I'm no longer confident this was ever an improvement in the first place.

Having this "simplification" meant that similar-looking code actually did somewhat different things. For example,

```rust
pub unsafe fn demo1(x: std::cmp::Ordering) -> u8 { std::mem::transmute(x) }
pub unsafe fn demo2(x: std::cmp::Ordering) -> i8 { std::mem::transmute(x) }
```

in nightly today is generating <https://rust.godbolt.org/z/dPK58zW18>

```llvm
define noundef i8 @_ZN7example5demo117h341ef313673d2ee6E(i8 noundef %x) unnamed_addr #0 {
  %0 = icmp uge i8 %x, -1
  %1 = icmp ule i8 %x, 1
  %2 = or i1 %0, %1
  call void @llvm.assume(i1 %2)
  ret i8 %x
}

define noundef i8 @_ZN7example5demo217h5ad29f361a3f5700E(i8 noundef %0) unnamed_addr #0 {
  %x = alloca i8, align 1
  store i8 %0, ptr %x, align 1
  %1 = load i8, ptr %x, align 1, !range !2, !noundef !3
  ret i8 %1
}
```

Which feels too different when the original code is essentially identical.

---

Aside: that example is different *after* optimizations too:

```llvm
define noundef i8 @_ZN7example5demo117h341ef313673d2ee6E(i8 noundef returned %x) unnamed_addr #0 {
  %0 = add i8 %x, 1
  %1 = icmp ult i8 %0, 3
  tail call void @llvm.assume(i1 %1)
  ret i8 %x
}

define noundef i8 @_ZN7example5demo217h5ad29f361a3f5700E(i8 noundef returned %0) unnamed_addr #1 {
  ret i8 %0
}
```

so turning the `Transmute` into a `Discriminant` was arguably just making things worse, so leaving it alone instead -- and thus having less code in rustc -- seems clearly better.
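
For readers who want to see what the range facts in that commit message mean at the Rust level, here is a minimal sketch. It is not part of either PR. It assumes `Ordering`'s documented discriminants (-1, 0, 1), and the helper names are hypothetical.

```rust
use std::cmp::Ordering;
use std::mem::transmute;

/// Hypothetical helper: reinterpret an `Ordering` as an unsigned byte.
pub fn ordering_to_u8(x: Ordering) -> u8 {
    // SAFETY: `Ordering` is one byte wide and every bit pattern is a valid `u8`.
    unsafe { transmute(x) }
}

/// Hypothetical helper: reinterpret an `Ordering` as a signed byte.
pub fn ordering_to_i8(x: Ordering) -> i8 {
    // SAFETY: same size, and every bit pattern is a valid `i8`.
    unsafe { transmute(x) }
}

/// Mirrors the optimized check quoted above (`add i8 %x, 1` followed by
/// `icmp ult i8 %0, 3`): true exactly for the bytes 255, 0 and 1.
pub fn in_ordering_range(byte: u8) -> bool {
    byte.wrapping_add(1) < 3
}
```

Viewed as unsigned bytes, the valid values are 255, 0 and 1, a range that wraps around. That is why the unoptimized `demo1` output needs an `or` of two comparisons.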

Update our range `assume`s to the format that LLVM prefers

I found out in llvm/llvm-project#123278 (comment) that the way I started emitting the `assume`s in rust-lang#109993 was suboptimal, and as seen in that LLVM issue the way we're doing it -- with two `assume`s sometimes -- can at times lead to CVP/SCCP not realizing what's happening because one of them turns into a `ne` instead of conveying a range.

So this updates how it's emitted from

```
assume( x >= LOW );
assume( x <= HIGH );
```

or

```
// (for ranges that wrap the range)
assume( (x <= LOW) | (x >= HIGH) );
```

to

```
assume( (x - LOW) <= (HIGH - LOW) );
```

so that we don't need multiple `icmp`s nor multiple `assume`s for a single value, and both wrapping and non-wrapping ranges emit the same shape.

(And we don't bother emitting the subtraction if `LOW` is zero, since that's trivial for us to check too.)
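
The single-comparison form can be sanity-checked in plain Rust. The sketch below is not compiler code. It assumes unsigned wrapping arithmetic on `u8`, and it assumes `LOW` is the start and `HIGH` the end of an inclusive range, so a wrapping range has `LOW > HIGH` numerically. Both function names are hypothetical.

```rust
/// Reference semantics under the assumption above: an inclusive range
/// from `low` to `high` that may wrap around the top of the `u8` domain.
pub fn in_range_reference(x: u8, low: u8, high: u8) -> bool {
    if low <= high {
        // Non-wrapping: both bounds must hold.
        low <= x && x <= high
    } else {
        // Wrapping: the valid values are the two ends of the domain.
        x >= low || x <= high
    }
}

/// The unified shape: one wrapping subtraction per side and a single
/// unsigned comparison, for wrapping and non-wrapping ranges alike.
pub fn in_range_unified(x: u8, low: u8, high: u8) -> bool {
    x.wrapping_sub(low) <= high.wrapping_sub(low)
}
```

With `low == 0` the subtraction on `x` is a no-op, which matches the remark about skipping it in that case.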

rustbot assigned lcnr Apr 6, 2023

rustbot added the S-waiting-on-review (Status: Awaiting review from the assignee but also interested parties.) and T-compiler (Relevant to the compiler team, which will review and decide on the PR/issue.) labels Apr 6, 2023

rustbot added the S-waiting-on-perf (Status: Waiting on a perf run to be completed.) label Apr 6, 2023

rustbot removed the S-waiting-on-perf (Status: Waiting on a perf run to be completed.) label Apr 6, 2023

rustbot assigned oli-obk and unassigned lcnr Apr 10, 2023

oli-obk reviewed Apr 11, 2023

rustbot added the S-waiting-on-author (Status: This is awaiting some action (such as code changes or more information) from the author.) label and removed the S-waiting-on-review (Status: Awaiting review from the assignee but also interested parties.) label Apr 11, 2023

scottmcm force-pushed the transmute-niches branch from c7d3c9f to e11d98b Compare April 13, 2023 06:28

assume value ranges in transmute

1bcb0ec

Fixes rust-lang#109958

scottmcm force-pushed the transmute-niches branch from e11d98b to 1bcb0ec Compare April 13, 2023 07:13

Add transmute optimization tests and some extra comments

baf98e7

scottmcm force-pushed the transmute-niches branch from 197ccc2 to baf98e7 Compare April 20, 2023 06:17

rustbot added the S-waiting-on-review (Status: Awaiting review from the assignee but also interested parties.) label and removed the S-waiting-on-author (Status: This is awaiting some action (such as code changes or more information) from the author.) label Apr 20, 2023

bors added the S-waiting-on-bors (Status: Waiting on bors to run and complete tests. Bors will change the label on completion.) label and removed the S-waiting-on-review (Status: Awaiting review from the assignee but also interested parties.) label Apr 20, 2023

bors added the merged-by-bors (This PR was explicitly merged by bors.) label Apr 20, 2023

bors merged commit 7e23d18 into rust-lang:master Apr 20, 2023

rustbot added this to the 1.71.0 milestone Apr 20, 2023

scottmcm deleted the transmute-niches branch April 20, 2023 17:37

scottmcm mentioned this pull request May 14, 2023

Stop turning transmutes into discriminant reads in mir-opt #111568

Merged

saethlin mentioned this pull request Jul 12, 2024

Don't emit expect/assume in opt-level=0 #121614

Merged

scottmcm mentioned this pull request Jan 18, 2025

Update our range assumes to the format that LLVM prefers #135674

Merged
