Optimize struct RMW ops in OptimizeInstructions #7225

tlively · 2025-01-17T01:59:02Z

When the RMW operation can be proven not to change the accessed value,
optimize it to a simple atomic get instead. This is valid because a
write that does not change an in-memory value does not synchronize with
any subsequent reads of that value, since those reads can be considered
to be reading from the previous write.
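
To make "proven not to change the accessed value" concrete, here is a minimal C++ sketch of the value-level condition only. It is not the code from this PR: `RMWOp`, `leavesValueUnchanged`, `applyRMW`, and `identityRMWActsLikeLoad` are hypothetical names, the field is assumed to be a 32-bit integer, and `std::atomic` stands in for a shared struct field. It does not model the memory-model argument above, which is about Wasm rather than C++.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the RMW operators; not Binaryen's own enum.
enum class RMWOp { Add, Sub, And, Or, Xor, Xchg };

// Returns true when applying `op` with the constant `operand` always stores
// back exactly the value that was already in the field.
bool leavesValueUnchanged(RMWOp op, uint32_t operand) {
  switch (op) {
    case RMWOp::Add:
    case RMWOp::Sub:
    case RMWOp::Or:
    case RMWOp::Xor:
      return operand == 0;          // x + 0, x - 0, x | 0, x ^ 0 are all x
    case RMWOp::And:
      return operand == UINT32_MAX; // x & all-ones is x
    case RMWOp::Xchg:
      return false;                 // would require knowing the current value
  }
  return false;
}

// Performs the RMW and returns the old value, as the Wasm instruction does.
uint32_t applyRMW(std::atomic<uint32_t>& field, RMWOp op, uint32_t operand) {
  switch (op) {
    case RMWOp::Add:  return field.fetch_add(operand);
    case RMWOp::Sub:  return field.fetch_sub(operand);
    case RMWOp::And:  return field.fetch_and(operand);
    case RMWOp::Or:   return field.fetch_or(operand);
    case RMWOp::Xor:  return field.fetch_xor(operand);
    case RMWOp::Xchg: return field.exchange(operand);
  }
  return 0;
}

// Single-threaded check: an identity RMW returns what a plain atomic load
// would have returned and leaves the field untouched.
bool identityRMWActsLikeLoad(RMWOp op, uint32_t operand, uint32_t initial) {
  assert(leavesValueUnchanged(op, operand));
  std::atomic<uint32_t> field{initial};
  uint32_t viaRMW = applyRMW(field, op, operand);
  return viaRMW == initial && field.load() == initial;
}
```

For example, `identityRMWActsLikeLoad(RMWOp::Add, 0, 42)` holds, while `leavesValueUnchanged(RMWOp::Xchg, 0)` is false because an exchange only preserves the value if the field already holds the operand.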

Also optimize RMW operations on unshared structs to their non-atomic
equivalent operations. This can increase code size, but can also enable
follow-on optimizations of the simpler operations and can be less
expensive at runtime.
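
The unshared case can be sketched the same way. This is an illustration under assumptions, not the PR's implementation: `rmwAddShared`, `rmwAddUnshared`, and `loweringMatches` are hypothetical names, only the add operator on a 32-bit field is shown, and a plain `uint32_t` stands in for a field of an unshared struct that no other thread can reach.

```cpp
#include <atomic>
#include <cassert>
#include <cstdint>

// Shared case: the update must be one indivisible atomic RMW.
uint32_t rmwAddShared(std::atomic<uint32_t>& field, uint32_t operand) {
  return field.fetch_add(operand, std::memory_order_seq_cst);
}

// Unshared case: the same effect spelled as separate non-atomic steps.
// No other thread can see the state between the get and the set.
uint32_t rmwAddUnshared(uint32_t& field, uint32_t operand) {
  uint32_t old = field;   // plain get
  field = old + operand;  // plain add, then plain set
  return old;             // the RMW result is the old value
}

// Single-threaded check that both forms return the same old value and
// leave the same new value behind.
bool loweringMatches(uint32_t initial, uint32_t operand) {
  std::atomic<uint32_t> shared{initial};
  uint32_t unshared = initial;
  uint32_t a = rmwAddShared(shared, operand);
  uint32_t b = rmwAddUnshared(unshared, operand);
  return a == b && shared.load() == unshared;
}
```

The lowered form is three operations instead of one, which is where the code size increase comes from. Each of those operations is an ordinary get, add, or set that later passes already know how to simplify.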

tlively · 2025-01-17T02:00:11Z

@conrad-watt, could I ask you to double check that these optimizations are valid? In particular, can you confirm that it is valid to replace atomic RMW operations that do not write new values with atomic gets?

kripken · 2025-01-17T03:55:47Z

src/passes/OptimizeInstructions.cpp

+
+    Builder builder(*getModule());
+
+    // Even when the access to shared memory, we can optimize out the modify and


Suggested change

-    // Even when the access to shared memory, we can optimize out the modify and
+    // Even when we access shared memory, we can optimize out the modify and

kripken · 2025-01-17T03:58:59Z

src/passes/OptimizeInstructions.cpp

+    // operation into several non-atomic operations is safe because no other
+    // thread can observe an intermediate state in the unshared memory. This
+    // initially increases code size, but the more basic operations may be
+    // more optimizable than the original RMW.


How likely are we to succeed? I am somewhat worried that the size increase here will generally stick around, and that the result might also be slower.

tlively

I'm not sure! We can experiment with it once we have real test cases. I figure it will be easier to experimentally disable this optimization than it would be to write the optimization just for an experiment, though. I could imagine that in the long run we would want to do this only when optimizing for speed over size.

kripken · 2025-01-17T16:28:54Z

Ok, sounds good. Perhaps add a TODO for that?

tlively requested a review from kripken January 17, 2025 01:59

kripken reviewed Jan 17, 2025

kripken approved these changes Jan 17, 2025

update comments

cd23371

tlively enabled auto-merge (squash) January 17, 2025 18:37

tlively merged commit 8623f73 into main Jan 17, 2025
13 checks passed

tlively deleted the optimize-struct-rmw branch January 17, 2025 19:50
