
[Codegen] Allow padding of dynamic allocas #19399

Merged: 2 commits into iree-org:main on Dec 13, 2024

Conversation

@Max191 Max191 (Contributor) commented Dec 6, 2024

This PR adds support for padding allocas in the PadDynamicAllocsPass. The padding works the same way for alloca as for alloc.
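For context, "padding" here conceptually means rounding a dynamic allocation size up to a chosen multiple, allocating the slightly larger buffer, and then viewing the original extent out of it. Below is a minimal sketch of the size round-up only; `getPaddedSize` and `paddingMultiple` are illustrative names, not the pass's actual API.

```cpp
#include <cstdint>

// Round a dynamic dimension size up to the next multiple of
// paddingMultiple. The padded buffer is allocated with this size, and the
// original extent is recovered through a subview of the padded allocation.
int64_t getPaddedSize(int64_t dynamicSize, int64_t paddingMultiple) {
  return ((dynamicSize + paddingMultiple - 1) / paddingMultiple) *
         paddingMultiple;
}
```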

@hanhanW hanhanW (Contributor) left a comment


Drive-by: Instead of using template, would using AllocLikeOp make sense?

(I'm not asking for a change. It is just a question.)

https://github.com/llvm/llvm-project/blob/12bdeba76eef1c7adf004a280036a7fb690ba573/mlir/include/mlir/Dialect/MemRef/IR/MemRefOps.td#L57-L67

@Max191 Max191 (Contributor, Author) commented Dec 6, 2024

Drive-by: Instead of using template, would using AllocLikeOp make sense?

(I'm not asking for a change. It is just a question.)

https://github.com/llvm/llvm-project/blob/12bdeba76eef1c7adf004a280036a7fb690ba573/mlir/include/mlir/Dialect/MemRef/IR/MemRefOps.td#L57-L67

Ah, I didn't know about this. That looks a bit better to me, thanks!

@Max191 Max191 (Contributor, Author) commented Dec 6, 2024

Drive-by: Instead of using template, would using AllocLikeOp make sense?

Hmm, actually I think AllocLikeOp is just a tablegen class. I don't think I can access it in C++. I'll update the template typename to be more consistent with other code, though.

@Max191 Max191 force-pushed the pad-dynamic-allocas branch from 4ebd47f to 1d704a2 on December 6, 2024 19:07
@nirvedhmeshram nirvedhmeshram (Contributor) left a comment


LGTM

@hanhanW hanhanW (Contributor) commented Dec 6, 2024

Drive-by: Instead of using template, would using AllocLikeOp make sense?

Hmm, actually I think AllocLikeOp is just a tablegen class. I don't think I can access it in C++. I'll update the template typename to be more consistent with other code, though.

Chatted with Max offline. Max is right, please ignore my comment.
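For readers following the thread: a minimal sketch of the templated approach discussed above, assuming a single OpRewritePattern instantiated for both memref.alloc and memref.alloca. The pattern and helper names here are illustrative; the actual IREE code may differ.

```cpp
#include "mlir/Dialect/MemRef/IR/MemRef.h"
#include "mlir/IR/PatternMatch.h"

namespace {
// One pattern handles both alloc-like ops via a template parameter.
template <typename AllocLikeOpTy>
struct PadDynamicAllocLikeOp : public mlir::OpRewritePattern<AllocLikeOpTy> {
  using mlir::OpRewritePattern<AllocLikeOpTy>::OpRewritePattern;

  mlir::LogicalResult
  matchAndRewrite(AllocLikeOpTy allocLikeOp,
                  mlir::PatternRewriter &rewriter) const override {
    // Padding logic elided: round dynamic sizes up, create a padded
    // allocation, and replace uses with a subview of the original shape.
    return mlir::failure();
  }
};
} // namespace

// When populating the pass's patterns, instantiate for both ops, e.g.:
//   patterns.add<PadDynamicAllocLikeOp<mlir::memref::AllocOp>,
//                PadDynamicAllocLikeOp<mlir::memref::AllocaOp>>(context);
```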

@Max191 Max191 force-pushed the pad-dynamic-allocas branch from 1d704a2 to 03d0870 on December 13, 2024 16:48
@Max191 Max191 enabled auto-merge (squash) December 13, 2024 16:49
@Max191 Max191 merged commit 99b600f into iree-org:main Dec 13, 2024
39 checks passed
nirvedhmeshram added a commit that referenced this pull request Dec 18, 2024
… shapes (#19484)

This PR does two things:
1. Allow all GEMM shapes to use the padded TileAndFuse Matmul configuration. This is still behind the `iree-codegen-llvmgpu-test-tile-and-fuse-matmul=false` flag by default and does not change the default behavior. However, the following PRs that have landed in the past month make it possible to relax the guards we originally had on this:
   #19196
   #19307
   llvm/llvm-project#117340
2. Allow fused producers to use the padded TileAndFuse Matmul configuration. The following PRs make this possible now:
   #19399
   llvm/llvm-project#119039

Together, this allows us to do padded IGEMM with intrinsics for shapes unaligned to the intrinsic, which we use by default.
[Here](https://docs.google.com/spreadsheets/d/1O-SdUZCn5pHsxx7JTGjIIdH6PWCFnvlfe4XBbjEBaIM/edit?gid=0#gid=0)
is the performance difference observed in conv cases in
iree-kernel-benchmark-module that utilize this change. A median speedup
of 2.26x was observed.

The numeric changes I observed from enabling this path were the same for
any aligned shape when comparing intrinsic vs. no-intrinsic use.
Some differences are generally noticed for narrow types like f16, but
they are within a relative error of 0.001; since our tests use absolute
errors, we may have to change some test values to account for this change.

The perf difference in CI seems to be within the noise margin compared to main:
https://github.com/iree-org/iree/actions/runs/12323399269/attempts/1#summary-34399247902

---------

Signed-off-by: Nirvedh <nirvedh@gmail.com>