[DT] Unify encoding materialization pass into a single pass. #19454

hanhanW · 2024-12-11T11:33:09Z

The revision creates a generic materialization pass and uses it for backends that implement data-tiling. After months of development, we identify that the needs of GPU is a superset of the needs of CPU. To be more specific, it has the additional "swizzle" field in terms of layout. It means that the GPU set_encoding/unset_encoding lowering patterns cover the needs of CPU path. The lowering of contraction ops is different. CPU lowers it to mmt4d ops, while GPU lowers it to multi_mma op. However, the lowering of contraction is implemented through attribute interface. Thus, we can have a generic pattern to lower contraction ops.

To make the review process much easier, the revision is created by 5 commits.

It directly creates the MaterializeEncoding pass and copy-paste the GPU patterns: SetEncodingOpLoweringConversion, UnSetEncodingOpLoweringConversion, and MaterializeContractionOp. In the first commit, it also updates the GPU tests to use the new pass.
The GPU data-tiling does not support element-wise generic op lowering atm. The second commit moves the pattern to shared pattern set and bail out when swizzle is present. This is an NFC for both pipelines.
The third commit replaces the existing materialization pass with the generic pass, and deletes all the legacy passes.
The four commit moves the lit tests from Common/[CPU|GPU]/test to Common/test.
Now there are duplicate patterns for set_encoding, unset_encoding, and contraction ops lowering. The last commit deletes the legacy patterns, and move the patterns from MaterializeEncoding.cpp to where the legacy patterns locate. Furthermore, it renames the file as MaterializeEncodingPatterns.cpp.

The revision retains the MaterializeEncodingIntoNop pass, and add a TODO item. Because it is still used by MaterializeHomogeneousEncoding pass. It can be deleted once we deprecate the early materialization path.

hanhanW · 2024-12-11T12:43:45Z

This depends on #19452 and #19453. I push the parent commits to an upstream branch, so it is ready for review.

bjacob

Very nice reorganization!

bjacob · 2024-12-11T14:35:39Z

compiler/src/iree/compiler/Codegen/Utils/Utils.cpp

@@ -161,6 +161,11 @@ const char *getIreeArchNameForTargetTriple(llvm::Triple triple) {
  return "unknown";
 }

+bool isLLVMCPUBackend(IREE::HAL::ExecutableTargetAttr targetAttr) {
+  return targetAttr &&
+         targetAttr.getBackend().getValue().starts_with("llvm-cpu");


Can't you just check equality ? Why starts_with? If that was copying the vmvx case below, I might have done that out of ignorance in the past.

Oh, yes. Thanks for pointing it out! I'm mainly following the other cases, but can definitely check equality.

Signed-off-by: hanhanW <hanhan0912@gmail.com>

also update cpu targets in llvmcpu_materialize_encoding.mlir Signed-off-by: hanhanW <hanhan0912@gmail.com>

Signed-off-by: hanhanW <hanhan0912@gmail.com>

…rns.cpp Signed-off-by: hanhanW <hanhan0912@gmail.com>

Signed-off-by: hanhanW <hanhan0912@gmail.com>

hanhanW changed the base branch from main to users/hanhanW/data-tiling-cleanups-narrow-n December 11, 2024 11:34

hanhanW marked this pull request as ready for review December 11, 2024 12:44

hanhanW requested review from MaheshRavishankar, pashu123, antiagainst, qedawkins and benvanik as code owners December 11, 2024 12:44

hanhanW requested review from bjacob and Max191 and removed request for benvanik, antiagainst and qedawkins December 11, 2024 12:44

bjacob approved these changes Dec 11, 2024

View reviewed changes

hanhanW mentioned this pull request Dec 16, 2024

[DT][NFC] Internalize transposeNarrowN logic to LayoutAttrInterface Impl #19453

Merged

hanhanW added 6 commits December 15, 2024 21:06

Create MaterializeEncoding pass and use it for GPU tests.

71ec22f

Signed-off-by: hanhanW <hanhan0912@gmail.com>

Move MaterializeDPSOperation<linalg::GenericOp> to shared patterns

a149981

also update cpu targets in llvmcpu_materialize_encoding.mlir Signed-off-by: hanhanW <hanhan0912@gmail.com>

Switch to generic materialization encoding pass + retire legacy passes

bc2d91f

Signed-off-by: hanhanW <hanhan0912@gmail.com>

Move tests to Common/test/

f6190d3

Signed-off-by: hanhanW <hanhan0912@gmail.com>

Refactor dup patterns and rename the file as MaterializeEncodingPatte…

5b28a82

…rns.cpp Signed-off-by: hanhanW <hanhan0912@gmail.com>

Address comments

0f6ac93

Signed-off-by: hanhanW <hanhan0912@gmail.com>

hanhanW force-pushed the rework-materialize-encoding branch from 9005f5a to 0f6ac93 Compare December 16, 2024 06:08

hanhanW requested review from ScottTodd, nithinsubbiah, kuhar, IanWood1, Groverkss and stellaraccident as code owners December 16, 2024 06:08

hanhanW changed the base branch from users/hanhanW/data-tiling-cleanups-narrow-n to main December 16, 2024 06:08

hanhanW removed the request for review from ScottTodd December 16, 2024 06:09

hanhanW removed request for nithinsubbiah, kuhar, IanWood1, Groverkss and stellaraccident December 16, 2024 06:09

poke CI

14a6ab0

Signed-off-by: hanhanW <hanhan0912@gmail.com>

hanhanW enabled auto-merge (squash) December 16, 2024 06:47

hanhanW added 2 commits December 15, 2024 23:53

Fix dep -- ukernel lowering still uses the EncodingAttr

2aecf1f

Signed-off-by: hanhanW <hanhan0912@gmail.com>

add deps for CPU/GPU dialect

5224968

Signed-off-by: hanhanW <hanhan0912@gmail.com>

hanhanW force-pushed the rework-materialize-encoding branch from 1a13342 to 5224968 Compare December 16, 2024 09:51

hanhanW merged commit 05ce39f into iree-org:main Dec 16, 2024
38 checks passed

hanhanW deleted the rework-materialize-encoding branch December 16, 2024 10:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[DT] Unify encoding materialization pass into a single pass. #19454

[DT] Unify encoding materialization pass into a single pass. #19454

hanhanW commented Dec 11, 2024 •

edited

Loading

hanhanW commented Dec 11, 2024 •

edited

Loading

bjacob left a comment

bjacob Dec 11, 2024

hanhanW Dec 11, 2024

[DT] Unify encoding materialization pass into a single pass. #19454

[DT] Unify encoding materialization pass into a single pass. #19454

Conversation

hanhanW commented Dec 11, 2024 • edited Loading

hanhanW commented Dec 11, 2024 • edited Loading

bjacob left a comment

Choose a reason for hiding this comment

bjacob Dec 11, 2024

Choose a reason for hiding this comment

hanhanW Dec 11, 2024

Choose a reason for hiding this comment

hanhanW commented Dec 11, 2024 •

edited

Loading

hanhanW commented Dec 11, 2024 •

edited

Loading