feat: support lowering of channelwise quantization to linalg #3
base: bartel/roo-62-fix-channelwise-quantization-in-torch-mlir-for-qlinearconv
Conversation
Warning: This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite. This stack of pull requests is managed by Graphite.
QuantizationValues getQuantizationPerTensorValues(
    ConversionPatternRewriter &rewriter, Location loc,
    Aten_MakePerTensorQuantizedTensorOp makePerTensorQuantizedTensorOp,
    const TypeConverter *const typeConverter) {
nit: make this a reference, as we assume the pointer is valid throughout the function.
      zeroPoint);

  // create a linalg op since we need to do some arithmetic on the zero point
  // but is it a tensor.
Suggested change:
- // but is it a tensor.
+ // as it is a tensor.
  // create a linalg op since we need to do some arithmetic on the zero point
  // but is it a tensor.
  RankedTensorType zeroPointType = cast<RankedTensorType>(zeroPoint.getType());
nitty-gritty nit: use auto.
Looks ok to me. Maybe in the future we could break this PR up even further. To me there were three concepts here: quantized per-channel conv, handling that case with a transpose, and a group conv implementation.
Overall an improvement, but yeah, we will have to revisit this one day.
  zeroPoint = torch_to_linalg::createElementwiseLinalgGeneric(
      rewriter, loc, zeroPoint, rewriter.getI32Type(),
      [&](OpBuilder &b, Location loc, ValueRange payloadArgs) {
        Value result = rewriter.create<arith::ExtUIOp>(
Suggested change:
- Value result = rewriter.create<arith::ExtUIOp>(
+ Value result = rewriter.create<arith::ExtIOp>(
Since we assume it is an integer?
    ConversionPatternRewriter &rewriter, Location loc,
    Aten_MakePerChannelQuantizedTensorOp makePerChannelQuantizedTensorOp,
    const TypeConverter *const typeConverter) {
  QuantizationValues values;
nit: move the definition down to where the members are being set at the end of the function.
      convolutionAttributes.outputPadding[i]));

  // Set stride to 1
  convolutionAttributes.stride.clear();
Why is the stride cleared and simply set to 1?
It is handled by the InsertSliceOp in line 1050. Not sure if this is the most performant way, but I basically just copied code and moved it into a function.
Yeah, I only added the channel-wise case and the rest was refactoring. I think it would have been easy with Graphite; I will do it next time. Sorry!