[WebNN] QDQ's axis should be used for broadcasting #22721

Honry · 2024-11-05T01:14:40Z

For per-axis quantization/dequantization, WebNN requires the scale and zero_point inputs to be broadcastable. Axis should be used for reshape these two inputs.

Honry · 2024-11-05T01:15:03Z

@fdwr, @guschmue, PTAL, thanks!

For per-axis quantization/dequantization, WebNN requires the scale and zero_point inputs to be broadcastable. Axis should be used for reshape these two inputs.

guschmue · 2024-11-06T17:05:06Z

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

guschmue · 2024-11-06T17:05:13Z

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

guschmue · 2024-11-06T17:05:20Z

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

azure-pipelines · 2024-11-06T17:05:23Z

Azure Pipelines successfully started running 2 pipeline(s).

guschmue · 2024-11-06T17:05:26Z

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2024-11-06T17:05:42Z

Azure Pipelines successfully started running 4 pipeline(s).

azure-pipelines · 2024-11-06T17:05:42Z

Azure Pipelines successfully started running 3 pipeline(s).

azure-pipelines · 2024-11-06T17:06:00Z

Azure Pipelines successfully started running 9 pipeline(s).

onnxruntime/core/providers/webnn/builders/impl/qdq_op_builder.cc

fdwr

👍

fdwr · 2024-11-09T04:24:08Z

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

fdwr · 2024-11-09T04:24:11Z

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

fdwr · 2024-11-09T04:24:13Z

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline

fdwr · 2024-11-09T04:24:15Z

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

azure-pipelines · 2024-11-09T04:24:23Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-11-09T04:24:28Z

Azure Pipelines successfully started running 3 pipeline(s).

azure-pipelines · 2024-11-09T04:24:32Z

Azure Pipelines successfully started running 4 pipeline(s).

azure-pipelines · 2024-11-09T04:24:44Z

Azure Pipelines successfully started running 9 pipeline(s).

For per-axis quantization/dequantization, WebNN requires the scale and zero_point inputs to be broadcastable. Axis should be used for reshape these two inputs.

Honry changed the title ~~[WebNN] Axis should be used for broadcasting~~ [WebNN] QDQ's axis should be used for broadcasting Nov 5, 2024

[WebNN] Axis should be used for broadcasting

7904631

For per-axis quantization/dequantization, WebNN requires the scale and zero_point inputs to be broadcastable. Axis should be used for reshape these two inputs.

guschmue added the ep:WebNN WebNN execution provider label Nov 6, 2024

fdwr reviewed Nov 7, 2024

View reviewed changes

onnxruntime/core/providers/webnn/builders/impl/qdq_op_builder.cc Outdated Show resolved Hide resolved

Address comment

2f3302c

Honry force-pushed the qdq-axis branch from dfb0779 to 2f3302c Compare November 7, 2024 04:59

fdwr approved these changes Nov 9, 2024

View reviewed changes

fdwr merged commit b9b1a03 into microsoft:main Nov 10, 2024
75 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WebNN] QDQ's axis should be used for broadcasting #22721

[WebNN] QDQ's axis should be used for broadcasting #22721

Honry commented Nov 5, 2024

Honry commented Nov 5, 2024

guschmue commented Nov 6, 2024

guschmue commented Nov 6, 2024

guschmue commented Nov 6, 2024

azure-pipelines bot commented Nov 6, 2024

guschmue commented Nov 6, 2024

azure-pipelines bot commented Nov 6, 2024

azure-pipelines bot commented Nov 6, 2024

azure-pipelines bot commented Nov 6, 2024

fdwr left a comment

fdwr commented Nov 9, 2024

fdwr commented Nov 9, 2024

fdwr commented Nov 9, 2024

fdwr commented Nov 9, 2024

azure-pipelines bot commented Nov 9, 2024

azure-pipelines bot commented Nov 9, 2024

azure-pipelines bot commented Nov 9, 2024

azure-pipelines bot commented Nov 9, 2024

[WebNN] QDQ's axis should be used for broadcasting #22721

[WebNN] QDQ's axis should be used for broadcasting #22721

Conversation

Honry commented Nov 5, 2024

Honry commented Nov 5, 2024

guschmue commented Nov 6, 2024

guschmue commented Nov 6, 2024

guschmue commented Nov 6, 2024

azure-pipelines bot commented Nov 6, 2024

guschmue commented Nov 6, 2024

azure-pipelines bot commented Nov 6, 2024

azure-pipelines bot commented Nov 6, 2024

azure-pipelines bot commented Nov 6, 2024

fdwr left a comment

Choose a reason for hiding this comment

fdwr commented Nov 9, 2024

fdwr commented Nov 9, 2024

fdwr commented Nov 9, 2024

fdwr commented Nov 9, 2024

azure-pipelines bot commented Nov 9, 2024

azure-pipelines bot commented Nov 9, 2024

azure-pipelines bot commented Nov 9, 2024

azure-pipelines bot commented Nov 9, 2024