WebGPU JSEP: Make shader code not depend on input broadcasting patterns #22536

jiangzhaoming · 2024-10-22T07:21:25Z

This PR make MatMul shaders not depend on inputs broadcasting pattern, but only depend on input ranks and their shape provided in uniform. This change fix the issue that currently shaders code are different for different broadcasting, but have identical cache key and results in wrong cache hit.

This PR adds inputs broadcasting information into the cache key of MatMul shaders, which currently impacts the shader code. This PR fixes the results for MatMul nodes with identical input ranks but different broadcasting patterns.

jiangzhaoming · 2024-10-22T07:23:03Z

@qjia7 @fs-eire Please take a look, thanks

js/web/lib/wasm/jsep/webgpu/ops/matmul.ts

jiangzhaoming · 2024-10-23T05:45:00Z

PR description edited, previous:

This PR adds inputs broadcasting information into the cache key of MatMul shaders, which currently impacts the shader code. This PR fixes the results for MatMul nodes with identical input ranks but different broadcasting patterns.

js/web/lib/wasm/jsep/webgpu/ops/matmul.ts

fs-eire · 2024-10-24T07:26:59Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline

fs-eire · 2024-10-24T07:27:01Z

/azp run Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Linux Android Emulator QNN CI Pipeline

fs-eire · 2024-10-24T07:27:03Z

/azp run Android CI Pipeline,iOS CI Pipeline,ONNX Runtime React Native CI Pipeline,CoreML CI Pipeline,Linux DNNL CI Pipeline,Linux MIGraphX CI Pipeline,Linux ROCm CI Pipeline

azure-pipelines · 2024-10-24T07:27:15Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2024-10-24T07:27:16Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2024-10-24T07:27:16Z

Azure Pipelines successfully started running 1 pipeline(s).

js/web/lib/wasm/jsep/webgpu/ops/3rd-party/matmul_packed_webgpu.ts

…ndency

qjia7

Please also remove the definition of getBroadcastDims in commen.ts if it's not needed anymore.

jiangzhaoming · 2024-11-01T07:27:58Z

Please also remove the definition of getBroadcastDims in commen.ts if it's not needed anymore.

Done.

guschmue · 2024-11-08T00:57:45Z

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

azure-pipelines · 2024-11-08T00:57:58Z

Azure Pipelines successfully started running 1 pipeline(s).

guschmue · 2024-11-08T00:58:00Z

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

azure-pipelines · 2024-11-08T00:58:12Z

Azure Pipelines could not run because the pipeline triggers exclude this branch/path.

guschmue · 2024-11-08T00:59:16Z

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

guschmue · 2024-11-08T00:59:27Z

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2024-11-08T00:59:28Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2024-11-08T00:59:35Z

Azure Pipelines could not run because the pipeline triggers exclude this branch/path.

guschmue · 2024-11-08T01:57:31Z

lint not happy:

jiangzhaoming · 2024-11-08T02:42:22Z

The issue comes from merging, fixed in the latest commit. Please take a look, thanks!

guschmue · 2024-11-08T16:05:34Z

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

guschmue · 2024-11-08T16:05:41Z

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

guschmue · 2024-11-08T16:05:47Z

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

azure-pipelines · 2024-11-08T16:05:48Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2024-11-08T16:05:51Z

Azure Pipelines could not run because the pipeline triggers exclude this branch/path.

guschmue · 2024-11-08T16:05:53Z

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2024-11-08T16:06:01Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2024-11-08T16:06:02Z

Azure Pipelines could not run because the pipeline triggers exclude this branch/path.

…ns (microsoft#22536) This PR make MatMul shaders not depend on inputs broadcasting pattern, but only depend on input ranks and their shape provided in uniform. This change fix the issue that currently shaders code are different for different broadcasting, but have identical cache key and results in wrong cache hit.

…ns (#22536) This PR make MatMul shaders not depend on inputs broadcasting pattern, but only depend on input ranks and their shape provided in uniform. This change fix the issue that currently shaders code are different for different broadcasting, but have identical cache key and results in wrong cache hit.

…ns (microsoft#22536) This PR make MatMul shaders not depend on inputs broadcasting pattern, but only depend on input ranks and their shape provided in uniform. This change fix the issue that currently shaders code are different for different broadcasting, but have identical cache key and results in wrong cache hit.

qjia7 reviewed Oct 22, 2024

View reviewed changes

js/web/lib/wasm/jsep/webgpu/ops/matmul.ts Outdated Show resolved Hide resolved

Change the shader implementation to use only ranks and uniform

6d225e0

jiangzhaoming changed the title ~~WebGPU JSEP: Add inputs broadcasting into MatMul shader cache key~~ WebGPU JSEP: Make shader code not rely on input broadcasting patterns Oct 23, 2024

jiangzhaoming changed the title ~~WebGPU JSEP: Make shader code not rely on input broadcasting patterns~~ WebGPU JSEP: Make shader code not depend on input broadcasting patterns Oct 23, 2024

qjia7 reviewed Oct 24, 2024

View reviewed changes

js/web/lib/wasm/jsep/webgpu/ops/matmul.ts Outdated Show resolved Hide resolved

jiangzhaoming added 2 commits October 24, 2024 14:16

Improve code reusing

74c4d22

Make helper only handle the batch indices

b70e48c

guschmue added the ep:WebGPU ort-web webgpu provider label Oct 24, 2024

qjia7 reviewed Oct 28, 2024

View reviewed changes

js/web/lib/wasm/jsep/webgpu/ops/3rd-party/matmul_packed_webgpu.ts Outdated Show resolved Hide resolved

qjia7 mentioned this pull request Nov 1, 2024

[Web] Demucs model won't run in both WASM and WGPU #22031

Closed

Separate shaders impl from matmul.ts (OP impl) to avoid circular depe…

d30b5b5

…ndency

qjia7 reviewed Nov 1, 2024

View reviewed changes

Remove unused getBroadcastDims

54ac622

jiangzhaoming requested a review from fs-eire November 1, 2024 08:28

guschmue previously approved these changes Nov 8, 2024

View reviewed changes

jiangzhaoming added 2 commits November 8, 2024 10:34

Merge branch 'main' into MatMulShaderCacheKeyBroadcasting0

bbaa003

Fix merging issue

84fd707

jiangzhaoming dismissed guschmue’s stale review via 84fd707 November 8, 2024 02:41

guschmue approved these changes Nov 8, 2024

View reviewed changes

guschmue merged commit d9b9168 into microsoft:main Nov 8, 2024
50 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WebGPU JSEP: Make shader code not depend on input broadcasting patterns #22536

WebGPU JSEP: Make shader code not depend on input broadcasting patterns #22536

jiangzhaoming commented Oct 22, 2024 •

edited

Loading

jiangzhaoming commented Oct 22, 2024

jiangzhaoming commented Oct 23, 2024

fs-eire commented Oct 24, 2024

fs-eire commented Oct 24, 2024

fs-eire commented Oct 24, 2024

azure-pipelines bot commented Oct 24, 2024

azure-pipelines bot commented Oct 24, 2024

azure-pipelines bot commented Oct 24, 2024

qjia7 left a comment

jiangzhaoming commented Nov 1, 2024

guschmue commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

guschmue commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

guschmue commented Nov 8, 2024

guschmue commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

guschmue commented Nov 8, 2024

jiangzhaoming commented Nov 8, 2024

guschmue commented Nov 8, 2024

guschmue commented Nov 8, 2024

guschmue commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

guschmue commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

WebGPU JSEP: Make shader code not depend on input broadcasting patterns #22536

WebGPU JSEP: Make shader code not depend on input broadcasting patterns #22536

Conversation

jiangzhaoming commented Oct 22, 2024 • edited Loading

jiangzhaoming commented Oct 22, 2024

jiangzhaoming commented Oct 23, 2024

fs-eire commented Oct 24, 2024

fs-eire commented Oct 24, 2024

fs-eire commented Oct 24, 2024

azure-pipelines bot commented Oct 24, 2024

azure-pipelines bot commented Oct 24, 2024

azure-pipelines bot commented Oct 24, 2024

qjia7 left a comment

Choose a reason for hiding this comment

jiangzhaoming commented Nov 1, 2024

guschmue commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

guschmue commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

guschmue commented Nov 8, 2024

guschmue commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

guschmue commented Nov 8, 2024

jiangzhaoming commented Nov 8, 2024

guschmue commented Nov 8, 2024

guschmue commented Nov 8, 2024

guschmue commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

guschmue commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

azure-pipelines bot commented Nov 8, 2024

jiangzhaoming commented Oct 22, 2024 •

edited

Loading