[js/webgpu] Increase workgroupSize if only one workgroup is dispatched #22709

qjia7 · 2024-11-04T08:18:19Z

#22031

For reduce-related ops, we should increase workgroupSize to improve parallelism when only one workgroup is dispatched.

With this change, the total ReduceMean time drops from 77.79 ms to 8.98 ms on my iGPUs.
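To illustrate the idea, here is a minimal sketch rather than the code from this PR. It assumes one workgroup is dispatched per output element, and the names `pickWorkgroupSize` and `simulateWorkgroupSum` as well as the sizes 32 and 256 are hypothetical choices made for the example. The simulation runs on the CPU and only counts how many sequential accumulation steps each thread would perform.

```typescript
// Hypothetical helper, not taken from reduce-shared.ts.
// Assumption: one workgroup is dispatched per output element, so
// outputSize === 1 means a single workgroup does all the work.
// The default sizes 32 and 256 are illustrative, not values from the PR.
function pickWorkgroupSize(outputSize: number, small = 32, large = 256): number {
  return outputSize === 1 ? large : small;
}

// CPU simulation of one workgroup summing `data` with `workgroupSize` threads.
// `workgroupSize` must be a power of two for the tree reduction below.
function simulateWorkgroupSum(
  data: number[],
  workgroupSize: number,
): { sum: number; stepsPerThread: number } {
  const shared: number[] = new Array<number>(workgroupSize).fill(0);

  // Phase 1: each simulated thread accumulates a strided slice of the input.
  for (let t = 0; t < workgroupSize; t++) {
    for (let i = t; i < data.length; i += workgroupSize) {
      shared[t] += data[i];
    }
  }

  // Phase 2: tree reduction over the per-thread partial sums.
  for (let stride = workgroupSize >> 1; stride > 0; stride >>= 1) {
    for (let t = 0; t < stride; t++) {
      shared[t] += shared[t + stride];
    }
  }

  // Sequential loop iterations each thread needs in phase 1.
  const stepsPerThread = Math.ceil(data.length / workgroupSize);
  return { sum: shared[0], stepsPerThread };
}

// Usage: reduce 1024 values to a single output element.
const input = Array.from({ length: 1024 }, (_, i) => i + 1);
const narrow = simulateWorkgroupSum(input, pickWorkgroupSize(8));
const wide = simulateWorkgroupSum(input, pickWorkgroupSize(1));
console.log(narrow.stepsPerThread, wide.stepsPerThread);
```

In this sketch the wider workgroup gives the same sum while each thread loops over fewer elements, which is the parallelism gain described above. Actual speedups depend on the GPU and the shader.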

qjia7 · 2024-11-04T08:23:18Z

@guschmue @fs-eire Please take a look, thanks.

fs-eire · 2024-11-04T10:13:05Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline

fs-eire · 2024-11-04T10:13:07Z

/azp run Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Linux Android Emulator QNN CI Pipeline

fs-eire · 2024-11-04T10:13:11Z

/azp run Android CI Pipeline,iOS CI Pipeline,ONNX Runtime React Native CI Pipeline,CoreML CI Pipeline,Linux DNNL CI Pipeline,Linux MIGraphX CI Pipeline,Linux ROCm CI Pipeline

azure-pipelines · 2024-11-04T10:13:19Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2024-11-04T10:13:22Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2024-11-04T10:13:24Z

Azure Pipelines successfully started running 1 pipeline(s).

js/web/lib/wasm/jsep/webgpu/ops/reduce-shared.ts

guschmue · 2024-11-05T18:17:13Z

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline

guschmue · 2024-11-05T18:17:20Z

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

guschmue · 2024-11-05T18:17:26Z

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

azure-pipelines · 2024-11-05T18:17:26Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2024-11-05T18:17:30Z

Azure Pipelines could not run because the pipeline triggers exclude this branch/path.

guschmue · 2024-11-05T18:17:32Z

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2024-11-05T18:17:40Z

Azure Pipelines successfully started running 1 pipeline(s).

azure-pipelines · 2024-11-05T18:17:40Z

Azure Pipelines could not run because the pipeline triggers exclude this branch/path.

…microsoft#22709) microsoft#22031 For reduce related ops, we should increase workgroupSize to improve parallelism if only one workgroup is dispatched. The total ReduceMean time becomes 8.98 ms from 77.79 ms on my iGPUs.

[js/webgpu] Increase workgroupSize if only one workgroup is dispatched

38287f9

For reduce related ops, we should increase workgroupSize to improve parallelism if only one workgroup is dispatched.

fs-eire reviewed Nov 4, 2024

js/web/lib/wasm/jsep/webgpu/ops/reduce-shared.ts

guschmue previously approved these changes Nov 4, 2024

guschmue added the ep:WebGPU ort-web webgpu provider label Nov 4, 2024

address comments

65f6575

qjia7 dismissed guschmue’s stale review via 65f6575 November 5, 2024 05:05

qjia7 requested a review from fs-eire November 5, 2024 05:07

guschmue approved these changes Nov 5, 2024

guschmue merged commit d5b2730 into microsoft:main Nov 5, 2024
50 checks passed

qjia7 deleted the opt_reduce branch November 7, 2024 02:22
