[CUDA, DML] MatMul does not properly handle matrices with inner dim == 0 #21483

yuslepukhin · 2024-07-24T18:50:35Z

Describe the issue

MatMul is expected to produce a valid result when it is multiplying matrices with inner dimension equal to zero.
For example, operands of shapes {16,0} x {0, 16} should produce a zero filled matrix of shape {16, 16}.

This is properly supported in CPU EP, but it is confirmed not to work in CUDA and DML providers.

This feature is necessary to support current design of Lora Adapaters in GenAI, as well as for correctness.

To reproduce

CUDA complains about dimensions equal to zero.

Urgency

No response

Platform

Windows

OS Version

Windows 11

ONNX Runtime Installation

Built from Source

ONNX Runtime Version or Commit ID

1.18.1

ONNX Runtime API

C++

Architecture

X64

Execution Provider

DirectML

Execution Provider Library Version

No response

fdwr · 2024-07-25T06:17:39Z

Yeah, that's illegal from the DirectML API validator point of view, multiplying nothing times nothing and expecting something 😉. One could argue the output (since no multiplication actually occurred) should be NaN's instead. Though, why is a model generator producing such a degenerate operation, rather than just outputting a ConstantOfShape or Expand? Is there more context near the pertinent graph region you can show (via Netron) of what operators come before and after?

skottmckay · 2024-07-26T21:53:46Z

I too was very suprised that you could make magic up data from nothing, and that there was a default value to use which wasn't specified anywhere.

But the spec says "behaves like numpy.matmul" and numpy matul does indeed produce zeros.

yuslepukhin · 2024-07-26T22:44:01Z

Eigen that powers our CPU EP implementation does the same as numpy.

### Description This change addresses a case where we multiply two matrices, and their inner dimension is 0. numpy and Eigen which is being used in our CPU EP implementation correctly handle this case and output a [M, N] matrix filled with zeros. ### Motivation and Context This is required to support GenAI empty input Lora implementation. Addresses: #21483

github-actions · 2024-08-26T15:00:57Z

This issue has been automatically marked as stale due to inactivity and will be closed in 30 days if no further activity occurs. If further support is needed, please provide an update and/or more details.

yuslepukhin added ep:DML issues related to the DirectML execution provider core runtime issues related to core runtime ep:CUDA issues related to the CUDA execution provider labels Jul 24, 2024

github-actions bot added the platform:windows issues related to the Windows platform label Jul 24, 2024

yuslepukhin mentioned this issue Jul 24, 2024

Implement Core functionality for Lora Adapaters microsoft/onnxruntime-genai#679

Closed

yuslepukhin mentioned this issue Jul 26, 2024

[CUDA] Special case for K==0 in CUDA MatMul #21525

Merged

github-actions bot added the stale issues that have not been addressed in a while; categorized by a bot label Aug 26, 2024

ambroser53 mentioned this issue Jan 27, 2025

[Feature Request] Adapters DML support #23503

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CUDA, DML] MatMul does not properly handle matrices with inner dim == 0 #21483

[CUDA, DML] MatMul does not properly handle matrices with inner dim == 0 #21483

yuslepukhin commented Jul 24, 2024

fdwr commented Jul 25, 2024 •

edited

Loading

skottmckay commented Jul 26, 2024

yuslepukhin commented Jul 26, 2024

github-actions bot commented Aug 26, 2024

[CUDA, DML] MatMul does not properly handle matrices with inner dim == 0 #21483

[CUDA, DML] MatMul does not properly handle matrices with inner dim == 0 #21483

Comments

yuslepukhin commented Jul 24, 2024

Describe the issue

To reproduce

Urgency

Platform

OS Version

ONNX Runtime Installation

ONNX Runtime Version or Commit ID

ONNX Runtime API

Architecture

Execution Provider

Execution Provider Library Version

fdwr commented Jul 25, 2024 • edited Loading

skottmckay commented Jul 26, 2024

yuslepukhin commented Jul 26, 2024

github-actions bot commented Aug 26, 2024

fdwr commented Jul 25, 2024 •

edited

Loading