Skip to content

Commit

Permalink
Removing the contiguous requirement, as the kernel supports arbitrary…
Browse files Browse the repository at this point in the history
… strides

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
  • Loading branch information
gshtras committed Feb 13, 2025
1 parent bbab81f commit 2205c07
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion vllm/model_executor/layers/quantization/utils/fp8_utils.py
Original file line number Diff line number Diff line change
Expand Up @@ -477,7 +477,7 @@ def w8a8_block_fp8_matmul(
assert triton.cdiv(A.shape[-1], block_k) == As.shape[-1]
M = A.numel() // A.shape[-1]

assert B.ndim == 2 and B.is_contiguous() and Bs.ndim == 2
assert B.ndim == 2 and Bs.ndim == 2
N, K = B.shape
assert triton.cdiv(N, block_n) == Bs.shape[0]
assert triton.cdiv(K, block_k) == Bs.shape[1]
Expand Down

0 comments on commit 2205c07

Please sign in to comment.