Remove blocksize 64 for quant/dequant functions #10

pnunna93 · 2024-03-06T22:06:22Z

This PR removes blocksize 64 from the quantize and dequantize functions, because the ROCm warp size doesn't support that case.

It also skips that case in the tests that use the quantize/dequantize functions. The following tests are enabled with this PR:

test_autograd.py::test_matmul_fp8
test_functional.py::test_dynamic_blockwise_quantization
test_functional.py::test_4bit_compressed_stats
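
For illustration, the skip described above could look roughly like the sketch below. It is not the code from this PR: `ALL_BLOCKSIZES`, `supported_blocksizes`, and `detect_rocm` are hypothetical names, and the list of block sizes is an assumption. The only library detail it relies on is that PyTorch exposes `torch.version.hip`, which is `None` on non-ROCm builds.

```python
# Minimal sketch (assumed names, not the PR's actual code).

# Assumed set of block sizes accepted by the quantize/dequantize functions.
ALL_BLOCKSIZES = (4096, 2048, 1024, 512, 256, 128, 64)


def supported_blocksizes(is_rocm: bool) -> tuple:
    """Return the block sizes to use, dropping 64 when running on ROCm."""
    if is_rocm:
        return tuple(b for b in ALL_BLOCKSIZES if b != 64)
    return ALL_BLOCKSIZES


def detect_rocm() -> bool:
    """Best-effort check for a ROCm build of PyTorch."""
    try:
        import torch
    except ImportError:
        # Without PyTorch there is nothing to detect; treat as non-ROCm.
        return False
    return getattr(torch.version, "hip", None) is not None


if __name__ == "__main__":
    # A test could iterate over these values instead of a hard-coded list.
    for blocksize in supported_blocksizes(detect_rocm()):
        print(blocksize)
```

A test parametrized over `supported_blocksizes(detect_rocm())` would then never exercise blocksize 64 on ROCm, while keeping it on other platforms.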

Lzy17

LGTM

pnunna93 added 2 commits March 6, 2024 19:14

remove blocksize 64 on rocm

fa28828

remove block size 64 and enable remaining tests

d86d24c

pnunna93 requested review from Lzy17 and amathews-amd March 6, 2024 22:06

Lzy17 approved these changes Mar 12, 2024


pnunna93 merged commit 9890d5d into rocm_enabled Mar 12, 2024
1 check passed

pnunna93 deleted the remove_blocksize_64 branch March 12, 2024 21:33
