Commit
Update vllm/model_executor/layers/quantization/compressed_tensors/triton_scaled_mm.py

Great suggestion to use the reshape operator. I think the change should look like this:

    scale_a = scale_a.reshape(-1, 1) if scale_a.dim() <= 1 else scale_a
    scale_b = scale_b.reshape(-1, 1) if scale_b.dim() <= 1 else scale_b

Co-authored-by: Michael Goin <michael@neuralmagic.com>
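To illustrate what the proposed reshape does, here is a minimal sketch, assuming the scales are PyTorch tensors that may arrive as 0-D scalars, 1-D vectors, or tensors that are already 2-D. The helper name `as_column` and the sample tensors are hypothetical and are not part of the vLLM source; only the conditional expression itself comes from the commit message.

```python
import torch


def as_column(scale: torch.Tensor) -> torch.Tensor:
    """Hypothetical helper wrapping the expression from the commit message.

    0-D and 1-D scales become column vectors of shape (N, 1);
    tensors with two or more dimensions are returned unchanged.
    """
    return scale.reshape(-1, 1) if scale.dim() <= 1 else scale


# Illustrative inputs (assumed shapes, not taken from the vLLM code).
per_tensor = torch.tensor(0.5)             # 0-D: a single scale
per_row = torch.tensor([0.1, 0.2, 0.3])    # 1-D: one scale per row
already_2d = torch.ones(3, 1)              # 2-D: left as-is

print(as_column(per_tensor).shape)   # torch.Size([1, 1])
print(as_column(per_row).shape)      # torch.Size([3, 1])
print(as_column(already_2d).shape)   # torch.Size([3, 1])
```

Using `reshape(-1, 1)` handles both the scalar and the vector case with one expression, since `-1` lets PyTorch infer the row count from the number of elements.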