Skip to content

[Kernel] Support running GPTQ 8-bit models in Marlin#4533

Merged
robertgshaw2-redhat merged 9 commits intovllm-project:mainfrom neuralmagic:marlin_8bitMay 2, 2024

Commits

Commits on May 1, 2024

Commits on May 2, 2024