[Kernel] Support running GPTQ 8-bit models in Marlin#4533
Merged
robertgshaw2-redhat merged 9 commits intovllm-project:mainfrom neuralmagic:marlin_8bitMay 2, 2024
+553-324
Commits
Commits on May 1, 2024
Commits on May 2, 2024
- committed
- committed
- committed
- committed
- committed