Add Support for 2/3/8-bit GPTQ Quantization Models#2330
Merged
WoosukKwon merged 8 commits intovllm-project:mainfrom chu-tianxiang:gptq_8bitFeb 29, 2024
+1,736-229
Commits
Commits on Dec 24, 2023
- committed
Commits on Dec 25, 2023
- committed
- committed
Commits on Jan 3, 2024
- committed
- committed
- committed
Commits on Jan 5, 2024
- committed
Commits on Feb 24, 2024
- committed