Skip to content

Add Support for 2/3/8-bit GPTQ Quantization Models#2330

Merged
WoosukKwon merged 8 commits intovllm-project:mainfrom chu-tianxiang:gptq_8bitFeb 29, 2024

Commits

Commits on Dec 24, 2023

Commits on Dec 25, 2023

Commits on Jan 3, 2024

Commits on Jan 5, 2024

Commits on Feb 24, 2024