LLM: support quantized kv cache for Mistral in transformers >=4.36.0 #4244
