Skip to content

[Bugfix] Update flashinfer.py with PagedAttention forwards - Fixes Gemma2 OpenAI Server Crash#6501

Merged
comaniac merged 4 commits intovllm-project:mainfrom noamgat:patch-3Jul 18, 2024

Commits

Commits on Jul 17, 2024

Commits on Jul 18, 2024