[Bugfix] Update flashinfer.py with PagedAttention forwards - Fixes Gemma2 OpenAI Server Crash #6501
Merged
comaniac merged 4 commits into vllm-project:main from noamgat:patch-3 on Jul 18, 2024
+3 -2
Commits
Commits on Jul 17, 2024
Commits on Jul 18, 2024