Skip to content

[Bugfix] Fix illegal memory access error with chunked prefill, prefix caching, block manager v2 and xformers enabled together#9532

Merged
comaniac merged 6 commits intovllm-project:mainfrom sasha0552:cuda-illegal-memory-access-fixOct 31, 2024

Commits

Commits on Oct 30, 2024

Commits on Oct 31, 2024