Skip to content

[Bugfix][Kernel] FA3 Fix - RuntimeError: This flash attention build only supports pack_gqa (for build size reasons).#12405

Merged
tlrmchlsmth merged 1 commit intovllm-project:mainfrom neuralmagic:lwilkinson/re-enable-non-packed-gqaJan 24, 2025

Commits

Commits on Jan 24, 2025