[Attention] Deepseek v3 MLA support with FP8 compute #12601

Merged
simon-mo merged 41 commits into vllm-project:main from LucasWilkinson:mla-fp8, Feb 1, 2025

Commits

Commits on Jan 30, 2025

Commits on Jan 31, 2025

Commits on Feb 1, 2025