Skip to content

CPU/CUDA: Gemma 2 FlashAttention support#8542

Merged
JohannesGaessler merged 4 commits intoggml-org:masterfrom JohannesGaessler:fattn-logit-softcapAug 24, 2024

Commits

Commits on Aug 24, 2024