Skip to content

CUDA: fix partial offloading for ne0 % 256 != 0#8572

Merged
JohannesGaessler merged 1 commit intoggerganov:masterfrom JohannesGaessler:cuda-glm4-fixJul 18, 2024

Commits

Commits on Jul 18, 2024