Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

CUDA: fix MMV kernel being used for FP16 src1 #10357

Merged
merged 1 commit into from
Nov 17, 2024

Conversation

JohannesGaessler
Copy link
Collaborator

Fixes #10356 .

The problem is simply that I forgot to add a check for the type of src1. While FP16 src1 is not used for model evaluation it is used in the test code.

@github-actions github-actions bot added the Nvidia GPU Issues specific to Nvidia GPUs label Nov 17, 2024
@slaren
Copy link
Collaborator

slaren commented Nov 17, 2024

Wouldn't it be more reliable to check use_mul_mat_vec, since it has the full test for compatibility already?

@JohannesGaessler
Copy link
Collaborator Author

You're right, I forgot to adapt the logic for the first check when I added the variable (already in the previous PR).

@JohannesGaessler JohannesGaessler merged commit 76e9e58 into ggerganov:master Nov 17, 2024
54 checks passed
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Nvidia GPU Issues specific to Nvidia GPUs
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Bug: ggml-4-x86-cuda-v100 CI Failure probably caused by #10318
3 participants