cublas Cuda 801 on Maxwell Titan X #1447
Comments
Commenting out #L7071 stops this error, but I'm still curious as to which instruction wasn't supported 🤔
This actually amounts to burying one's head in the sand because you've only eliminated the error message, but the error itself still exists. You can set Lines 187 to 198 in 6a5d195
Lines 200 to 216 in 6a5d195
I have an NVIDIA GeForce GTX 860M, and I have suddenly started having the same issue since the last pull. EDIT: yep, falling back to commit
I didn't notice the
Forcing CUDA_ARCH_FLAG still results in the problem for me as well. What's interesting in my case is
I don't mean to poke, but this is still an issue.
Does it work if you apply this patch?
diff --git a/ggml-cuda.cu b/ggml-cuda.cu
index b420330..9da239a 100644
--- a/ggml-cuda.cu
+++ b/ggml-cuda.cu
@@ -96,7 +96,7 @@
// - 7B quantum model: +100-200 MB
// - 13B quantum model: +200-400 MB
//
-//#define GGML_CUDA_FORCE_MMQ
+#define GGML_CUDA_FORCE_MMQ
// TODO: improve this to be correct for more hardware
// for example, currently fails for GeForce GTX 1660 which is TURING arch (> VOLTA) but does not have tensor cores
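The TODO in the patch context hints at why a plain architecture comparison is fragile: a GTX 1660 is Turing, so it passes a "Volta or newer" test, yet it has no tensor cores. Below is a minimal sketch of that selection logic, assuming the compute capability is encoded as 100 * major + 10 * minor. `kCcVolta`, `DeviceInfo`, and both function names are hypothetical, chosen for illustration, and are not claimed to be the identifiers used in ggml-cuda.cu.

```cpp
#include <cassert>

// Hypothetical encoding: 100 * major + 10 * minor (e.g. 5.2 -> 520, 7.5 -> 750).
constexpr int kCcVolta = 700;

// Illustrative device description; has_tensor_cores is assumed to be known
// from some other source, since compute capability alone does not imply it.
struct DeviceInfo {
    int  compute_capability;
    bool has_tensor_cores;
};

// Naive rule: any device at or above Volta takes the tensor-core path
// unless MMQ is forced. This misclassifies Turing cards without tensor cores.
inline bool use_tensor_core_path_naive(const DeviceInfo& dev, bool force_mmq) {
    return !force_mmq && dev.compute_capability >= kCcVolta;
}

// Stricter rule: additionally require that the device really has tensor cores.
inline bool use_tensor_core_path(const DeviceInfo& dev, bool force_mmq) {
    assert(dev.compute_capability > 0);
    return !force_mmq && dev.compute_capability >= kCcVolta && dev.has_tensor_cores;
}
```

Under this sketch, a compute capability 5.2 device such as the Maxwell Titan X never takes the tensor-core path under either rule, so forcing MMQ would not be expected to change its behavior. That is at least consistent with the report that the error persisted with the patch applied.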
It's an old card, I know, but hopefully there is something that can be done.
https://github.com/ggerganov/whisper.cpp/blob/master/ggml-cuda.cu#L7069-#L7071
There seems to be an issue with Maxwell cards not supporting some function in CUDA. I'm not sure exactly which instruction is unsupported, but maybe someone can provide some insight?
In this sample I manually disabled the tensor cores by forcing GGML_CUDA_FORCE_MMQ, but the issue still exists.
An important thing to note is that I compiled the library on a machine with a 3070. That could well be the root cause.
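One way to reason about the "compiled on a 3070" hypothesis is to check whether the architectures embedded in a binary can run on the target GPU at all. The sketch below encodes the general CUDA compatibility rules: machine code built for sm_XY runs on devices with the same major version and an equal or higher minor version, while embedded PTX for compute_XY can be JIT-compiled on any device of that capability or newer. The types and function names are hypothetical. The sketch only models which code objects are loadable and does not establish that an architecture mismatch is what produces error 801 here.

```cpp
#include <cassert>
#include <vector>

// Compute capability, e.g. {5, 2} for a Maxwell Titan X, {8, 6} for an RTX 3070.
struct Arch { int cc_major; int cc_minor; };

// One entry per code object embedded in the binary (hypothetical model).
struct Target {
    Arch arch;
    bool is_ptx;  // true: PTX for compute_XY, false: machine code for sm_XY
};

// True if capability a is greater than or equal to capability b.
inline bool at_least(Arch a, Arch b) {
    return a.cc_major > b.cc_major ||
           (a.cc_major == b.cc_major && a.cc_minor >= b.cc_minor);
}

// Can this single code object be used on the given device?
inline bool target_runs_on(const Target& t, Arch device) {
    assert(device.cc_major > 0);
    if (t.is_ptx) {
        // PTX is forward compatible: the driver can JIT it for newer devices.
        return at_least(device, t.arch);
    }
    // Machine code is only compatible within the same major version.
    return device.cc_major == t.arch.cc_major && device.cc_minor >= t.arch.cc_minor;
}

// A binary is usable if at least one embedded code object is usable.
inline bool binary_runs_on(const std::vector<Target>& targets, Arch device) {
    for (const Target& t : targets) {
        if (target_runs_on(t, device)) return true;
    }
    return false;
}
```

For example, a build containing only sm_86 and compute_86 code (which is what a build that targets only the local 3070 would contain) yields `false` for an `Arch{5, 2}` device, whereas a build that also embeds compute_52 PTX yields `true` for both cards.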