-
Notifications
You must be signed in to change notification settings - Fork 402
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Model loading failed with --gpulayer 80 on Metal #744
Comments
Is this a new issue? Did the same model load correctly previously? Looking at your debug logs, I don't see metal being initialized. Did you build with |
Yes, this is a new problem. When I saw your reply first time, I thought I might do a mistake. But now, I confirmed that I confirmed thaat this is really an issue. I pulled the Click Me
And I built it with Click Me
It compiled well without errors, then I tried to run it. Click Me
Failed. So I tried to run a small GGUF model on same disk volume. Click Me
It is failed again. |
For some reason, your program is not calling the Let's try troubleshoot this sequentially. If it's still not working, there are 3 commits that change the metal related files. They are be858f6 bb6d00b and 8a3012a . Of these 3, I think the most likely one to cause issues is 8a3012a . Unfortunately, you won't be able to directly revert these commits due to merge conflicts. But perhaps you could examine the changes and see if you can figure out what causes the problems. If you're still stuck, let me know and I'll create a few separate checkpoints you can try - I can't debug this on my side as I don't have a mac. It's weird as it seems the Init isn't even being called. |
If you can stick some print statements in this function https://github.com/LostRuins/koboldcpp/blob/concedo/ggml-metal.m#L2835 and within
Would also be helpful to compare the terminal output of the successfull v1.60.1 build. |
I found that the first line of
So I investigated
An error is occurred at L347 - |
And even more confusingly, |
I changed the
|
Related: ggerganov#5977 |
As a workround, this works.
|
Yeah, but it's not ideal. |
I might go with the
Might need your help to test it again after I tweak it. It's annoying cause I will not be able to test anything myself as I don't have a mac. |
After doing the tweak using
Tada! I also noticed that
So, in |
Ah yeah, that is fixed in ggerganov#6015 which I will merge together when fixing the makefile tomorrow. Thanks for helping test. |
Hi @beebopkim , if you don't mind, can you see if the latest experimental branch runs fine with |
@LostRuins I wish I do it right now but I'm afraid that I can do it after 9 hours... Sorry for your waiting. |
No problem, just let me know |
@LostRuins With f3b7651, there is no problem. Now I can run with bakllava-mistal-v1 with --gpulayers 99! Thanks alot! 😃 |
Thanks for testing! |
I also confirmed that ec5dea1 works too. You're welcome! |
Commit hash: edb05e7
Branch: concedo_experimental
With
--gpulayers 80
:Without
--gpulayers 80
:For comparison -
server
with-ngl 999
from llama.cpp commit hash 306d34b:The text was updated successfully, but these errors were encountered: