-
Notifications
You must be signed in to change notification settings - Fork 10.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
vulkan : add backend registry / device interfaces #9721
Conversation
11cb93a
to
e25c9c1
Compare
@0cc4m This PR has two additional changes:
After this change it is possible to use Vulkan and CUDA in the same llama.cpp build (you may have the disable the NVIDIA devices in the Vulkan backend using the |
e25c9c1
to
9e04f2c
Compare
9e04f2c
to
20ca856
Compare
Seems to work fine (Win10), but I'm noticing another increase in layer size. Previously with |
I don't think there are any changes here that could increase the memory usage. It's just exposing existing functionality of the vulkan backend through a different interface. |
@slaren Thank you for implementing this. I can confirm it builds on Linux and that the code looks good. I can't fully test it currently since my server is still disassembled cause I'm in the process of moving between cities. I should be able to reassemble it this weekend, but I'm still very busy. You can decide if you prefer to wait or if you think it's ready to merge. |
Can you check the changes to |
That was a bug, yeah. |
* vulkan : add backend registry / device interfaces * llama : print devices used on model load
* vulkan : add backend registry / device interfaces * llama : print devices used on model load
* vulkan : add backend registry / device interfaces * llama : print devices used on model load
* vulkan : add backend registry / device interfaces * llama : print devices used on model load
No description provided.