
vulkan : add backend registry / device interfaces #9721

Merged: 3 commits into master, Oct 17, 2024
Conversation

@slaren (Collaborator) commented Oct 3, 2024

No description provided.

@github-actions bot added the Vulkan (Issues specific to the Vulkan backend) and ggml (changes relating to the ggml tensor library for machine learning) labels on Oct 3, 2024
@slaren slaren marked this pull request as ready for review October 3, 2024 23:27
@slaren (Collaborator, Author) commented Oct 3, 2024

@0cc4m This PR has two additional changes:

  • Translates the device index in ggml_backend_vk_get_device_description (I believe this was a bug)
  • Changes the names of the backends/buffers etc. to Vulkan<idx>. This is the intended use of the name for these objects; a more detailed description can now be obtained through the ggml-backend device interface.

After this change it is possible to use Vulkan and CUDA in the same llama.cpp build (you may have to disable the NVIDIA devices in the Vulkan backend using the GGML_VK_VISIBLE_DEVICES environment variable).
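For example, hiding devices from the Vulkan backend means listing only the indices it should see. A hedged sketch; the binary name, model path, layer count, and device indices below are placeholders:

```shell
# GGML_VK_VISIBLE_DEVICES takes a comma-separated list of Vulkan device
# indices; devices not listed are hidden from the Vulkan backend.
# Hypothetical invocation: paths and indices are placeholders.
export GGML_VK_VISIBLE_DEVICES=1,2
./llama-cli -m ./models/model.gguf -ngl 33
```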

@0cc4m 0cc4m self-requested a review October 4, 2024 05:23
@MaggotHATE (Contributor) commented:

Seems to work fine (Win10), but I'm noticing another increase in layer size. Previously, with Mistral-Nemo-Instruct-2407.q5_k_l, I could offload 5 layers on 3 GB of VRAM; now it's only 3. Is this expected? The total VRAM usage is pretty much the same as before the backend registry updates.

@slaren (Collaborator, Author) commented Oct 8, 2024

I don't think there are any changes here that could increase memory usage; this PR just exposes existing functionality of the Vulkan backend through a different interface.

@0cc4m (Collaborator) commented Oct 16, 2024

@slaren Thank you for implementing this. I can confirm that it builds on Linux and that the code looks good. I can't fully test it at the moment, since my server is still disassembled because I'm in the process of moving between cities. I should be able to reassemble it this weekend, but I'm still very busy. You can decide whether you prefer to wait or whether you think it's ready to merge.

@slaren (Collaborator, Author) commented Oct 16, 2024

Can you check the changes to ggml_backend_vk_get_device_description? Previously, it wouldn't translate the device index to the indices given by GGML_VK_VISIBLE_DEVICES, which I believe was a bug. Other than that, I think there is very little chance that this PR breaks anything.

@0cc4m (Collaborator) commented Oct 16, 2024

That was a bug, yeah.

@slaren slaren merged commit f010b77 into master Oct 17, 2024
54 checks passed
@slaren slaren deleted the sl/vulkan-reg-2 branch October 17, 2024 00:47
drollings pushed a commit to drollings/llama.cpp that referenced this pull request Oct 18, 2024
* vulkan : add backend registry / device interfaces

* llama : print devices used on model load
dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this pull request Oct 29, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024
Labels
ggml (changes relating to the ggml tensor library for machine learning), Vulkan (Issues specific to the Vulkan backend)
4 participants