Feature Request: RPC Cuda Build to link with cudart dlls #8912

jkfnc · 2024-08-07T17:26:03Z

Prerequisites

I am running the latest code. Mention the version if possible as well.
I carefully followed the README.md.
I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
I reviewed the Discussions, and have a new and useful enhancement to share.

Feature Description

Is it possible to get RPC builds that work with CUDA, so we don't have to compile from scratch?

Motivation

It would be easier to get started with RPC for distributed inference if a ready-made build were available. The current RPC build only works with CPU.

Possible Implementation

No response

slaren · 2024-08-08T21:58:34Z

I think it would be ok to enable the RPC backend on all the builds, and remove the RPC-specific build. It should be a simple change in the .github/workflows/build.yml file.
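As a rough sketch of what such a change might look like: the fragment below assumes the workflow configures each release build with CMake, and that the RPC backend is switched on with a `GGML_RPC` option alongside `GGML_CUDA`. The step name and the exact set of flags are illustrative assumptions, not copied from the repository's workflow.

```yaml
# Hypothetical excerpt of one job's steps in .github/workflows/build.yml.
# The step name and flag set are assumptions for illustration only.
- name: Build
  run: |
    # Enable the RPC backend in the same configure call as the CUDA backend,
    # so the released CUDA build also ships RPC support.
    cmake -B build -DGGML_CUDA=ON -DGGML_RPC=ON
    cmake --build build --config Release
```

With the option added to every build configuration in this way, a separate RPC-only build job would no longer be needed.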

rgerganov · 2024-08-11T07:15:59Z

I will submit a patch for this in the next few days.


jkfnc added the enhancement label Aug 7, 2024

jkfnc changed the title ~~Feature Request: Req for RPC Cuda Build to link with cudart dlls~~ Feature Request: RPC Cuda Build to link with cudart dlls Aug 7, 2024

rgerganov self-assigned this Aug 11, 2024

rgerganov added a commit to rgerganov/llama.cpp that referenced this issue Aug 12, 2024

ci : enable RPC in all of the released builds

7140070

ref: ggerganov#8912

rgerganov mentioned this issue Aug 12, 2024

ci : enable RPC in all of the released builds #9006

Merged

4 tasks

slaren linked a pull request Aug 12, 2024 that will close this issue

ci : enable RPC in all of the released builds #9006

Merged

4 tasks

rgerganov added a commit that referenced this issue Aug 12, 2024

ci : enable RPC in all of the released builds (#9006)

1f67436

ref: #8912

rgerganov closed this as completed in #9006 Aug 12, 2024

arthw pushed a commit to arthw/llama.cpp that referenced this issue Nov 15, 2024

ci : enable RPC in all of the released builds (ggerganov#9006)

19b9d2a

ref: ggerganov#8912

arthw pushed a commit to arthw/llama.cpp that referenced this issue Nov 18, 2024

ci : enable RPC in all of the released builds (ggerganov#9006)

28747ba

ref: ggerganov#8912
