docs: update fedora cuda guide for 12.8 release #11393

teihome · 2025-01-24T16:08:20Z

This pull request updates the cuda-fedora.md guide to use the latest CUDA release, 12.8 (previously 12.6), which was uploaded on 2025-01-17.

The new release targets the current version of Fedora, 41 (previously 39).

This guide continues to use the Toolbox environment to allow easy installation on Silverblue or Workstation systems alike.

This pull request also updates the CUDA section of the build document to be clearer and more descriptive about compiling for explicit compute capability targets.
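As a rough illustration of what compiling for explicit targets can look like, here is a minimal sketch. It assumes the targets are passed through the standard CMake variable `CMAKE_CUDA_ARCHITECTURES`; the helper `cuda_arch_list` is hypothetical, and the compute capabilities 8.6 and 8.9 are only example values, not something this pull request prescribes.

```shell
# Hypothetical helper: turn compute capabilities such as "8.6 8.9"
# into the semicolon-separated list CMake expects, e.g. "86;89".
cuda_arch_list() {
    out=""
    for cc in "$@"; do
        # Drop the dot: 8.6 becomes 86.
        arch="$(printf '%s' "$cc" | tr -d '.')"
        # Join entries with ';' (no leading separator for the first one).
        out="${out:+$out;}$arch"
    done
    printf '%s\n' "$out"
}

# Print (rather than run) the configure command for two example targets.
echo "cmake -B build -DGGML_CUDA=ON -DCMAKE_CUDA_ARCHITECTURES=\"$(cuda_arch_list 8.6 8.9)\""
```

Printing the command instead of running it keeps the sketch safe to try on a machine without CUDA installed.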

teihome · 2025-01-24T18:27:51Z

Converted to a draft, as there are some compatibility issues when the toolbox uses newer NVIDIA driver libraries than the host.

teihome · 2025-01-24T19:35:58Z

Okay, I have resolved the issue. The problem was that nvidia-driver-cuda-libs was installed in the guest even when /usr/lib64/libcuda.so.1 was supplied by the host.

While the guest's libcuda was older, it was not updated, so it matched the host's version. Now that the guest has a newer version of the libraries (570) than the host (565), it breaks.
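To make the mismatch concrete, here is a small sketch that compares driver major versions. The function names are hypothetical, and the version strings are passed in by hand; how to read them from the installed libraries is left out and would be an assumption on any given system.

```shell
# Hypothetical helper: print the major component of a driver version,
# so "570" stays "570" and a dotted version keeps only its first field.
driver_major() {
    echo "${1%%.*}"
}

# Succeeds (exit status 0) when the guest major version is strictly
# greater than the host major version, which is the breaking case above.
guest_newer_than_host() {
    [ "$(driver_major "$1")" -gt "$(driver_major "$2")" ]
}

# Example with the versions mentioned in this comment.
if guest_newer_than_host 570 565; then
    echo "guest libraries are newer than the host driver"
fi
```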

It is fixed by never installing nvidia-driver-cuda-libs to the guest filesystem if the host is supplying CUDA.
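The decision described here can be sketched as a simple existence check. This is only an illustration: the function name and the two strategy labels are hypothetical, and the default path is the one quoted above.

```shell
# Hypothetical helper: decide how nvidia-driver-cuda-libs should be
# handled in the guest. Prints "justdb" when the host already supplies
# libcuda (register the package without writing its files) and "full"
# otherwise. The "full" branch is an assumption, not part of the guide.
libcuda_install_strategy() {
    host_libcuda="${1:-/usr/lib64/libcuda.so.1}"
    if [ -e "$host_libcuda" ]; then
        echo "justdb"
    else
        echo "full"
    fi
}

# Report the strategy for the default path on this system.
libcuda_install_strategy
```

The path is a parameter so the check can be tried against any file without touching the real library.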

da2ce7 · 2025-02-05T13:53:41Z

With only minor modifications, it worked for the www.runpod.io config using the Docker image registry.fedoraproject.org/fedora-toolbox:41:

# Bring the base image up to date and install the build tooling.
dnf distro-sync --assumeyes --quiet > /dev/null;
dnf install vim-default-editor --allowerasing --assumeyes --quiet > /dev/null;
dnf install @c-development @development-tools cmake sshd --assumeyes --quiet > /dev/null;
# Add the NVIDIA CUDA repository for Fedora 41.
dnf config-manager addrepo --from-repofile=https://developer.download.nvidia.com/compute/cuda/repos/fedora41/x86_64/cuda-fedora41.repo;
# Register the driver packages in the RPM database only (--justdb), so no driver files are written into the container.
dnf download --destdir=/tmp/nvidia-driver-libs --resolve --arch x86_64 nvidia-driver-cuda nvidia-driver-libs nvidia-driver-cuda-libs nvidia-persistenced --quiet > /dev/null;
rpm --install --verbose --hash --justdb /tmp/nvidia-driver-libs/* --quiet > /dev/null;
rm -rf /tmp/nvidia-driver-libs;
# Install CUDA and put nvcc on the PATH.
dnf install cuda --assumeyes --quiet > /dev/null;
echo "export PATH=\$PATH:/usr/local/cuda/bin" >> /etc/profile.d/cuda.sh;
chmod +x /etc/profile.d/cuda.sh;
source /etc/profile.d/cuda.sh;
nvcc --version;
# Build and install llama.cpp with CUDA enabled, then clean up the sources.
git clone --depth=1 https://github.com/ggerganov/llama.cpp.git /tmp/llama.cpp;
cd /tmp/llama.cpp;
cmake -B build -DGGML_CUDA=ON;
cmake --build build --config Release -j 20;
cmake --install build;
cd ~;
rm -rf /tmp/llama.cpp;
# Make the installed libraries visible to the dynamic linker.
echo "/usr/local/lib" | sudo tee /etc/ld.so.conf.d/local-lib.conf;
echo "/usr/local/lib64" | sudo tee /etc/ld.so.conf.d/local-lib64.conf;
ldconfig;
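One detail of the script above: the `export PATH` line is appended to /etc/profile.d/cuda.sh on every run, and sourcing that file repeatedly adds the same directory to PATH again. A minimal sketch of an idempotent alternative follows; `path_append_once` is a hypothetical helper, not something from the guide.

```shell
# Hypothetical helper: append a directory to PATH only when it is not
# already one of the colon-separated entries.
path_append_once() {
    case ":$PATH:" in
        *":$1:"*) ;;              # already present, nothing to do
        *) PATH="$PATH:$1" ;;     # not present, append it
    esac
}

# Calling it twice leaves a single entry for the CUDA bin directory.
path_append_once /usr/local/cuda/bin
path_append_once /usr/local/cuda/bin
export PATH
```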

I think that this documentation update can be merged as it is.

da2ce7 · 2025-02-05T19:26:47Z

I have now made a RunPod template based on the guide provided here: https://runpod.io/console/deploy?template=mtwj86pqgc&ref=r0lfrx3d

da2ce7 · 2025-02-06T10:26:14Z

Perhaps @ericcurtin and @ngxson would like to review this documentation update. I feel that it might have been lost in the history.

ngxson

Also cc @ericcurtin if you want to take a look

ericcurtin · 2025-02-06T12:15:39Z

Approved. NVIDIA also provides UBI9-based containers, which are quite useful; we use them in RamaLama.

* docs: update fedora cuda guide for 12.8 release
* docs: build cuda update

github-actions bot added the documentation label Jan 24, 2025

teihome force-pushed the cuda-fedora-guide-update branch from c0bf064 to ecb81a4 on January 24, 2025 16:16

teihome marked this pull request as draft January 24, 2025 18:26

teihome force-pushed the cuda-fedora-guide-update branch from c87f807 to e87d080 on January 24, 2025 18:35

teihome added 2 commits January 25, 2025 03:28

docs: update fedora cuda guide for 12.8 release

448ce6a

docs: build cuda update

6f9a843

teihome force-pushed the cuda-fedora-guide-update branch from e87d080 to 6f9a843 on January 24, 2025 19:28

teihome marked this pull request as ready for review January 24, 2025 19:36

slaren approved these changes Jan 24, 2025

ngxson approved these changes Feb 6, 2025

ericcurtin approved these changes Feb 6, 2025

ericcurtin merged commit 9ab42dc into ggml-org:master Feb 6, 2025
2 checks passed

tinglou pushed a commit to tinglou/llama.cpp that referenced this pull request Feb 13, 2025

docs: update fedora cuda guide for 12.8 release (ggml-org#11393)

d05e9b6

* docs: update fedora cuda guide for 12.8 release
* docs: build cuda update

orca-zhang pushed a commit to orca-zhang/llama.cpp that referenced this pull request Feb 26, 2025

docs: update fedora cuda guide for 12.8 release (ggml-org#11393)

f267269

* docs: update fedora cuda guide for 12.8 release
* docs: build cuda update

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Feb 26, 2025

docs: update fedora cuda guide for 12.8 release (ggml-org#11393)

3a169b4

* docs: update fedora cuda guide for 12.8 release
* docs: build cuda update
