Skip to content
This repository has been archived by the owner on Feb 15, 2025. It is now read-only.

Unable to deploy whisper and text-embeddings when setting GPU_LIMIT variable #385

Closed
YrrepNoj opened this issue Apr 12, 2024 · 0 comments · Fixed by #386
Closed

Unable to deploy whisper and text-embeddings when setting GPU_LIMIT variable #385

YrrepNoj opened this issue Apr 12, 2024 · 0 comments · Fixed by #386
Labels
possible-bug 🐛 Something may not be working

Comments

@YrrepNoj
Copy link
Member

The chart resource limits & requests is misconfigured when trying to deploy the whisper and text-embeddings packages with the GPU_LIMIT variable.

Because of the way the values.yaml file was written, if someone tried to set the nvidia.com/gpu to a value >0 the limits & requests wouldn't be equal because 0 != 1

Additional Context

https://kubernetes.io/docs/tasks/manage-gpus/scheduling-gpus/

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
possible-bug 🐛 Something may not be working
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant