You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I haven't been able to find much documentation or examples on enabling GPU usage on functions.
I'm trying to deploy a function that requires a GPU. On the k8s cluster only some specific nodes have GPUs on them. If I try to specify in the function config to use a GPU by setting it in the resource limits like this:
resources:
limits:
nvidia.com/gpu: 1
I get an error on deployment for insufficient resources:
0/X nodes are available: X Insufficient nvidia.com/gpu.
I tried adding a nodeSelector to the function config similar to how you would in a deployment manifest:
spec:
nodeSelector:
accelerator: nvidia
But that didn't work. I can spin up a pod normally using the steps above (specifying GPU limit in resource and applying node selector to use the GPU enabled node) and can access the GPU just fine.
For reference I'm deploying via nuctl with a command like this:
Is there something I'm missing? Also will it require building a custom image for use with GPU using a CUDA enabled base image or will Nuclio know if a GPU is enabled to use a GPU compatible image?
The text was updated successfully, but these errors were encountered:
I have run the gpu functions on local machine if that would help: cvat-ai/cvat#2714
I needed to add --triggers '{"myHttpTrigger": {"maxWorkers": 1}}' and also limit the gpu memory use inside of my function to make it work robustly for tensorflow. otherwise sometimes cudnn was failing to initialize.
I haven't been able to find much documentation or examples on enabling GPU usage on functions.
I'm trying to deploy a function that requires a GPU. On the k8s cluster only some specific nodes have GPUs on them. If I try to specify in the function config to use a GPU by setting it in the resource limits like this:
I get an error on deployment for insufficient resources:
0/X nodes are available: X Insufficient nvidia.com/gpu.
I tried adding a nodeSelector to the function config similar to how you would in a deployment manifest:
But that didn't work. I can spin up a pod normally using the steps above (specifying GPU limit in resource and applying node selector to use the GPU enabled node) and can access the GPU just fine.
For reference I'm deploying via
nuctl
with a command like this:Is there something I'm missing? Also will it require building a custom image for use with GPU using a CUDA enabled base image or will Nuclio know if a GPU is enabled to use a GPU compatible image?
The text was updated successfully, but these errors were encountered: