Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

NIM on GKE Tutorial #737

Merged
merged 16 commits into from
Jul 29, 2024
Prev Previous commit
Next Next commit
added namespace and removed context setting command
  • Loading branch information
brandonroyal committed Jul 23, 2024
commit 0672094112e5e34efcadaf39c2207aef0e12762c
3 changes: 1 addition & 2 deletions tutorials-and-examples/nvidia-nim/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -85,7 +85,6 @@ ngc registry model list
4. Create a Kuberntes namespace and switch context to that namespace
```bash
kubectl create namespace nim
kubectl config set-context --current --namespace nim
```

## Deploy a PVC to persist the model
Expand Down Expand Up @@ -128,7 +127,7 @@ helm --namespace nim install demo-nim nim-llm/ --set model.ngcAPIKey=$NGC_CLI_AP
## Test the NIM
1. Expose the service
```bash
kubectl port-forward services/demo-nim-nim-llm 8000
kubectl port-forward --namespace nim services/demo-nim-nim-llm 8000
```

2. Send a test prompt - A100
Expand Down