-
Notifications
You must be signed in to change notification settings - Fork 128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
onnx example not working with triton #224
Comments
@Naegionn thanks for trying this out! Yes the example should be updated, are you interested in contribute the onnx example ? |
Sure, if I find the time to do that next week. |
Hi, wondering if there is an update on this. I'm getting 400 errors when trying to run the ONNX example. Thanks! |
Sorry I did not have time to update the example |
Merged
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
/kind bug
What steps did you take and what happened:
I closely followed https://github.com/kserve/kserve/blob/release-0.8/docs/samples/v1beta1/onnx/README.md
to test kserve with the onnx model provided (
storageUri: "gs://kfserving-examples/onnx/style"
) as I want to use kserve with .onnx models.The problem here is that the style model provided is a single .onnx file which was needed for the onnx runtime i suppose.
Now with onnx runtime replaced by triton this exampel does not work anymore as triton will not load from a single onnx file (/mnt/models/model.onnx).
Log from the triton server in kserve-container:
Triton requires a the onnx file to arranged something like this:
After creating my own triton compatible model triton loaded it correctly:
So I assume that the example needs to be updated.
Environment:
kubectl version
): 1.22 (microk8s)/etc/os-release
):The text was updated successfully, but these errors were encountered: