Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[v0.14] Cherry-pick Inference Graph stabilizations #427

Conversation

israel-hdez
Copy link

What this PR does / why we need it:

Cherry-pick the following PRs:

Which issue(s) this PR fixes (optional, in fixes #<issue number>(, fixes #<issue_number>, ...) format, will close the issue(s) when PR gets merged):
Fixes https://issues.redhat.com/browse/RHOAIENG-13448

Type of changes
Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

@israel-hdez israel-hdez force-pushed the j13448-cp-trust-ca-and-sidecar-check branch from 3f572cc to ae39d23 Compare November 28, 2024 18:57
@israel-hdez israel-hdez requested review from Jooho and removed request for terrytangyuan and mwaykole November 28, 2024 19:05
Copy link

openshift-ci bot commented Nov 28, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: israel-hdez, spolti

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

This changes kserve-controller to mount the OpenShift Service CA bundle into kserve-router container and a configures it to trust the bundle. This affects InferenceGraph deployed in Serverless mode.

With these changes, InferenceGraphs will work correctly when deployed without an Istio sidecar.

These changes are needed because in ODH the InferenceServices are secured with TLS. The internal endpoints (which are the ones InferenceGraph uses) are using OpenShift service serving certificates.

Related to: https://issues.redhat.com/browse/RHOAIENG-13448

Signed-off-by: Edgar Hernández <23639005+israel-hdez@users.noreply.github.com>
In topologies using Istio mesh, applications can use plain-text HTTP to send traffic to other mesh-member workloads. Handling of TLS is delegated to Istio.

Thus, when KServe workloads have the Istio sidecar (e.g. by using auto-injection), the Inference Graph router should send its traffic without TLS even if the schema of the service URL is specified as HTTPS. Istio would originate TLS when needed, and the double TLS is prevented (see https://istio.io/latest/docs/ops/common-problems/network-issues/#double-tls).

These changes implement a detection of the Istio sidecar by querying a well known port that the sidecar is using. If the sidecar is found, inference requests are sent using plain-text HTTP.

Signed-off-by: Edgar Hernández <23639005+israel-hdez@users.noreply.github.com>
@israel-hdez israel-hdez force-pushed the j13448-cp-trust-ca-and-sidecar-check branch from ae39d23 to 1ce708d Compare November 28, 2024 19:31
@israel-hdez israel-hdez merged commit 66b6202 into opendatahub-io:release-v0.14 Nov 28, 2024
19 of 20 checks passed
@israel-hdez israel-hdez deleted the j13448-cp-trust-ca-and-sidecar-check branch November 28, 2024 23:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

3 participants