feat: have pre-req retry upon check fail #574

HumairAK · 2024-02-20T14:13:54Z

The issue resolved by this Pull Request:

Resolves https://issues.redhat.com/browse/RHOAIENG-2099

Same as: #571 for v1.6

Signed-off-by: Humair Khan <HumairAK@users.noreply.github.com>

dsp-developers · 2024-02-20T14:16:08Z

A new image has been built to help with testing out this PR: quay.io/opendatahub/data-science-pipelines-operator:pr-574
An OCP cluster where you are logged in as cluster admin is required.

To use this image run the following:

cd $(mktemp -d)
git clone git@github.com:opendatahub-io/data-science-pipelines-operator.git
cd data-science-pipelines-operator/
git fetch origin pull/574/head
git checkout -b pullrequest fd3aae9110a4857e71e81f8ab9774e13186d389f
oc new-project opendatahub
make deploy IMG="quay.io/opendatahub/data-science-pipelines-operator:pr-574"

More instructions here on how to deploy and test a Data Science Pipelines Application.

amadhusu

I didn't face the issue mentioned in Jira as the Database was available within 1 second after the failure of the Database Health Check as you can see in the screenshot.

My question is only regarding the aggressive 'RequeAfter' duration of 20 seconds. Is it more of a end-user experience kind of thing to go ahead with provisioning the rest of the pods with DSPA once the Database Health Check passes by reconciling aggressively? Any performance hiccups with such a short duration for reconciliation would be my only question. This can be taken offline but the code looks perfect and Works perfectly fine with sanity checks.

gregsheremeta · 2024-02-21T19:03:15Z

/lgtm

it's a little more common to do exponential backoff in controllers. Can be a future improvement (or not).

gregsheremeta · 2024-02-21T19:03:55Z

/approve

openshift-ci · 2024-02-21T19:04:01Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: amadhusu, gregsheremeta

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [gregsheremeta]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

feat: have pre-req retry upon check fail

fd3aae9

Signed-off-by: Humair Khan <HumairAK@users.noreply.github.com>

openshift-ci bot requested review from gmfrasca and gregsheremeta February 20, 2024 14:13

amadhusu approved these changes Feb 21, 2024

View reviewed changes

openshift-ci bot assigned amadhusu Feb 21, 2024

openshift-ci bot added the lgtm label Feb 21, 2024

openshift-ci bot assigned gregsheremeta Feb 21, 2024

openshift-ci bot added the approved label Feb 21, 2024

HumairAK merged commit ef6fd1e into opendatahub-io:v1.6.x Feb 21, 2024
5 of 7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: have pre-req retry upon check fail #574

feat: have pre-req retry upon check fail #574

HumairAK commented Feb 20, 2024 •

edited

Loading

dsp-developers commented Feb 20, 2024

amadhusu left a comment

gregsheremeta commented Feb 21, 2024

gregsheremeta commented Feb 21, 2024

openshift-ci bot commented Feb 21, 2024

feat: have pre-req retry upon check fail #574

feat: have pre-req retry upon check fail #574

Conversation

HumairAK commented Feb 20, 2024 • edited Loading

The issue resolved by this Pull Request:

dsp-developers commented Feb 20, 2024

amadhusu left a comment

Choose a reason for hiding this comment

gregsheremeta commented Feb 21, 2024

gregsheremeta commented Feb 21, 2024

openshift-ci bot commented Feb 21, 2024

HumairAK commented Feb 20, 2024 •

edited

Loading