[SHIPA-2322] adds wait-retry to helm update & delete #233

stinkyfingers · 2022-02-08T14:51:53Z

Description

2322
Helm update and helm delete currently fail when the object status (app) is not deployed. This change checks a map for actionable & retryable statuses for updates and deletions. For wait-retry statuses, a wait-retry loop is entered, and ultimately the status is manually updated to deployed.

Type of change

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
Chore (documentation addition or typo, file relocation)

Testing

New tests were added with this PR that prove my fix is effective or that my feature works (describe below this bullet)
This change requires no testing (i.e. documentation update)

Documentation

All added public packages, funcs, and types have been documented with doc comments
I have commented my code, particularly in hard-to-understand areas

Final Checklist:

I followed standard GitHub flow guidelines
I have performed a self-review of my own code
My changes generate no new warnings

clean up tests

aleksej-paschenko · 2022-02-09T12:53:26Z

internal/chart/helm_client.go

+type statusFunc func(cfg *action.Configuration, appName string) (*release.Release, release.Status, error)
+
+const (
+	WaitRetry = iota


it looks like there is no need to export WaitRetry, TakeAction and NoAction.
Is it possible to use waitRetry, takeAction and noAction correspondingly?

You bet. Privatized!

aleksej-paschenko · 2022-02-09T12:54:14Z

internal/chart/helm_client.go

+
+// helmStatusActionMapUpdate maps a Release Status to a Ketch action for helm updates
+var helmStatusActionMapUpdate = map[release.Status]int{
+	"not-found":                   TakeAction,


maybe a new const for not-found?

aleksej-paschenko · 2022-02-09T14:00:27Z

internal/chart/helm_client.go

+func (c HelmClient) waitForActionableStatus(statusFunc statusFunc, appName string, statusActionMap map[release.Status]int) (bool, error) {
+	ticker := time.NewTicker(statusRetryInterval)
+	done := time.After(statusRetryTimeout)
+	var helmRelease *release.Release


I like this approach, but will it block the appReconciler's loop?
if yes, what are the negative consequences we are going to deal with?

Per our slack discussion: I removed the wait loop. Now, "wait-retry" statuses throw an error, which the reconciler loop is expected to handle. This means that 1) we don't manually update a chart's status here (not sure if that's good or bad) 2) there is a possiblity that a chart will get stuck in a weird status like pending-uninstall and the reconciler will just keep re-trying. Not sure if that's something to be concerned about.

…const labels

aleksej-paschenko · 2022-02-10T13:44:58Z

internal/chart/helm_client.go

+		return false, nil
+	case takeAction:
+		return true, nil
+	default:


would it help if we implement something like

ketch/internal/chart/helm_client.go

Lines 129 to 136 in b9e8b85

if lastRelease.Info.FirstDeployed.Before(helmTime.Time{Time: timeoutLimit}) {

newStatus := release.StatusDeployed

c.log.Info(fmt.Sprintf("Setting status of release that has timeouted to: %s", newStatus))

lastRelease.SetStatus(newStatus, "manually canceled")

if err := c.cfg.Releases.Update(lastRelease); err != nil {

return nil, err

}

}

i

Yes, that makes sense. This PR's addition almost duplicates that FirstDeployed.Before check, but considers additional statuses. I removed the original block of code.

aleksej-paschenko

looks amazing!

adds wait-retry to helm update & delete

894e26f

clean up tests

stinkyfingers force-pushed the shipa-2322 branch from 30f592f to 894e26f Compare February 8, 2022 15:10

aleksej-paschenko reviewed Feb 9, 2022

View reviewed changes

avoids wait loop; returns error on non-actionable statuses; clean up …

b9e8b85

…const labels

aleksej-paschenko reviewed Feb 10, 2022

View reviewed changes

stinkyfingers added 2 commits February 10, 2022 09:50

adds default timeout/update to helm chart checks

15e2dee

rm duplicated code

a064225

aleksej-paschenko approved these changes Feb 11, 2022

View reviewed changes

stinkyfingers merged commit a6830cd into main Feb 11, 2022

stinkyfingers deleted the shipa-2322 branch February 11, 2022 14:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SHIPA-2322] adds wait-retry to helm update & delete #233

[SHIPA-2322] adds wait-retry to helm update & delete #233

stinkyfingers commented Feb 8, 2022 •

edited

Loading

aleksej-paschenko Feb 9, 2022

stinkyfingers Feb 9, 2022

aleksej-paschenko Feb 9, 2022

stinkyfingers Feb 9, 2022

aleksej-paschenko Feb 9, 2022

stinkyfingers Feb 9, 2022

aleksej-paschenko Feb 10, 2022

stinkyfingers Feb 10, 2022

aleksej-paschenko left a comment

	if lastRelease.Info.FirstDeployed.Before(helmTime.Time{Time: timeoutLimit}) {
	newStatus := release.StatusDeployed
	c.log.Info(fmt.Sprintf("Setting status of release that has timeouted to: %s", newStatus))
	lastRelease.SetStatus(newStatus, "manually canceled")
	if err := c.cfg.Releases.Update(lastRelease); err != nil {
	return nil, err
	}
	}

[SHIPA-2322] adds wait-retry to helm update & delete #233

[SHIPA-2322] adds wait-retry to helm update & delete #233

Conversation

stinkyfingers commented Feb 8, 2022 • edited Loading

Description

Type of change

Testing

Documentation

Final Checklist:

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

aleksej-paschenko left a comment

Choose a reason for hiding this comment

stinkyfingers commented Feb 8, 2022 •

edited

Loading