-
Notifications
You must be signed in to change notification settings - Fork 162
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
"gke-resource-quotas" error updating resources #2346
Comments
It seems to be a Kubernetes bug - argoproj/argo-workflows#3217? |
Here's the kubernetes bug: kubernetes/kubernetes#67761 |
This follows @vaikas PR here: knative/eventing#3215 The kubernetes issue is tracked here: kubernetes/kubernetes#67761 Fixes: knative/test-infra#2346
Well, given that the issue is still open and kingress conformance creates pods, svcs, and ingresses all of which go through the resourcequota I figured I'd port Ville's hack to kingress conformance: knative/networking#79 |
cc @tcnghia for context |
This follows @vaikas PR here: knative/eventing#3215 The kubernetes issue is tracked here: kubernetes/kubernetes#67761 Fixes: knative/test-infra#2346
This follows @vaikas PR here: knative/eventing#3215 The kubernetes issue is tracked here: kubernetes/kubernetes#67761 Fixes: knative/test-infra#2346
* Retry Creates on resourcequota conflicts. (#79) This follows @vaikas PR here: knative/eventing#3215 The kubernetes issue is tracked here: kubernetes/kubernetes#67761 Fixes: knative/test-infra#2346 * Include Get in the UpdateRetry. (#82) This adjusts the update retry around the ingress update in #79 to include the Get. The original change was to guard against issues updating gke-resource-quotas, but there is a low incidence of conflicts simply updating the kingress itself. Here's an example from net-contour: ``` === CONT TestIngressConformance/5/update update.go:88: Error updating Ingress: Operation cannot be fulfilled on ingresses.networking.internal.knative.dev "ingress-conformance-5-update-eghinekn": the object has been modified; please apply your changes to the latest version and try again ``` However, to resolve this, we actually have to refetch the kingress shell we've stuck the desired IngressSpec into otherwise it will just retry until it has exhausted its attempts because the resourceVersion we're sending back is never changed (and this is what the optimistic concurrency keys off of).
Awesome, so this is back, but in a form that our previous workaround no longer works:
|
Here's a net-contour run where this really exploded: https://prow.knative.dev/view/gcs/knative-prow/logs/ci-knative-sandbox-net-contour-continuous/1301862429919023104 |
Looks like: kubernetes/kubernetes#82130 |
This is an expansion of the workaround that vaikas initially added to address knative/test-infra#2346, however, a few new error types have emerged, which this will retry.
This is an expansion of the workaround that vaikas initially added to address knative/test-infra#2346, however, a few new error types have emerged, which this will retry.
This issue is stale because it has been open for 90 days with no |
I have been intermittently seeing flakes with:
an example
I think @vaikas mentioned something like this in eventing as well.
The text was updated successfully, but these errors were encountered: