-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
NSM pods can freeze on latest k8s versions #1041
Comments
Status update: Newest k8s versions reproduce the same problem
We also managed to reproduce the problem with old NSM releases v1.13.0, v1.14.0: https://github.com/networkservicemesh/integration-k8s-kind/actions/runs/11701859018/job/32588822494 And also I've managed to successfully run previous fixed versions 3 times in row:
|
Hi, I've run some tests on my local machine, there I've build some node images for kind with different containerd versions, and run a script that constantly reloads forwarder daemonset, to check. So I think as they put in the linked issue, a solution could be to run a custom node image with containerd v1.7.22, until they release a kind image with a more recent containerd. |
There already seems to be a pull request to update the containerd version in kind: kubernetes-sigs/kind#3801 |
Is it possible to check if kubernetes-sigs/kind#3801 fixes our problem? |
I can build a docker image with 1.7.23 containerd for kind, if its uploaded in some public space, if I see it correctly, changing the image_image url at these places to the custom built one should do the trick:
and then run the ci with these changes. EDIT: |
Szilard uploded an image here: registry.nordix.org/cloud-native/kind-node:containerd1723 |
FYI: created a PR with using previous versions https://github.com/networkservicemesh/integration-k8s-kind/pull/1036 Am I getting it correct that for now, all we need to do is wait for releases that include fixes? Do we know any ETA? |
yes I believe so, not sure how long it will take, now that I look at it, based on the dates of the version tags, they seem to do quarterly releases, so if it gets merged, then probably february? (I was hoping it would be earlier) |
The problem is sounds like CRITICAL for many folks, so do we have any ticket where folks report the same issue? |
Description
On the newest K8S clusters, NSM pods can freeze on deleting, which ruines tests.
Incidents
Logs
Affected versions
The text was updated successfully, but these errors were encountered: