[CI:ALL] Bump to v5.2.0-rc3 #23462

mheon · 2024-07-31T18:38:42Z

As the title says

Does this PR introduce a user-facing change?

NONE

Two tests failing in gating but never CI; add some debug instrumentation to make it possible to find out what is going on Signed-off-by: Ed Santiago <santiago@redhat.com>

The test assumes that if more than 1 ip on the host we should be able to set host.containers.internal. This however is not how the logic works in the code. What it actually does is to check all ips in the rootless-netns and then it knows that it cannot use any of these ips. This includes any podman bridge ips. You can reproduce the error when you have only one ipv4 on the host then run a container as root in the background and run the test: hack/bats --rootless 505:host.containers.internal So the failure here was that there was already a podman container running as root on the default bridge thus the test saw 2 ips but then the rootless run also uses the same subnet for its bridge and the code knew that ip would not work either. I could have made another special condition in test but the better way to work around it is to create a new network. A new network will make sure there are no conflicting subnets assigned so the test will pass. Signed-off-by: Paul Holzinger <pholzing@redhat.com>

The tests didn't check anything actually because default_ifname requires an ip version argument to work. Thus pasta_iface was empty, add new checks to prevent this kind of error again. Signed-off-by: Paul Holzinger <pholzing@redhat.com>

This contains a fix for a gvproxy crash on macos on fast connections with heavy network load. This should fix containers#23114 Signed-off-by: Christophe Fergeau <cfergeau@redhat.com>

The value of the pointer might be changed while creating the container causing unexpected side effects. Signed-off-by: Paul Holzinger <pholzing@redhat.com>

We bind ports to ensure there are no conflicts and we leak them into conmon to keep them open. However we bound the ports after the network was set up so it was possible for a second network setup to overwrite the firewall configs of a previous container as it failed only later when binding the port. As such we must ensure we bind before the network is set up. This is not so simple because we still have to take care of PostConfigureNetNS bool in which case the network set up happens after we launch conmon. Thus we end up with two different conditions. Also it is possible that we "leak" the ports that are set on the container until the garbage collector will close them. This is not perfect but the alternative is adding special error handling on each function exit after prepare until we start conmon which is a lot of work to do correctly. Fixes https://issues.redhat.com/browse/RHEL-50746 Signed-off-by: Paul Holzinger <pholzing@redhat.com>

Signed-off-by: Matt Heon <mheon@redhat.com>

Add the `--compat-volumes option from Buildah v1.37 into Podman in preparation of Podman v5.2 Signed-off-by: tomsweeneyredhat <tsweeney@redhat.com>

This commit was automatically cherry-picked by buildah-vendor-treadmill v0.3 from the buildah vendor treadmill PR, containers#13808 /vvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvvv > The git commit message from that PR is below. Please review it, > edit as necessary, then remove this comment block. \^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Changes since 2024-05-21: * document --compat-volumes * Fix conflict caused by Ed's local-registry PR in buildah Signed-off-by: Ed Santiago <santiago@redhat.com> Signed-off-by: tomsweeneyredhat <tsweeney@redhat.com>

When using service containers and play kube we create a complicated set of dependencies. First in a pod all conmon/container cgroups are part of one slice, that slice will be removed when the entire pod is stopped resulting in systemd killing all processes that were part in it. Now the issue here is around the working of stopPodIfNeeded() and stopIfOnlyInfraRemains(), once a container is cleaned up it will check if the pod should be stopped depending on the pod ExitPolicy. If this is the case it wil stop all containers in that pod. However in our flaky test we calle podman pod kill which logically killed all containers already. Thus the logic now thinks on cleanup it must stop the pod and calls into pod.stopWithTimeout(). Then there we try to stop but because all containers are already stopped it just throws errors and never gets to the point were it would call Cleanup(). So the code does not do cleanup and eventually calls removePodCgroup() which will cause all conmon and other podman cleanup processes of this pod to be killed. Thus the podman container cleanup process was likely killed while actually trying to the the proper cleanup which leaves us in a bad state. Following commands such as podman pod rm will try to the cleanup again as they see it was not completed but then fail as they are unable to recover from the partial cleanup state. Long term network cleanup needs to be more robust and ideally should be idempotent to handle cases were cleanup was killed in the middle. Fixes containers#21569 Signed-off-by: Paul Holzinger <pholzing@redhat.com>

Fix up a couple of versions in comments in the pkg/api/server/register_images.go file. Based on comments from containers#23440 Signed-off-by: tomsweeneyredhat <tsweeney@redhat.com>

Signed-off-by: Matt Heon <mheon@redhat.com>

packit-as-a-service · 2024-07-31T18:38:53Z

We were not able to find or create Copr project packit/containers-podman-23462 specified in the config with the following error:

Packit received HTTP 500 Internal Server Error from Copr Service. Check the Copr status page: https://copr.fedorainfracloud.org/status/stats/, or ask for help in Fedora Build System matrix channel https://matrix.to/#/#buildsys:fedoraproject.org.

Unless the HTTP status code above is >= 500, please check your configuration for:

typos in owner and project name (groups need to be prefixed with @)
whether the project name doesn't contain not allowed characters (only letters, digits, underscores, dashes and dots must be used)
whether the project itself exists (Packit creates projects only in its own namespace)
whether Packit is allowed to build in your Copr project
whether your Copr project/group is not private

mheon · 2024-07-31T18:39:06Z

Eeek it's against the wrong branch

Signed-off-by: Matt Heon <mheon@redhat.com>

openshift-ci · 2024-07-31T18:40:57Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: mheon

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [mheon]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

mheon · 2024-07-31T18:41:01Z

Alright, fixed, it's against 5.2 now

mheon · 2024-07-31T19:31:15Z

@containers/podman-maintainers PTAL

TomSweeneyRedHat · 2024-07-31T19:31:25Z

RELEASE_NOTES.md

@@ -2,6 +2,7 @@

 ## 5.2.0
 ### Features
+- Podman now supports `libkrun` as a backend for creating virtual machines on MacOS. The `libkrun` backend has the advantage of allowing GPUs to be mounted into the virtual machine to accelerate tasks. The default backend remains `applehv`.


It's a nit as almost everyone writes it "MacOS", but officially, it's "macOS"

TomSweeneyRedHat · 2024-07-31T19:32:13Z

LGTM
assuming happy tests

mheon · 2024-07-31T20:16:55Z

Restarted one machine flake

baude · 2024-07-31T20:29:35Z

assuming a flake ... be sure it isnt

/lgtm

edsantiago · 2024-07-31T21:13:44Z

assuming a flake ... be sure it isnt

You've been missing out. It's ALWAYS a flake. podman-machine is failing ~ 2 out of 3 runs. Sometimes more.

edsantiago and others added 12 commits July 31, 2024 14:21

CI: system tests: instrument to allow failure analysis

b1ad869

Two tests failing in gating but never CI; add some debug instrumentation to make it possible to find out what is going on Signed-off-by: Ed Santiago <santiago@redhat.com>

build: Update gvisor-tap-vsock to 0.7.4

02a9323

This contains a fix for a gvproxy crash on macos on fast connections with heavy network load. This should fix containers#23114 Signed-off-by: Christophe Fergeau <cfergeau@redhat.com>

pkg/api: do not leak config pointers into specgen

3f14fcf

The value of the pointer might be changed while creating the container causing unexpected side effects. Signed-off-by: Paul Holzinger <pholzing@redhat.com>

Bump Buildah, c/storage, c/image, c/common

8bc4933

Signed-off-by: Matt Heon <mheon@redhat.com>

Add --compat-volumes option to build and farm build

a8f4c12

Add the `--compat-volumes option from Buildah v1.37 into Podman in preparation of Podman v5.2 Signed-off-by: tomsweeneyredhat <tsweeney@redhat.com>

Tweak versions in register_images.go

784856b

Fix up a couple of versions in comments in the pkg/api/server/register_images.go file. Based on comments from containers#23440 Signed-off-by: tomsweeneyredhat <tsweeney@redhat.com>

Update release notes for v5.2.0-rc3

23c6e0f

Signed-off-by: Matt Heon <mheon@redhat.com>

github-actions bot added the kind/api-change Change to remote API; merits scrutiny label Jul 31, 2024

openshift-ci bot added the release-note-none label Jul 31, 2024

mheon changed the base branch from main to v5.2 July 31, 2024 18:39

mheon added 2 commits July 31, 2024 14:40

Bump to v5.2.0-rc3

028bee2

Signed-off-by: Matt Heon <mheon@redhat.com>

Bump to v5.2.0-dev

c83c891

Signed-off-by: Matt Heon <mheon@redhat.com>

openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jul 31, 2024

TomSweeneyRedHat reviewed Jul 31, 2024

View reviewed changes

benoitf mentioned this pull request Jul 31, 2024

feat: update podman to 5.2.0 release podman-desktop/podman-desktop#8306

Merged

1 task

openshift-ci bot assigned baude Jul 31, 2024

openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Jul 31, 2024

openshift-merge-bot bot merged commit 8b246b6 into containers:v5.2 Jul 31, 2024
88 checks passed

This was referenced Aug 1, 2024

machine rm: unable to clean up gvproxy: remove xxxx.pid: The process cannot access the file because it is being used by another process. #23472

Closed

podman-machine uncategorized flakes #22551

Open

stale-locking-app bot added the locked - please file new issue/PR Assist humans wanting to comment on an old issue or PR with locked comments. label Oct 30, 2024

stale-locking-app bot locked as resolved and limited conversation to collaborators Oct 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CI:ALL] Bump to v5.2.0-rc3 #23462

[CI:ALL] Bump to v5.2.0-rc3 #23462

mheon commented Jul 31, 2024

packit-as-a-service bot commented Jul 31, 2024

mheon commented Jul 31, 2024

openshift-ci bot commented Jul 31, 2024

mheon commented Jul 31, 2024

mheon commented Jul 31, 2024

TomSweeneyRedHat Jul 31, 2024

TomSweeneyRedHat commented Jul 31, 2024

mheon commented Jul 31, 2024

baude commented Jul 31, 2024

edsantiago commented Jul 31, 2024

[CI:ALL] Bump to v5.2.0-rc3 #23462

[CI:ALL] Bump to v5.2.0-rc3 #23462

Conversation

mheon commented Jul 31, 2024

Does this PR introduce a user-facing change?

packit-as-a-service bot commented Jul 31, 2024

mheon commented Jul 31, 2024

openshift-ci bot commented Jul 31, 2024

mheon commented Jul 31, 2024

mheon commented Jul 31, 2024

TomSweeneyRedHat Jul 31, 2024

Choose a reason for hiding this comment

TomSweeneyRedHat commented Jul 31, 2024

mheon commented Jul 31, 2024

baude commented Jul 31, 2024

edsantiago commented Jul 31, 2024