test: Move from `restore_dir()` to `podman system reset` for user #1598

martinpitt · 2024-03-01T09:08:30Z

The restore_dir() for podman's data directory is highly problematic:
This interferes with btrfs subvolumes and overlayfs mounts, and often
causes cp failures like

cp: cannot stat '/home/admin/.local/share/containers/storage/overlay/compat3876082856': No such file or directory

So move to podman system reset, and restore the test images
with podman load for each test.

Unfortunately podman system reset defaults to the 10 s wait timeout
(containers/podman#21874), so we still need
the separate rm --time 0 hack. But conceptually that can go away once
that bug is fixed.

This approach would also be nice on the system podman side, but it is super
hard to get right there especially on CoreOS: There we simultaneously want a
thorough cleanup, but also rely on the running cockpit/ws container. It also
collides with the "force unmount everything below /var/lib/containers" hack
that we unfortunately still need for some OSes. But doing it for the user at
least solves half of the problem. The observed failures in the field
all occurred on the user directory, anyway.

Fixes #1591

I tried the "full" approach in #1592, with pieces of it in #1596 and #1597, but this is a mine field 😢

The `restore_dir()` for podman's data directory is highly problematic: This interferes with btrfs subvolumes and overlayfs mounts, and often causes `cp` failures like ``` cp: cannot stat '/home/admin/.local/share/containers/storage/overlay/compat3876082856': No such file or directory ``` So move to `podman system reset`, and restore the test images with `podman load` for each test. Unfortunately `podman system reset` defaults to the 10 s wait timeout (containers/podman#21874), so we still need the separate `rm --time 0` hack. But conceptually that can go away once that bug is fixed. This approach would also be nice on the system podman side, but it is super hard to get right there especially on CoreOS: There we simultaneously want a thorough cleanup, but also rely on the running cockpit/ws container. It also collides with the "force unmount everything below /var/lib/containers" hack that we unfortunately still need for some OSes. But doing it for the user at least solves half of the problem. The observed failures in the field all occurred on the user directory, anyway. Fixes cockpit-project#1591

martinpitt · 2024-03-01T09:45:54Z

This by and large worked, I just forgot the pause container for ubuntu-2204. Fixed, and so it can re-run again to make sure.

jelly

Thanks!

jelly · 2024-03-01T10:26:00Z

test/vm.install

 done
-	loginctl disable-linger $(id -u admin)


Great to cleanup the lingering!

jelly · 2024-03-01T10:27:03Z

test/check-application

+            podman system reset --force
+            """)
+        # HACK: system reset has 10s timeout, make that faster with an extra `stop`
+        # https://github.com/containers/podman/issues/21874


Thanks for reporting that!

martinpitt force-pushed the cleanup-user branch from 5c67e86 to a0aa517 Compare March 1, 2024 09:45

martinpitt marked this pull request as ready for review March 1, 2024 09:46

martinpitt requested a review from jelly March 1, 2024 09:46

jelly approved these changes Mar 1, 2024

View reviewed changes

martinpitt merged commit 9e3d0c7 into cockpit-project:main Mar 1, 2024
30 checks passed

martinpitt deleted the cleanup-user branch March 1, 2024 10:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test: Move from `restore_dir()` to `podman system reset` for user #1598

test: Move from `restore_dir()` to `podman system reset` for user #1598

martinpitt commented Mar 1, 2024 •

edited

Loading

martinpitt commented Mar 1, 2024

jelly left a comment

jelly Mar 1, 2024

jelly Mar 1, 2024

test: Move from restore_dir() to podman system reset for user #1598

test: Move from restore_dir() to podman system reset for user #1598

Conversation

martinpitt commented Mar 1, 2024 • edited Loading

martinpitt commented Mar 1, 2024

jelly left a comment

Choose a reason for hiding this comment

jelly Mar 1, 2024

Choose a reason for hiding this comment

jelly Mar 1, 2024

Choose a reason for hiding this comment

test: Move from `restore_dir()` to `podman system reset` for user #1598

test: Move from `restore_dir()` to `podman system reset` for user #1598

martinpitt commented Mar 1, 2024 •

edited

Loading