
[WIP] libct/cg/sd/v1: Set: don't leave cgroup frozen #3072

Closed
wants to merge 2 commits into from

Conversation

Contributor

@kolyshkin kolyshkin commented Jul 7, 2021

This is the alternative to #3065 with a different approach and the same test case.

I. In case a parent cgroup is frozen, the current code reads the cgroup
state (into targetFreezerState) as FROZEN, and sets it explicitly after
setting unit properties. This is obviously wrong, as we should not
explicitly freeze the cgroup because its parent was frozen.

The issue happens because:

  1. The m.GetFreezerState method does not distinguish between
    the "self frozen" and "parent frozen" states (i.e. it does not
    consult freezer.self_freezing).

  2. The m.Freeze method changes m.cgroups.Resources.Frozen field.

II. The current code does freeze/thaw unconditionally, meaning it
freezes the already frozen cgroup (which is a no-op except it
still requires some reads and writes to freezer.state), and
thaws the cgroup which is about to be frozen.

Solve all this by figuring out whether we need to freeze and thaw,
and introducing and using a method which does not change value of
m.cgroups.Resources.Frozen.

Add a test case (initially developed in #3065 (comment), and tested to fail without the fix in #3066).

NOTE the current code assumes that a container that is currently frozen
will remain frozen for the duration of SetUnitProperties. This might not
be true, but there is no way to guarantee it (even if we freeze the
cgroup ourselves).

kolyshkin and others added 2 commits July 7, 2021 15:35
Reported-by: Odin Ugedal <odin@uged.al>
Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>
Initial test work was done by Kir Kolyshkin (@kolyshkin).

Co-Authored-By: Kir Kolyshkin <kolyshkin@gmail.com>
Signed-off-by: Odin Ugedal <odin@uged.al>
	if targetFreezerState == configs.Undefined {
		targetFreezerState = configs.Thawed
	needsFreeze := true
	if freezerState == configs.Frozen {
Contributor

This could result in a race between the freezing of the current cgroup and one of its ancestors.

If one ancestor is FROZEN, this will not freeze the control group. If the parent is thawed before systemd does its device update sequence, things will not work as expected.

Contributor

This does also not fix the difference between cgroup v1 and v2.

If the parent control group is frozen, and the control group itself is not, GetFreezerState will return FROZEN on v1 and THAWED on v2. I don't think the naming or definition is important, other than the fact that it works the same way on both.

Contributor

Does that make sense to you, @kolyshkin and @cyphar?

Contributor

@odinuge odinuge Jul 8, 2021

But overall, I agree that always thawing the cgroup is a valid way to do it, if people want to freeze it.

However, doing:

	runc pause container
	runc update container
	# Container will now still be paused
	runc resume container

now works as expected, but with your PR it will just resume the container on update, won't it?

edit: that should work as expected

Contributor Author

This could result in a race between the freezing of the current cgroup and one of its ancestors.

You are right, and I anticipated that; it is noted in the commit and PR description.

		t.Fatalf("expected container cgroup path %q to be under pod cgroup path %q",
			cm.Path("freezer"), pm.Path("freezer"))
	}

Contributor

Depending on the definition of GetFreezerState, we should do a check of its output here as well, to make sure it works the same on cgroup v1 and v2.

	cf, err := cm.GetFreezerState()
	if err != nil {
		t.Fatal(err)
	}
	if cf != configs.Thawed {
		t.Fatalf("expected container to be thawed, got %v", cf)
	}

Contributor Author

Good catch; will add.

@kolyshkin
Contributor Author

Closing in favor of #3080 which carries this one.

@kolyshkin kolyshkin closed this Jul 9, 2021