daemon: Use IOSchedulingClass=idle #2164

cgwalters · 2020-07-18T14:34:35Z

Be nice to concurrent processes; operating system updates
are usually a background thing. See e.g.
openshift/machine-config-operator#1897
ostreedev/ostree#2152
This option is most effective in combination with
a block scheduler such as bfq, which is the systemd
default since systemd/systemd#13321

Pairs with: ostreedev/ostree#2152 Be nice to concurrent processes; operating system updates are usually a background thing. See e.g. openshift/machine-config-operator#1897 ostreedev/ostree#2152 This option is most effective in combination with a block scheduler such as `bfq`, which is the systemd default since systemd/systemd#13321

openshift-ci-robot · 2020-07-18T14:34:40Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: cgwalters

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [cgwalters]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

jlebon · 2020-07-20T14:19:54Z

One doubt about applying this globally: on a busy machine, this may delay updates significantly, right? Across a large cluster, this could mean rollouts take much longer. I see in the BFQ docs:

A very thin extra bandwidth is however guaranteed to the Idle class, to prevent it from starving.

So we know that it at least couldn't be delayed indefinitely. I couldn't find a similar statement for CFQ but I suspect it's the same. It makes more sense in the OCP /OKD workflow though where the MCO drains the node first though, but can we make that assumption across the board wherever rpm-ostree is used? (And even there, what if there's some buggy service hogging the disk?)

Hmm, also I think it would make sense to have different profiles depending on whether it's initiated by another service, vs by a user interactively at the CLI?

All this to say; WDYT about making this configurable at the transaction level instead? (Or at worse, just make the MCO configure the service via a systemd drop-in).

Also in your tests, how does IDLE compare to BE at e.g. level 7 wrt I/O latencies? Would that be a more reasonable upstream default?

cgwalters · 2020-07-20T15:07:19Z

I'm OK changing this to be done via drop-in in MCO instead. It is hard to have a global default, and I do think ultimately cgroups v2 is the right solution for this.

openshift-ci-robot requested review from jlebon and miabbott July 18, 2020 14:34

openshift-ci-robot added the approved label Jul 18, 2020

cgwalters mentioned this pull request Jul 18, 2020

Bug 1850057: stage OS updates (nicely) while etcd is still running openshift/machine-config-operator#1897

Closed

cgwalters closed this Jul 20, 2020

cgwalters mentioned this pull request Jul 29, 2020

Bug 1850057: Use bfq scheduler on control plane, idle I/O for rpm-ostreed openshift/machine-config-operator#1957

Merged

jlebon mentioned this pull request Jan 18, 2021

Niceness configuration option #2450

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

daemon: Use IOSchedulingClass=idle #2164

daemon: Use IOSchedulingClass=idle #2164

cgwalters commented Jul 18, 2020

openshift-ci-robot commented Jul 18, 2020

jlebon commented Jul 20, 2020

cgwalters commented Jul 20, 2020

daemon: Use IOSchedulingClass=idle #2164

daemon: Use IOSchedulingClass=idle #2164

Conversation

cgwalters commented Jul 18, 2020

openshift-ci-robot commented Jul 18, 2020

jlebon commented Jul 20, 2020

cgwalters commented Jul 20, 2020