Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Avoid SIGKILL at shutdown #56

Merged
merged 1 commit into from
May 3, 2024
Merged

Avoid SIGKILL at shutdown #56

merged 1 commit into from
May 3, 2024

Conversation

DavidePrincipi
Copy link
Member

@DavidePrincipi DavidePrincipi commented May 3, 2024

The default lifeCycle.graceTimeOut is 10 seconds and races with Podman stop period: increase Podman wait, to avoid killing Traefik with signal 9.

May 03 08:35:16 rl1.dp.nethserver.net systemd[30247]: Stopping Traefik edge proxy...
May 03 08:35:16 rl1.dp.nethserver.net traefik[39585]: time="2024-05-03T08:35:16Z" level=info msg="I have to go..."
May 03 08:35:16 rl1.dp.nethserver.net traefik[39585]: time="2024-05-03T08:35:16Z" level=info msg="Stopping server gracefully"
May 03 08:35:16 rl1.dp.nethserver.net traefik[39585]: time="2024-05-03T08:35:16Z" level=error msg="accept tcp [::]:443: use of closed network connection" entryPointName=https
May 03 08:35:16 rl1.dp.nethserver.net traefik[39585]: time="2024-05-03T08:35:16Z" level=error msg="accept tcp [::]:80: use of closed network connection" entryPointName=http
May 03 08:35:16 rl1.dp.nethserver.net traefik[39585]: time="2024-05-03T08:35:16Z" level=error msg="Error while starting server: accept tcp [::]:80: use of closed network connection" entryPoi>
May 03 08:35:16 rl1.dp.nethserver.net traefik[39585]: time="2024-05-03T08:35:16Z" level=error msg="Error while starting server: accept tcp [::]:443: use of closed network connection" entryPo>
May 03 08:35:26 rl1.dp.nethserver.net traefik[39585]: time="2024-05-03T08:35:26Z" level=info msg="Server stopped"
May 03 08:35:26 rl1.dp.nethserver.net traefik[39585]: time="2024-05-03T08:35:26Z" level=info msg="Shutting down"
May 03 08:35:26 rl1.dp.nethserver.net traefik[39585]: 80.17.99.73 - - [03/May/2024:08:34:26 +0000] "GET /cluster-admin/ws HTTP/1.1" 0 0 "-" "-" 125 "ApiServer-https@file" "http://127.0.0.1:9>
May 03 08:35:26 rl1.dp.nethserver.net traefik1[39736]: time="2024-05-03T08:35:26Z" level=warning msg="StopSignal SIGTERM failed to stop container traefik in 10 seconds, resorting to SIGKILL"
May 03 08:35:26 rl1.dp.nethserver.net conmon[39585]: conmon 42a3e3222a19daf1ba16 <nwarn>: Failed to open cgroups file: /sys/fs/cgroup/user.slice/user-1001.slice/user@1001.service/user.slice/>
May 03 08:35:26 rl1.dp.nethserver.net traefik1[39736]: 42a3e3222a19daf1ba169ec671c7990134ac3ce26bd479f24451bc8c43d42ff0
May 03 08:35:26 rl1.dp.nethserver.net traefik1[39755]: 42a3e3222a19daf1ba169ec671c7990134ac3ce26bd479f24451bc8c43d42ff0
May 03 08:35:26 rl1.dp.nethserver.net systemd[30247]: Stopped Traefik edge proxy.

Refs NethServer/dev#6912

See also https://doc.traefik.io/traefik/routing/entrypoints/#lifecycle

The default `lifeCycle.graceTimeOut` is 10 seconds and races with Podman
stop period: increase Podman wait, to avoid killing Traefik with signal
9.
@DavidePrincipi DavidePrincipi requested a review from Tbaile May 3, 2024 08:55
@DavidePrincipi DavidePrincipi merged commit 7a307e9 into main May 3, 2024
1 check passed
@DavidePrincipi DavidePrincipi deleted the fix-shutdown branch May 3, 2024 09:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants