This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

The runner has received a shutdown signal. #6709

Closed
2 of 11 tasks
JakubMosakowski opened this issue Dec 7, 2022 · 37 comments
Labels: bug report, investigate (collect additional information, like space on disk, other tool incompatibilities etc.), OS: Ubuntu

Comments

@JakubMosakowski

Description

Since yesterday, our GitHub Actions builds have been failing randomly (we haven't changed anything in our configuration). Unfortunately, the error message is not very specific.

The process stops at random stages of the build (but always after at least 15 minutes or so). Even when the build passes, it takes much longer than before (a clean build went from ~25 min to ~35 min).

2022-12-07T10:18:10.5771753Z ##[error]The runner has received a shutdown signal. This can happen when the runner service is stopped, or a manually started runner is canceled.
2022-12-07T10:18:10.7098386Z ##[error]The operation was canceled.
2022-12-07T10:18:10.7710701Z Cleaning up orphan processes
2022-12-07T10:18:10.8338404Z Terminate orphan process: pid (1849) (java)
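
When triaging a failed run, it can help to pull the shutdown timestamp and the list of orphaned processes out of the raw log. The following is a minimal Python sketch, assuming the log uses exactly the wording quoted above. The function name `summarize_shutdown` and both regular expressions are illustrative and are not part of any runner tooling; other runner versions may word these lines differently.

```python
import re

# Patterns derived from the log lines quoted in this issue (an assumption:
# other runner versions may phrase these messages differently).
SHUTDOWN_RE = re.compile(
    r"^(\S+) ##\[error\]The runner has received a shutdown signal"
)
ORPHAN_RE = re.compile(r"Terminate orphan process: pid \((\d+)\) \((.+?)\)\s*$")


def summarize_shutdown(log_text):
    """Return (shutdown timestamp or None, list of (pid, name) orphans)."""
    shutdown_at = None
    orphans = []
    for raw_line in log_text.splitlines():
        line = raw_line.strip()
        match = SHUTDOWN_RE.search(line)
        if match and shutdown_at is None:
            # Keep only the first shutdown signal seen in the log.
            shutdown_at = match.group(1)
        match = ORPHAN_RE.search(line)
        if match:
            orphans.append((int(match.group(1)), match.group(2)))
    return shutdown_at, orphans
```

Feeding it the excerpt above would report the `java` process (pid 1849) as the one still alive when the runner was stopped, which is consistent with a long-running Gradle or JVM build.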

Sometimes, before the shutdown signal, the log also contains this line:
Idle daemon unexpectedly exit. This should not happen.

The workflow passes normally on shorter builds (for example, those that use the cache).

Platforms affected

  • Azure DevOps
  • GitHub Actions - Standard Runners
  • GitHub Actions - Larger Runners

Runner images affected

  • Ubuntu 18.04
  • Ubuntu 20.04
  • Ubuntu 22.04
  • macOS 10.15
  • macOS 11
  • macOS 12
  • Windows Server 2019
  • Windows Server 2022

Image version and build link

Image: ubuntu-22.04
Version: 20221127.1
Current runner version: '2.299.1'

Unfortunately, it happens in a private repo.

Is it regression?

No

Expected behavior

Job should pass

Actual behavior

Job fails

Repro steps

Looks similar to: #6680

erik-bershel added the OS: Ubuntu and investigate labels and removed the needs triage label on Dec 7, 2022
@erik-bershel
Contributor

@JakubMosakowski we cannot investigate without additional information.
I see that your machine received the shutdown signal. Most often, this means that the resources consumed by the process exceeded the runner's limits. In principle, we can check whether that is the case, but we need to see an example of the pipeline that caused the outage and links to the failed runs, even if they belong to a private repository.
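
One way to gather the resource data mentioned above is to sample memory usage while the job runs. The sketch below is a hypothetical helper, not a GitHub-provided tool: it parses `/proc/meminfo` (so it is Linux only) and prints the share of RAM in use at a fixed interval. The names `parse_meminfo`, `used_percent`, and `log_memory` are assumptions made for illustration.

```python
import time


def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of integer values (kB)."""
    values = {}
    for line in text.splitlines():
        key, sep, rest = line.partition(":")
        fields = rest.split()
        if sep and fields and fields[0].isdigit():
            values[key.strip()] = int(fields[0])
    return values


def used_percent(info):
    """Share of RAM in use, computed from MemTotal and MemAvailable."""
    total = info["MemTotal"]
    return 100.0 * (total - info["MemAvailable"]) / total


def log_memory(samples=3, interval_s=30.0, path="/proc/meminfo"):
    """Print a timestamped memory reading `samples` times (Linux only)."""
    for i in range(samples):
        with open(path) as f:
            info = parse_meminfo(f.read())
        stamp = time.strftime("%H:%M:%S")
        print(f"{stamp} memory used: {used_percent(info):.1f}%", flush=True)
        if i < samples - 1:
            time.sleep(interval_s)
```

Started in the background from an early step, for example with a large `samples` value, it would leave a trail in the job log showing whether memory was climbing toward the limit just before the shutdown signal arrived. It does not cover CPU or disk, which may need separate checks.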

@JakubMosakowski
Author

The interesting part is that it doesn't seem to be related to any of our changes. I created a branch with the last X commits reverted (back to a point in history where our builds ran smoothly), and builds on that branch no longer pass either.

@mvarrieur

We are also seeing this after upgrading our self-hosted runners from 20.04 to 22.04, with no other seemingly related changes. Do the 22.04 runners have more conservative limits even when self-hosted?

@ihor-panasiuk95

The same is happening to us in a private repo.
Builds started to fail randomly with this error:
[image]

We didn't make any significant changes to our workflows.

@erik-bershel
Contributor

Hi @ihor-panasiuk95, please send me links to workflow runs with both positive and negative results.

@ihor-panasiuk95

@erik-bershel will you be able to open them, given that they are in a private repo?

@erik-bershel
Contributor

@ihor-panasiuk95 that's not a problem. As a first step, there is no need to look at what is going on inside your private repository. I want to check the load on the agents and compare successful and failed jobs. If that information is not enough, we will discuss repro steps. A link in this form is sufficient: https://github.com/owner/repo/actions/runs/runID or https://github.com/erik-bershel/erik-tests/actions/runs/3680567148.

@ihor-panasiuk95

ihor-panasiuk95 commented Dec 21, 2022

@erik-bershel
Negative - https://github.com/anecdotes-ai/frontend/actions/runs/3742047670
Positive (I replaced ubuntu-latest with ubuntu-22.04 and it started to work) - https://github.com/anecdotes-ai/frontend/actions/runs/3748531101

@qhy040404

I find that this issue only occurs when using ubuntu-latest, which currently means ubuntu-22.04. It doesn't happen when using ubuntu-20.04.
Negative - https://github.com/qhy040404/LibChecker/actions/runs/3747087072
Positive - https://github.com/qhy040404/LibChecker/actions/runs/3748900876/jobs/6366765819
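
The workaround reported here amounts to replacing the floating `ubuntu-latest` label with an explicit image in each job's `runs-on` key. As a rough sketch, the hypothetical helper below (`pin_runner` is not an existing tool) rewrites workflow text line by line. It assumes the simple scalar form `runs-on: ubuntu-latest` and would not catch quoted values, matrix expressions, or list syntax.

```python
import re

# Matches only the plain scalar form, optionally followed by a comment.
RUNS_ON_LATEST = re.compile(
    r"^([ \t]*runs-on:[ \t]*)ubuntu-latest([ \t]*(?:#.*)?)$",
    re.MULTILINE,
)


def pin_runner(workflow_text, image="ubuntu-20.04"):
    """Replace `runs-on: ubuntu-latest` with an explicit image label."""
    return RUNS_ON_LATEST.sub(
        lambda m: m.group(1) + image + m.group(2),
        workflow_text,
    )


if __name__ == "__main__":
    sample = "jobs:\n  build:\n    runs-on: ubuntu-latest\n    steps: []\n"
    print(pin_runner(sample))
```

Pinning trades automatic image upgrades for predictability, so it is worth confirming that the chosen label is still offered for hosted runners before relying on it.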

meain added a commit to alcionai/corso that referenced this issue Jan 3, 2023
Previously Github Actions was killing us resulting in the
Publish-Binary job failing with the following error message:

The runner has received a shutdown signal.

Related: actions/runner-images#6709
aviator-app bot pushed a commit to alcionai/corso that referenced this issue Jan 4, 2023
## Description

Previously Github Actions was killing (probably because gorelease was using up quite a bit of resources) the process resulting in the Publish-Binary job failing with the following error message:

```
The runner has received a shutdown signal.
```

Related: actions/runner-images#6709

<!-- Insert PR description-->

## Does this PR need a docs update or release note?

- [ ] ✅ Yes, it's included
- [ ] 🕐 Yes, but in a later PR
- [x] ⛔ No 

## Type of change

<!--- Please check the type of change your PR introduces: --->
- [ ] 🌻 Feature
- [ ] 🐛 Bugfix
- [ ] 🗺️ Documentation
- [ ] 🤖 Test
- [x] 💻 CI/Deployment
- [ ] 🐹 Trivial/Minor

## Issue(s)

<!-- Can reference multiple issues. Use one of the following "magic words" - "closes, fixes" to auto-close the Github issue. -->
* fixes #2014

## Test Plan

<!-- How will this be tested prior to merging.-->
- [x] 💪 Manual
- [ ] ⚡ Unit test
- [ ] 💚 E2E
shonfeder pushed a commit to apalache-mc/apalache that referenced this issue Sep 18, 2023
Workaround for actions/runner-images#6709
to address the flaky CI failures we've been experiencing.
naik-aakash added a commit to naik-aakash/LobsterPy that referenced this issue Oct 27, 2023
Try changing ubuntu version as per actions/runner-images#6709
cmonfortep added a commit to duckduckgo/Android that referenced this issue Oct 30, 2023
Task/Issue URL: https://app.asana.com/0/488551667048375/1205793949269201/f

### Description
Uses ubuntu-20.04 instead of latest to avoid issues on ci

More info: actions/runner-images#6709

### Steps to test this PR

_Feature 1_
- [ ]
- [ ]

### UI changes
| Before  | After |
| ------ | ----- |
!(Upload before screenshot)|(Upload after screenshot)|
joshliebe pushed a commit to duckduckgo/Android that referenced this issue Nov 7, 2023
fallwith added a commit to newrelic/newrelic-ruby-agent that referenced this issue Nov 21, 2023
lassepe added a commit to lassepe/peters2024ral-code that referenced this issue Feb 14, 2024
pdoerner added a commit to temporalio/temporal that referenced this issue May 30, 2024
## What changed?
<!-- Describe what has changed in this PR -->
Downgrading Github `All Tests` actions to use `ubuntu-20.04` instead of
`ubuntu-latest`

## Why?
<!-- Tell your future self why have you made these changes -->
It seems we started hitting an issue with Github's action runners. These
errors seem to appear most often when runners are killed because they
are using too much CPU or memory, but we have been unable to
conclusively say what might have caused an increase in resource usage.

Downgrading the ubuntu version worked for others and seems to be working
for us.

For more info:
* actions/runner-images#6709
* actions/runner-images#7188
* Users reported a similar issue with ubuntu-24.04 runners (not yet GA):
actions/runner-images#9848

## How did you test it?
<!-- How have you verified this change? Tested locally? Added a unit
test? Checked in staging env? -->
Reran all PR check jobs several times

## Potential risks
<!-- Assuming the worst case, what can be broken when deploying this
change to production? -->

## Documentation
<!-- Have you made sure this change doesn't falsify anything currently
stated in `docs/`? If significant
new behavior is added, have you described that in `docs/`? -->

## Is hotfix candidate?
<!-- Is this PR a hotfix candidate or does it require a notification to
be sent to the broader community? (Yes/No) -->
BlurryLight added a commit to BlurryLight/pbrt3-fork that referenced this issue Jul 28, 2024
shenxianpeng added a commit to cpp-linter/clang-tools-static-binaries that referenced this issue Sep 27, 2024
ligurio added a commit to ligurio/lua-c-api-tests that referenced this issue Nov 25, 2024
GitHub action builds started to randomly fail with a reason:
"The runner has received a shutdown signal.".
The issue [1] and discussion [2] points to a workaround:
replace a "ubuntu-latest" with "ubuntu-22.04" and it started to
work.

The patch replaces "ubuntu-latest" with "ubuntu-22.04" in CFL
workflows.

1. actions/runner-images#6709
2. actions/runner-images#7188
ligurio added a commit to ligurio/lua-c-api-tests that referenced this issue Nov 25, 2024
GitHub action builds started to randomly fail with a reason:
"The runner has received a shutdown signal.".
The issue [1] and discussion [2] points to a workaround:
replace a "ubuntu-latest" with "ubuntu-20.04" and it started to
work. Note that the problem reproduce with "ubuntu-22.04" as well.

The patch replaces "ubuntu-latest" with "ubuntu-20.04" in CFL
workflows.

1. actions/runner-images#6709
2. actions/runner-images#7188

17 participants