`KubernetesPipelineTest.errorPod` flake revealed bug in Reaper #1417
While checking why #1415 did not automatically deploy, I saw that the trunk commit failed with a flake in `KubernetesPipelineTest.errorPod`.
I found that the same failure could be reproduced easily:
Turns out the eager hard-kill was preventing the normal event sequence from being observed:
The log lines being asserted were introduced in #1050. It appears that #1118 prevented them from being shown in some cases, but because of the odd structure of the test this regression was not caught. This change fixes the production code to print the failing pod's log lines before cancelling the queue item (which aborts the `node` step and thus also `podTemplate`), and improves the test to wait for the build to terminate naturally.
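
To illustrate why the ordering matters, here is a minimal, self-contained sketch. It is not the plugin's actual code: `BuildLog`, `QueueItem`, and `onPodFailure` are hypothetical stand-ins, and the assumption that the build log stops accepting output once the item is cancelled is a simplification of the real abort behavior.

```java
import java.util.ArrayList;
import java.util.List;

public class ReaperOrderingSketch {

    /** Hypothetical build log; assumed to drop output once the build has been aborted. */
    static class BuildLog {
        private final List<String> lines = new ArrayList<>();
        private boolean closed;

        void println(String line) {
            if (!closed) {
                lines.add(line);
            }
        }

        void close() {
            closed = true;
        }

        List<String> lines() {
            return lines;
        }
    }

    /** Hypothetical queue item; cancelling it aborts the build and closes its log. */
    static class QueueItem {
        private final BuildLog log;
        private boolean cancelled;

        QueueItem(BuildLog log) {
            this.log = log;
        }

        void cancel() {
            cancelled = true;
            log.close();
        }

        boolean isCancelled() {
            return cancelled;
        }
    }

    /** Print the failing pod's log lines first, and only then cancel the queue item. */
    static void onPodFailure(BuildLog log, QueueItem item, List<String> podLogLines) {
        for (String line : podLogLines) {
            log.println(line);
        }
        item.cancel();
    }

    public static void main(String[] args) {
        BuildLog log = new BuildLog();
        QueueItem item = new QueueItem(log);
        onPodFailure(log, item, List.of("hypothetical pod log line 1", "hypothetical pod log line 2"));
        log.lines().forEach(System.out::println);
    }
}
```

In this model, swapping the two steps in `onPodFailure` would leave the log empty, which is the kind of regression a test can only catch if it waits for the build to finish on its own rather than killing it early.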