-
Notifications
You must be signed in to change notification settings - Fork 361
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[main] Update dependencies from dotnet/xliff-tasks dotnet/arcade #11954
[main] Update dependencies from dotnet/xliff-tasks dotnet/arcade #11954
Conversation
…20221213.1 Microsoft.DotNet.XliffTasks From Version 1.0.0-beta.22612.1 -> To Version 1.0.0-beta.22613.1
…213.4 Microsoft.DotNet.Arcade.Sdk , Microsoft.DotNet.Build.Tasks.Feed , Microsoft.DotNet.Helix.Sdk , Microsoft.DotNet.SignTool , Microsoft.DotNet.SwaggerGenerator.MSBuild From Version 8.0.0-beta.22612.4 -> To Version 8.0.0-beta.22613.4
@dotnet/dnceng any idea why this is failing? The job seems to have returned 0 |
Taking a look. |
This is super weird. I took notes and details are below, but the TL;DR of it is that Service Bus allowed re-delivery of the message of this work item during the 28 minutes the first attempt was hanging and timing out, and due to random luck the 2nd attempt succeeded before the first one did. There's not much we can do about this, since we didn't even log losing the lock on the work item in the first attempt; this is just a transient bad behavior from Service bus and the mitigation is to retry. If this starts happening frequently, we should open an IcM with the Service Bus team. Details: Since we never expect to have to handle erroneous multi-delivery of the same work item's attempts, and especially not the 2nd attempt finishing before the 1st attempt, both the times in the Kusto DB are wrong (they show both attempts starting at the same time) and the pass/fail API is wrong (because it gets the last-finished version of the work item, in this case the first attempt) Based off what we see in https://helix.dot.net/api/jobs/79079c0d-2a7e-47f4-9007-eb1989d81b3f/workitems/System.Numerics.Vectors.Tests?api-version=2019-06-17, the work item definitely is considered passed (it timed out on attempt #1, then got retried and passed in the log you've noted.) However: In the log from attempt 1, on dci-mac-build-014 Execution runs from 2022-12-14T14:21:39.911Z to 2022-12-14T14:49:50.244Z (quite a long timeout) In the log from attempt 2, on dci-mac-build-188: Execution runs from 2022-12-14T14:48:59.598Z until 2022-12-14T14:49:49.520Z, finishing literally a second before the first attempt. SInce it finished last, the first attempt is considered the "last" attempt in the eyes of the pass/fail API (we might be able to tighten up its query to include the max attempt?) and thus a call to |
Ah, there was one more twist here, specifically the machine was messed up, detected it, and tried to reboot (which would have prevented the -3 exit code from finishing), but the machine telling itself to reboot didn't actually work:
(... goes on to not reboot) I'll see if I can figure out why this is. |
Ah, interesting! I did smell something fishy. I then noticed the other console log so I was thinking a retry did happen at some point. |
This pull request updates the following dependencies
From https://github.com/dotnet/xliff-tasks
From https://github.com/dotnet/arcade