-
Notifications
You must be signed in to change notification settings - Fork 6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CI] linux://doc/source/tune/examples:bohb_example
is failing/flaky on master.
#35428
Comments
Initial investigation: Still need to figure out how to reproduce this locally, but here are some questions:
Would this happen every time we get to this 3rd case in
|
I think this is the right track! So we
When the reset is resolved, it triggers the unstage and that fails. Notice that in both other cases (1 and 2) we add the trials to It might be that this only comes up if we have more than 2 cached actors. Generally I think the main reason we didn't notice this so far is that the situation is relatively unique, maybe even specifically to BOHB. |
See an example failure here: https://buildkite.com/ray-project/oss-ci-build-branch/builds/3997#018830f1-91f5-4f3d-a4ae-7ee8abf313ab
This is the flaky example: https://docs.ray.io/en/latest/tune/examples/bohb_example.html
Note that this flakiness is DIFFERENT from the recent
tune/bohb_example.py
flakiness. This issue is tracking flakiness that has been present for longer.Stack trace:
Todo
Oldest flakiness that I found (when this issue was created)
....
Generated from flaky test tracker. Please do not edit the signature in this section.
DataCaseName-linux://doc/source/tune/examples:bohb_example-END
....
The text was updated successfully, but these errors were encountered: