Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix SingleNodeDiscoveryStabilisesEvenWhenDisrupted #91325

Merged

Conversation

kingherc
Copy link
Contributor

@kingherc kingherc commented Nov 4, 2022

By increasing the voting duration in case of high
delays, to avoid the possible endless repetition
of voting rounds.

Fixes #89867

By increasing the voting duration in case of high
delays, to avoid the possible endless repetition
of voting rounds.

Fixes elastic#89867
@kingherc kingherc added >test-failure Triaged test failures from CI :Distributed Coordination/Cluster Coordination Cluster formation and cluster state publication, including cluster membership and fault detection. Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. labels Nov 4, 2022
@kingherc kingherc self-assigned this Nov 4, 2022
@kingherc kingherc marked this pull request as ready for review November 4, 2022 20:55
@elasticsearchmachine
Copy link
Collaborator

Pinging @elastic/es-distributed (Team:Distributed)

@kingherc kingherc requested a review from DaveCTurner November 7, 2022 09:40
Copy link
Contributor

@DaveCTurner DaveCTurner left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I expect this will eventually fail because an election involves a couple of round-trips through the transport layer so should sometimes take as much as 4*delayVariabilityMillis (possibly longer, I didn't look for other delays). However I've run over 30,000 iterations of this test without seeing a failure, so maybe I'm misunderstanding or maybe with this delay a bad collision just becomes very unlikely. So empirically this LGTM (except for a couple of nits).

kingherc and others added 2 commits November 7, 2022 14:06
…ordinatorTests.java

Co-authored-by: David Turner <david.turner@elastic.co>
@kingherc kingherc merged commit d83d19a into elastic:main Nov 7, 2022
@kingherc kingherc deleted the test-failure/89867-leaders-again-empty branch November 7, 2022 12:56
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
:Distributed Coordination/Cluster Coordination Cluster formation and cluster state publication, including cluster membership and fault detection. Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. >test-failure Triaged test failures from CI v8.6.0
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Failure in CoordinatorTests.testSingleNodeDiscoveryStabilisesEvenWhenDisrupted
3 participants