Cluster coordination indicator - report if the master is stable and an impact/troubleshoot guide otherwise #85624

Closed
17 tasks done
andreidan opened this issue Apr 1, 2022 · 3 comments

@andreidan
Contributor

andreidan commented Apr 1, 2022

Each node needs to keep track of the last few master node changes (keeping this in memory should be fine) and its local node information (a node might see the master node flapping while the master node itself is fine). For example, if the master node has changed more than 3 times in the last 30 minutes, it is not stable. Otherwise, there is nothing to report.
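
To make the stability rule concrete, here is a minimal sketch of an in-memory tracker, assuming the "more than 3 changes in 30 minutes" threshold from the example above. The class name `MasterChangeTracker` and its methods are hypothetical and are not taken from the Elasticsearch code base. It also assumes the first observation a node makes is not counted as a change, and that callers handle any synchronization themselves.

```java
import java.time.Duration;
import java.time.Instant;
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.Objects;

/** Hypothetical in-memory record of the master changes seen by one node (not thread-safe). */
final class MasterChangeTracker {
    private final Duration window;   // e.g. 30 minutes
    private final int maxChanges;    // e.g. 3; more changes than this means "not stable"
    private final Deque<Instant> changeTimes = new ArrayDeque<>();
    private String currentMasterId;  // null means this node currently sees no master
    private boolean hasObservation;  // assumption: the very first observation is not a change

    MasterChangeTracker(Duration window, int maxChanges) {
        this.window = Objects.requireNonNull(window);
        this.maxChanges = maxChanges;
    }

    /** Records the master this node sees at {@code now}; counts a change only if the id differs. */
    void onMasterObserved(String masterId, Instant now) {
        if (hasObservation && !Objects.equals(currentMasterId, masterId)) {
            changeTimes.addLast(now);
        }
        currentMasterId = masterId;
        hasObservation = true;
        prune(now);
    }

    /** Returns true if the master changed at most {@code maxChanges} times within the window. */
    boolean isStable(Instant now) {
        prune(now);
        return changeTimes.size() <= maxChanges;
    }

    /** Drops change timestamps that are older than the window. */
    private void prune(Instant now) {
        Instant cutoff = now.minus(window);
        while (!changeTimes.isEmpty() && changeTimes.peekFirst().isBefore(cutoff)) {
            changeTimes.removeFirst();
        }
    }
}
```

With `new MasterChangeTracker(Duration.ofMinutes(30), 3)`, three changes inside the window still report stable, and a fourth flips `isStable` to false until the old entries age out.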

The coordinating node might have to contact a master eligible node which in turn might have to contact other master eligible nodes, but this is the worst-case scenario and definitely does not involve fanning out to all nodes.

If the node has a master according to the definition above:

If the node does not have a master node:

@elasticmachine
Collaborator

Pinging @elastic/es-data-management (Team:Data Management)

@masseyke
Member

masseyke commented Jul 5, 2022

Adding the diagram we're using to drive this work.
[Image: 2022-04-18 Coordination health]

@masseyke
Member

masseyke commented Sep 1, 2022

Closing this, but with follow-up feedback from @dakrone:

  1. We should not use dynamic keys (node IDs) in the cluster_formation map in the details of the result.
  2. We need to check that the selection of the master-eligible node, when polling master-eligible nodes, is truly random (we're currently using getMasterEligibleNodes().stream().findAny()).
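
On the second point: `Stream.findAny()` is documented as free to return any element, and it makes no promise of randomness (on a sequential stream it typically returns the first element encountered). A minimal sketch of an explicitly random pick follows, assuming the candidates are available as a collection. `RandomNodePicker` and `pickRandom` are hypothetical names used for illustration, not existing Elasticsearch APIs.

```java
import java.util.ArrayList;
import java.util.Collection;
import java.util.List;
import java.util.Optional;
import java.util.Random;

/** Hypothetical helper: picks one candidate uniformly at random instead of relying on findAny(). */
final class RandomNodePicker {
    private RandomNodePicker() {}

    /**
     * Returns a uniformly random element of {@code candidates}, or empty if there are none.
     * Assumes the collection contains no null elements.
     */
    static <T> Optional<T> pickRandom(Collection<T> candidates, Random random) {
        if (candidates.isEmpty()) {
            return Optional.empty();
        }
        // Copy to a list so we can index into it regardless of the collection type.
        List<T> snapshot = new ArrayList<>(candidates);
        return Optional.of(snapshot.get(random.nextInt(snapshot.size())));
    }
}
```

Passing the `Random` in as a parameter keeps the selection reproducible in tests when a seeded instance is supplied.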

masseyke closed this as completed Sep 1, 2022