Reduce connection timeout for intra-cluster connections #29022
Comments
Pinging @elastic/es-distributed
We assumed it would be simpler to change the profiles used by NodeConnectionService and the pinging, but we should double check. Maybe it is better to reduce
Altering the profile used by the
We discussed this and decided it's likely that connection management is at risk of broader changes than this, so it makes sense to park this for now. To be revisited when the dust has settled.
A 30-second timeout for establishing node-to-node connections within a cluster is unreasonably long. We need longer timeouts for connections out of the cluster, so simply reducing transport.tcp.connect_timeout isn't feasible, but a separate connection profile for node-to-node connections with a separately configurable timeout would mean that attempts to connect to an unresponsive node would be able to fail much more quickly.

Relates to #28920, in which cluster state application is blocked for multiple minutes while repeated attempts to connect to unresponsive nodes take place.
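
To make the idea of a separate profile concrete, here is a rough sketch in Python rather than the Elasticsearch Java code. Everything in it is hypothetical: ConnectionProfile, profile_for and worst_case_wait_s are illustrative names, not existing classes or settings, and the 5 second intra-cluster value is an assumption picked only for the example. The 30 second default is the figure quoted in this issue.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ConnectionProfile:
    """Hypothetical stand-in for a connection profile (not the Elasticsearch class)."""
    name: str
    connect_timeout_s: float


# 30 s is the timeout cited in this issue; it stays in place for
# connections that leave the cluster.
DEFAULT_PROFILE = ConnectionProfile("default", connect_timeout_s=30.0)

# Assumed value, chosen only for illustration.
INTRA_CLUSTER_PROFILE = ConnectionProfile("intra_cluster", connect_timeout_s=5.0)


def profile_for(is_cluster_node: bool) -> ConnectionProfile:
    """Pick the profile by destination: cluster nodes get the shorter timeout."""
    return INTRA_CLUSTER_PROFILE if is_cluster_node else DEFAULT_PROFILE


def worst_case_wait_s(profile: ConnectionProfile, sequential_attempts: int) -> float:
    """Upper bound on waiting time if every attempt runs to its timeout in sequence."""
    return profile.connect_timeout_s * sequential_attempts
```

With an assumed four sequential attempts against unresponsive nodes, worst_case_wait_s(DEFAULT_PROFILE, 4) gives 120 seconds, while worst_case_wait_s(profile_for(True), 4) gives 20 seconds. The out-of-cluster timeout is untouched in both cases.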