Pull Queries: Heartbeat: Don't log errors when heartbeating dead nodes #4807

vpapavas · 2020-03-17T21:47:33Z

Is your feature request related to a problem? Please describe.
KsqlTarget LL:202 logs errors when an async http request fails. The HeartbeatAgent of a server keeps sending heartbeats to all previously discovered servers, whether they are alive or dead. If a server is dead, the async request fails and returns an error code which is logged. This spams the log as 1) heartbeating happens very often and 2) these are not really errors so we don't need them in the log.

Background on why the heartbeat agent sends heartbeats to servers that are dead: Assume servers A and B. On startup, if server B is delayed for some reason and does not send heartbeats, A will mark it as dead. If the agent was not sending heartbeats to dead server, then A would not send to B, hence B would mark A as dead as well. Now, both nodes have marked each other as dead and none sends heartbeats to the other. So, we have essentially a deadlock. That is why servers send heartbeats even to servers that are dead.

Describe the solution you'd like
Do not log errors when heartbeats fail if a server is down.

The text was updated successfully, but these errors were encountered:

vpapavas added the enhancement label Mar 17, 2020

vpapavas added this to the 0.9.0 milestone Mar 17, 2020

vpapavas assigned AlanConfluent Mar 17, 2020

AlanConfluent mentioned this issue Mar 17, 2020

fix: Removes unnecessary error logging for heartbeat since this is ex… #4809

Merged

2 tasks

AlanConfluent closed this as completed in #4809 Mar 19, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pull Queries: Heartbeat: Don't log errors when heartbeating dead nodes #4807

Pull Queries: Heartbeat: Don't log errors when heartbeating dead nodes #4807

vpapavas commented Mar 17, 2020

Pull Queries: Heartbeat: Don't log errors when heartbeating dead nodes #4807

Pull Queries: Heartbeat: Don't log errors when heartbeating dead nodes #4807

Comments

vpapavas commented Mar 17, 2020