
memberlist gossip #1389

Merged — 6 commits merged into master on Jun 8, 2018

Conversation

stuartnelson3 (Contributor):

meant to address #1387

One current issue: peer0 has a few notifications queued but hasn't cleared them in over an hour. We probably need some way of clearing old messages.

The method for choosing RetransmitMult is also something to be discussed. It's an important value, but I'm wary of exposing too many configuration options in alertmanager to alter memberlist. I think we need to come up with a "smart" value that also doesn't overwhelm the transmit queues.

@simonpasquier (Member) left a comment:

Do we want to re-gossip new/updated silences too?

@@ -107,7 +107,7 @@ func Join(
 	readyc: make(chan struct{}),
 	logger: l,
 }
-p.delegate = newDelegate(l, reg, p)
+p.delegate = newDelegate(l, reg, p, len(knownPeers)/2)
simonpasquier (Member):

nit but I would do the computation here rather than add a new argument to newDelegate().

stuartnelson3 (Contributor, Author):

you mean setting RetransmitMulti on p.delegate?

simonpasquier (Member):

Sorry for being inaccurate. I meant:

retransmit := len(knownPeers) / 2
if retransmit < 3 {
	retransmit = 3
}
p.delegate = newDelegate(l, reg, p, retransmit)
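
A complete, runnable version of that suggestion (the helper name `retransmitCount` is hypothetical; in the PR the computed value is passed straight to `newDelegate`):

```go
package main

import "fmt"

// retransmitCount returns half the number of initially known peers,
// with a floor of 3 (memberlist's default RetransmitMult).
func retransmitCount(numKnownPeers int) int {
	retransmit := numKnownPeers / 2
	if retransmit < 3 {
		retransmit = 3
	}
	return retransmit
}

func main() {
	for _, n := range []int{0, 2, 6, 10} {
		fmt.Printf("knownPeers=%d -> retransmit=%d\n", n, retransmitCount(n))
	}
}
```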

@stuartnelson3 (Contributor, Author):

> Do we want to re-gossip new/updated silences too?

Yeah, that would make sense

@stuartnelson3 (Contributor, Author):

Updated.

When running this branch at SC with only nflog entries being further gossiped, we were seeing messages queued up. Newer messages are sent preferentially by memberlist, but it's still an issue that we have an unbounded queue. Ideally, the queue would be able to empty itself, but that's not what I was seeing. I'll investigate this.

@mxinden (Member) commented Jun 1, 2018:

> The method for choosing RetransmitMult is also something to be discussed. It's an important value, but I'm wary of exposing too many configuration options in alertmanager to alter memberlist. I think we need to come up with a "smart" value that also doesn't overwhelm the transmit queues.

I agree with keeping the number of configuration options small (for now).

What is your reasoning for:

retransmit := len(knownPeers) / 2

Thanks a lot for working on this!

@stuartnelson3 (Contributor, Author):

Note: The probability calculation I was doing was based on changing GossipPeers, not RetransmitMult. Both are important values, and I tried to factor them both in for this.

  • RetransmitMult is how many times a message will be sent. Each send goes to one machine.
  • GossipPeers is how many machines to gossip to in a single gossip interval.

Each gossip interval will send messages to all machines picked by kRandomNodes, but it will not necessarily exhaust all retransmits of a message; the message may remain in the queue to be sent at the next gossip interval. But, if numNodes > retransmits, then a message will only be sent to the first N nodes, where N = retransmits. I imagine this is why I was seeing messages being queued -- I was instructing memberlist to send a message many times, but based on the number of peers it was sending messages to, it wouldn't hit that retransmit limit and remove the message from its internal queue. Newer gossip messages would arrive, and the older ones could never be sent.
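
For reference, the limit described above — the number of times a queued broadcast is sent before memberlist drops it — is derived from `RetransmitMult` scaled by the cluster size. A sketch of that formula (mirroring `retransmitLimit` in hashicorp/memberlist; the standalone program here is illustrative, not the library's code):

```go
package main

import (
	"fmt"
	"math"
)

// retransmitLimit computes how many times memberlist transmits a queued
// broadcast before removing it: RetransmitMult scaled by the ceiling of
// log10 of the cluster size plus one.
func retransmitLimit(retransmitMult, numNodes int) int {
	nodeScale := int(math.Ceil(math.Log10(float64(numNodes + 1))))
	return retransmitMult * nodeScale
}

func main() {
	for _, n := range []int{3, 10, 100} {
		fmt.Printf("RetransmitMult=3, nodes=%d -> limit=%d\n", n, retransmitLimit(3, n))
	}
}
```

Note the limit grows only logarithmically with cluster size, so a message can stay queued well past the point where every `GossipNodes`-sized interval has been exhausted.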

Probability of a machine not being gossiped to during a gossip interval

Caveat: This might be wrong :)

Play along at:
https://gist.github.com/stuartnelson3/81ba1f750196e9fccaf2f7b23bdbf5ac

Tweaking the numbers available (probabilities are the chance of not receiving a message, and given as a percentage out of 100):

| retransmits | GossipPeers | num AMs | p(NoGossip) | p(Gossip) |
|-------------|-------------|---------|-------------|-----------|
| 1           | 3           | 3       | 50.0030     | 25.0030   |
| 2           | 3           | 3       | 2.6065      | 0.0679    |
| 3           | 3           | 3       | 2.6065      | 0.0679    |
| 3           | 3           | 4       | 3.1591      | 0.0032    |
| 3           | 3           | 10      | 66.6667     | 2.6012    |
| 5           | 5           | 10      | 44.4444     | 0.0677    |

p(Gossip) is if each node is configured to gossip only newly received messages. Any message they've already seen will not be gossiped.

The testing was far from thorough given that I doubt anyone runs more than 10 instances, but it seemed like len(knownPeers) / 2 was a good place to start. If more peers join, however, maybe this value should be updated.

If a peer receives an nflog that it hasn't seen
before, queue the message and propagate it further
to other peers. This should ensure that all
peers within a cluster receive all gossip
messages.

Signed-off-by: stuart nelson <stuartnelson3@gmail.com>
For alertmanagers that are brought up with a list
of peers, set the number of message retransmits to
be half of that number. If there are no peers on
start, or there are few, continue to use the
default value of 3.

Signed-off-by: stuart nelson <stuartnelson3@gmail.com>
Signed-off-by: stuart nelson <stuartnelson3@gmail.com>
Signed-off-by: stuart nelson <stuartnelson3@gmail.com>
During a gossip, we send messages to at most
GossipNodes nodes. If possible, we want a
message to reach all nodes as soon as possible.

Signed-off-by: stuart nelson <stuartnelson3@gmail.com>
Signed-off-by: stuart nelson <stuartnelson3@gmail.com>
@stuartnelson3 force-pushed the stn/memberlist-gossip branch from 5f9378b to 4125ca8 on June 5, 2018 12:51
@stuartnelson3 mentioned this pull request on Jun 5, 2018
@stuartnelson3 merged commit 36588c3 into master on Jun 8, 2018
@stuartnelson3 deleted the stn/memberlist-gossip branch on June 8, 2018 09:48
3 participants