Shorter Timeout #22

Stebalien · 2018-02-19T22:58:13Z

IMO, it's reasonable to assume that we don't really want to talk to a node with an RTT latency of over a few seconds.

Kubuxu · 2018-03-10T23:10:51Z

I disagree, sometimes this is the only node we might be able to talk to. As in, low bandwidth high ping situations.

Stebalien · 2018-03-11T00:52:45Z

The issue is that file descriptors are a scarce resource and hung dials prevent us from trying other addresses. Ideally we'd have some way to determine if we're running low on file descriptors and start killing long dials but that gets really messy.

Another thing to note is that we currently have a global timeout of 1m on fully establishing a connection. Given that it takes us (at least) 6 round trips (!) to establish a connection, we only really have 10s (at most) for establishing the TCP connection. So, how about setting TCP timeout to be timeout/6.

In general, I wonder if a rolling average timeout would work. That is, the TCP transport can track how long dials usually take (possibly ignoring dials to private addresses) and pick a reasonable timeout based on that.

ajbouh · 2018-05-03T15:36:38Z

Since @Kubuxu's concern applies only in situations where our only option is a high latency link, perhaps an escalating schedule of TCP timeouts is the simplest solution? That is, if we can't connect to anyone with a timeout of 4 seconds, try 8, then 32, etc. Some basic statistics on TCP connection times would probably yield a reasonable schedule that improves median, 80th, and maybe even 90th percentile connection times at the expense of 99th percentile times.

Thoughts?

vyzo · 2018-05-03T16:18:11Z

cc myself

marten-seemann · 2021-09-25T10:06:48Z

Given that it takes us (at least) 6 round trips (!) to establish a connection, we only really have 10s (at most) for establishing the TCP connection. So, how about setting TCP timeout to be timeout/6.

Not sure why it takes 6 roundtrips, I guess this number was still from secio times? I count 1 for the 3-way handshake, 1 for security protocol negotiation, 1 for the security handshake, 1 for muxer negotiation. So that's 4 in total.
By moving the security protocol into the multiaddr, we'll further reduce this, so (very soon) we'll be down to 3 roundtrips. Now the 5s timeout doesn't look as unreasonable any more.

Stebalien mentioned this issue Feb 19, 2018

DHT Query Performance libp2p/go-libp2p-kad-dht#88

Closed

marten-seemann closed this as completed Sep 25, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Shorter Timeout #22

Shorter Timeout #22

Stebalien commented Feb 19, 2018

Kubuxu commented Mar 10, 2018

Stebalien commented Mar 11, 2018

ajbouh commented May 3, 2018

vyzo commented May 3, 2018

marten-seemann commented Sep 25, 2021

Shorter Timeout #22

Shorter Timeout #22

Comments

Stebalien commented Feb 19, 2018

Kubuxu commented Mar 10, 2018

Stebalien commented Mar 11, 2018

ajbouh commented May 3, 2018

vyzo commented May 3, 2018

marten-seemann commented Sep 25, 2021