
Add socket backlog metric #2407

Open
wants to merge 4 commits into master

Conversation

@raags commented Aug 20, 2020

If all the workers are busy or max connections are reached, new connections will queue in the socket backlog, which defaults to 2048. The gunicorn.backlog metric provides visibility into this queue and gives an idea of concurrency and worker saturation. However, this is only available on Linux platforms.

This also adds a distinction between the timer and histogram statsd metric types; although they are often treated the same, the distinction can matter, because in this case the histogram is not a timer: https://github.com/b/statsd_spec#timers
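
For illustration, the difference is only the statsd type suffix on the wire. A minimal sketch (not part of this PR's diff), assuming a plain UDP statsd agent on the default port:

    import socket

    # Sketch of the statsd wire format: timers are reported with the "ms"
    # type, histograms with "h", even if many backends aggregate both the
    # same way.
    statsd = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    addr = ("127.0.0.1", 8125)  # assumed local statsd agent

    # timer: a duration in milliseconds
    statsd.sendto(b"gunicorn.request.duration:123|ms", addr)
    # histogram: an arbitrary sampled value, e.g. the socket backlog depth
    statsd.sendto(b"gunicorn.backlog:42|h", addr)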

Another point to note: on Linux the backlog is also limited by net.core.somaxconn, which is 128 by default. I am not sure if that is the case on other platforms as well. Would it then make sense to reduce the default backlog from 2048?
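
A quick way to see the effective cap on Linux (a standalone sketch, not gunicorn code), since listen() silently clamps the requested backlog to net.core.somaxconn:

    # Linux-only sketch: compare gunicorn's configured backlog with the kernel cap.
    with open("/proc/sys/net/core/somaxconn") as f:
        somaxconn = int(f.read())

    configured_backlog = 2048  # gunicorn's default --backlog value
    print("effective backlog:", min(configured_backlog, somaxconn))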

Partially Fixes: #2057

@benoitc benoitc self-assigned this Jan 17, 2021
@hleb-albau

desired feature

@vgrebenschikov

I'll also vote for that feature

tilgovi previously approved these changes Feb 16, 2021

@tilgovi (Collaborator) left a comment

I think this change looks good. How do others feel about it?

@benoitc (Owner) commented Feb 16, 2021 via email

@tilgovi (Collaborator) commented Feb 16, 2021

I don't think this can be done by recording accepting requests. The application cannot know how many requests are in the backlog without the OS telling it. This is happening in the arbiter, so once per second at the most, I think. I doubt that making a syscall to get a number from an OS struct is expensive, but it would be great if someone could chime in who knows a bit more about how this works than I do.

@benoitc (Owner) commented Feb 16, 2021 via email

@tilgovi (Collaborator) commented Feb 16, 2021

Well 1 accepted request is done by 1 worker. You can sum it all.

That tells you how many requests are in progress, not how many are waiting in the OS backlog.

I think I have to remove my approval anyway, though. I don't know if this information is available. I can't find any documentation about tcp_unacked and why that gives any measure of the socket backlog.

@tilgovi tilgovi self-requested a review February 16, 2021 23:13
@tilgovi tilgovi dismissed their stale review February 16, 2021 23:14

need more info on tcp_unacked

@raags (Author) commented Mar 14, 2021

I think I have to remove my approval anyway, though. I don't know if this information is available. I can't find any documentation about tcp_unacked and why that gives any measure of the socket backlog.

The documentation is definitely scarce on this. I can provide a few references:

  1. The uwsgi server implements an alarm mechanism based on the socket backlog, and it calculates the backlog in the same way:

     uwsgi_sock->queue = (uint64_t) ti.tcpi_unacked;
    
  2. This is the relevant kernel code, which also has a comment alluding to its meaning.

  3. This article explains how backlog works in Linux well.

Also, the net.core.somaxconn value, which defaults to 128, was changed to 4096 in Linux 5.4. By default, gunicorn sets the backlog to 2048, but on older kernels this will get silently truncated to 128. I think the documentation should note this somewhere.
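
For anyone who wants to see what this boils down to, here is a minimal standalone sketch of reading the listen-queue depth on Linux via TCP_INFO; the field layout follows struct tcp_info in include/uapi/linux/tcp.h, and the PR's actual change lives in gunicorn/sock.py, so treat this as an approximation:

    import socket
    import struct
    import sys

    def get_listen_backlog(sock):
        """Best-effort count of connections waiting in the listen queue.

        On Linux, for a listening TCP socket the kernel reuses the
        tcpi_unacked field of struct tcp_info to report the current
        backlog length. Returns -1 where this is not available.
        """
        if not sys.platform.startswith("linux"):
            return -1
        # struct tcp_info starts with 8 one-byte fields followed by
        # 32-bit counters; tcpi_unacked is the 5th counter (unpack index 12).
        fmt = "B" * 8 + "I" * 24
        try:
            raw = sock.getsockopt(socket.IPPROTO_TCP, socket.TCP_INFO,
                                  struct.calcsize(fmt))
            return struct.unpack(fmt, raw)[12]
        except (OSError, struct.error):
            return -1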

@tilgovi (Collaborator) commented Mar 20, 2021

This looks correct. Thank you for doing the research. I think I was confused because the name suggests un-ACK'd messages, but it seems the kernel is re-using this field to have a different meaning for listening sockets than for connected sockets.

I doubt the performance of a single system call per listener should worry us in the arbiter loop. That loop is not tight. It sleeps often.

@dnlserrano

It'd be awesome to have this work out of the box and exposed as a metric in gunicorn. Any idea on when we can expect it to land in master/a release? Thanks in advance, and thanks for all the work on gunicorn! 🤗

@patrickmariglia

Certainly a very desired feature.

Would love to know if this can be included in a release.

@benoitc (Owner) commented Sep 25, 2021

@patrickmariglia why do you need it? Can you elaborate ?

I still don’t really see the point of such a metric. It is not really operational, since you can’t change the value without restarting, and you know it will fail due to a socket error anyway. The question is: what do you do with this alarm?

Keeping counters of accepted, errored, and released requests may be more useful to understand the pressure. After all, the client should normally expect to fail and retry. Servers have limited resources that can’t be scaled indefinitely. We should add it imo.

I think it is acceptable as an option, but this feature must be cross-platform and standard. Why does it target Linux specifically? How do we make it cross-platform?

@benoitc (Owner) commented Sep 25, 2021

edited comment above.

@vgrebenschikov

@patrickmariglia why do you need it? Can you elaborate ?

I still don’t really see the point of such a metric. It is not really operational, since you can’t change the value without restarting, and you know it will fail due to a socket error anyway. The question is: what do you do with this alarm?

Well, the backlog metric is the number of connections waiting to be answered, so if that metric is above 0 it is worth checking/fixing the number of workers/threads.

What I would desire even more than the backlog is the number of active/spare threads. In fact the two are connected: once the number of busy threads reaches the maximum (workers x threads), the backlog starts to grow.

So, if you can monitor the number of spare threads, you can forecast (more or less) when you will no longer have enough threads to process all parallel incoming connections.

@patrickmariglia commented Sep 27, 2021

@patrickmariglia why do you need it? Can you elaborate ?

I still don’t really see the point of such a metric. It is not really operational, since you can’t change the value without restarting, and you know it will fail due to a socket error anyway. The question is: what do you do with this alarm?

Keeping counters of accepted, errored, and released requests may be more useful to understand the pressure. After all, the client should normally expect to fail and retry. Servers have limited resources that can’t be scaled indefinitely. We should add it imo.

I think it is acceptable as an option, but this feature must be cross-platform and standard. Why does it target Linux specifically? How do we make it cross-platform?

@vgrebenschikov explains my use case very well: I am trying to calculate or approximate worker saturation. The socket backlog, or rather the number of waiting requests, can be a good proxy metric for worker saturation in situations where CPU is not a sufficient metric. So if this value even occasionally rises above 0, it could be an indicator that the number of workers needs to be increased, or that more replicas need to be spun up (if you are in a k8s environment).

If there is another way to determine saturation, such as the number of spare workers as @vgrebenschikov also points out, that would be equally useful. I had thought this was possible with the gunicorn.workers metric using a Datadog integration (datadog source reference). The metric is documented as being tagged by state: idle or working; however, after talking to their support it seems that this will not work in a k8s environment.

@israelbgf

I still don’t really see the point of such a metric. It is not really operational, since you can’t change the value without restarting, and you know it will fail due to a socket error anyway. The question is: what do you do with this alarm?

Scaling up pods in a k8s environment with this metric could be a use case. Saturation means that there aren't enough workers to handle the requests.

@matthew-walters

I've found the gunicorn.backlog metric from this forked version of Gunicorn to be extremely useful when diagnosing issues running Gunicorn apps in Kubernetes. For example, it explained why some simple liveness and readiness probes were failing: the requests timed out while still in the Gunicorn backlog.

Also, it can explain discrepancies when our ingress in Kubernetes says a request to a Gunicorn app took 10 seconds, but Gunicorn says it took 1 second. In this example, the request spent 9 seconds in the Gunicorn backlog and only after that, when it was picked up by a Gunicorn worker, did Gunicorn start counting the request duration.

Would really appreciate this PR getting merged.

@flovilmart

@benoitc, if I understand correctly, you would be willing to merge this feature if it were available on all platforms and not only Linux. Or do you have other concerns, or an alternate implementation you'd rather see merged?

This metric is useful for us in understanding stalls in request processing as @matthew-walters and @israelbgf point out.

@StasEvseev

Very desired feature!

@tilgovi (Collaborator) commented May 8, 2022

I still think this makes sense. I could imagine someone using this to decide when to scale up an autoscaling deployment. It does give a sense of whether the workers are able to process requests as fast as they arrive or not. Its utility may vary by worker type, but that's okay.

I don't see the harm in the feature, unless we are worried about the extra overhead. If we are, we can make it optional, no?

@MatthieuToulemont

I agree, this feature would be very useful!

@vishalkuo

For folks that need this behavior, I think you can implement it locally by creating a when_ready hook with the same behavior as this PR: basically spawn a background monitor (similar to the pattern we see here) and periodically look at server.LISTENERS.
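
A rough sketch of that workaround in a gunicorn config file, assuming you bring your own backlog reader (for example the TCP_INFO helper sketched earlier in this thread), since released gunicorn does not expose sock.get_backlog():

    # gunicorn.conf.py -- sketch of the when_ready workaround, not this PR's code
    import threading
    import time

    from myapp.metrics import get_listen_backlog  # hypothetical helper module

    def when_ready(server):
        # "server" is the arbiter; server.LISTENERS holds the listener
        # sockets, each wrapping a real socket object exposed as .sock.
        def monitor():
            while True:
                backlog = sum(get_listen_backlog(listener.sock)
                              for listener in server.LISTENERS)
                if backlog >= 0:
                    server.log.info("socket backlog: %d", backlog)
                    # or push the value to statsd / your metrics pipeline here
                time.sleep(1)

        threading.Thread(target=monitor, daemon=True).start()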

@robotadam

I was looking to expose saturation metrics for a service my team's responsible for, and this metric combined with a metric on the # of workers that are actively handling a request would be an ideal combination. With the latter we would have an effective metric of saturation — e.g. at a moment in time, 4/5 workers are busy then we are at 80% saturation. This PR would then give us the impact of saturation — that is if we hit 5/5 workers a small backlog size would not be an issue, but a large one would tell us how severe the issue is.

As of right now I can look at the difference in response times between the downstream load balancer and the gunicorn application server, but it's not ideal.

@vladyslav-bezpalko commented Jul 22, 2022

There's a lack of metrics for scaling gunicorn horizontally in a k8s environment right now. This PR would definitely solve that!

@sitaram-manatal

Hello good folks,
A thank you to all the people involved in committing and reviewing the code.
As this PR is approved, is there any foreseeable timeline for it to be merged and released?
We are considering the backlog size as a potential metric for horizontal autoscaling and would love to give this a try.

@wildcardops

This would be a very helpful metric for a scaling issue that we are running into. As such, I'm also interested in the timeline on the release.

@beaugunderson

A data point regarding safety: we've been running this in production for the last 60 days across ~500 instances of gunicorn without any issues (master + this PR on top), and it has helped us scale those instances a few times.

@Sharathmk99

I see the PR is approved. Is it possible to include it in the next version? We are looking forward to this metric. Thanks.

@yassinebelmamoun commented Aug 7, 2024

Any updates or visibility regarding the merge/release timeline?

This will be very helpful for a lot of folks looking to auto scale.

Big thanks to all contributors 🙏

@beaugunderson

Still running this in production without issues, would love if it was merged and released. 👍

@benoitc (Owner) left a comment

Thank you for the PR and the feedback.

Let's say I am still not convinced this metric is useful compared to its impact on the arbiter. The backlog is set statically once, so it's easy to extrapolate its usage from the average number of connections, or it can be checked by comparing the number of requests landing on the proxy with the number of running or accepted requests in gunicorn. That would reduce the contention there, especially since this requires decoding a value.

That said, I understand that people may want to get it, so I suggest the following changes (see inline comments):

  1. only trigger this metric on systems that support it
  2. possibly make this metric configurable.

This would let the system perform as usual when it can. Can you make these changes? As for 2, this will be easier soon with the new OpenTelemetry backend, but for now it can be passed using the settings module.
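
As a sketch of what those two changes could amount to on the arbiter side (the option name below is hypothetical, not an existing gunicorn setting, and get_backlog() is the method this PR adds to the listener sockets):

    # Hedged sketch of the requested behaviour, not the PR's actual diff.
    # "statsd_backlog_metric" is a hypothetical option name.
    def emit_backlog_metric(arbiter):
        if not getattr(arbiter.cfg, "statsd_backlog_metric", False):
            return  # 2. metric disabled via configuration
        backlog = sum(sock.get_backlog() or 0 for sock in arbiter.LISTENERS)
        # 1. only report a value on systems that can provide it;
        # get_backlog() is assumed to return -1 where it cannot.
        if backlog >= 0:
            arbiter.log.gauge("gunicorn.backlog", backlog)  # the PR reports it as a statsd histogram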

raags added 3 commits August 14, 2024 12:57
If all the workers are busy or max connections is reached, new
connections will queue in the socket backlog, which defaults to 2048.
The `gunicorn.backlog` metric provides visibility into this queue and
gives an idea of concurrency and worker saturation.

This also adds a distinction between the `timer` and `histogram` statsd
metric types; although they are often treated the same, the distinction
matters here because a histogram is not a timer: https://github.com/b/statsd_spec#timers
Fix failing lint tests
@raags (Author) commented Aug 14, 2024

@benoitc I've updated the PR with the changes requested, please review.

@benoitc (Owner) left a comment

The latest changes look good to me, thanks for them! Can you look at my comment? Either way I think it's good for merging.

@benoitc (Owner) left a comment

Thanks for the update. Sorry, I missed the backlog check in my previous review. Can you fix it as well? Also, look at the failing CI tests.

backlog = sum(sock.get_backlog() or 0
              for sock in self.LISTENERS)

if backlog:
@benoitc (Owner)

This will always be true here, even when backlog is set to -1.
I think the correct test is `if backlog >= 0`.

@raags (Author) Aug 14, 2024

Ah yes, I missed this - pushing the fix

@raags raags force-pushed the master branch 3 times, most recently from 02389e4 to cf861a2 Compare August 14, 2024 13:32
@raags (Author) commented Aug 14, 2024

Hi @benoitc, the CI has passed.

@Barsoomx commented Aug 28, 2024

Won't this break if the connection is already submitted to the ThreadPoolExecutor in a gthread worker? Connections are submitted to the ThreadPoolExecutor up to the worker_connections amount. Is it possible to also pull this data?

If not, I think it's worth specifying in the docs.

@raags (Author) commented Aug 31, 2024

@Barsoomx I didn't understand, can you elaborate? The backlog is read from the listening sockets and isn't dependent on the number of worker connections.

@Barsoomx

@Barsoomx I didn't understand, can you elaborate? The backlog is read from the listening sockets and isn't dependent on the number of worker connections.

I've tried to implement this locally with the gthread worker class and it doesn't show the correct backlog count. I suspect the reason is the keepalive + worker_connections logic (sockets are ACKed).

What happens with a sync worker: connections stay in the TCP backlog until a worker can pick them up.

What happens with a threaded worker: connections are enqueued into a thread pool and kept alive until a thread can process them, up to the worker_connections count.

In that case (the threaded worker) the metric has no value, and the real "worker backlog" is the number of connections that exceed the thread count for each worker:

len(worker.futures) - cfg.threads (for the threaded worker only)
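
A rough sketch of that approximation as a helper; worker.futures is a gthread implementation detail rather than a public API, so treat this as a hack:

    # Hypothetical helper for the gthread worker only: connections handed to
    # the ThreadPoolExecutor beyond what the configured threads can run at once.
    def gthread_queue_depth(worker, cfg):
        return max(len(worker.futures) - cfg.threads, 0)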

@matthew-walters

Come on folks, it's been over 4 years now.
Can we just limit the scope of this to the sync worker and get it merged?

Successfully merging this pull request may close these issues:

Worker level metrics