-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
status.master is slow, CPU intensive with high number of TCP connections #53580
Comments
We have a TCP leak right now. Thank you for bring this up. I will try to fix the leak in status.master and bring down the TCP connection count. This should also bring down the CPU workload. |
#53581 seems to be only part of the fix |
once #53581 is merged I will fix the leak. |
@cmcmarrow is there an issue open for the TCP leak? I would be interested to look at it and find if we hit the problem in our setup. |
@cifvts TCP does leak. We are currently trying to find and fix them. We have not made an issue for it. I know if you run the salt test on windows you can see the TCP count go way up. Right now we are trying to find all the tcp leaks and patch them. The other goal is to lower the number of made TCP connects by reusing them when it is safe. If we do this in salt.utils.network._netlink_tool_remote_on I believe this would help lower your cpu work load too. |
I see what you mean. The workload with the patch is completely gone. For the TCP, I will be happy to look at it since we have thousands of them in our production env. |
A fix should come out in the next update! I'll link it once I get the fix |
Parsing the output of all TCP connections might be really slow for high number of connections. This patch address the problem using the filtering provided by `ss` to reduce the number of line returned by the command. Fixes saltstack#53580
Parsing the output of all TCP connections might be really slow for high number of connections. This patch address the problem using the filtering provided by `ss` to reduce the number of line returned by the command. Fixes saltstack#53580
Parsing the output of all TCP connections might be really slow for high number of connections. This patch address the problem using the filtering provided by `ss` to reduce the number of line returned by the command. Fixes saltstack#53580
Parsing the output of all TCP connections might be really slow for high number of connections. This patch address the problem using the filtering provided by `ss` to reduce the number of line returned by the command. Fixes saltstack#53580
Description of Issue
When the number of TCP connections get high,
salt.utils.network._netlink_tool_remote_on
becomes really slow and CPU intensive.The functions is used by
status.master
which is called as a scheduled function to test Salt Master availability.Setup
We noticed the issue on servers with ~100k TCP connections (using
ss -ant | wc -l
Versions Report
The text was updated successfully, but these errors were encountered: