-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SNMP plugin hits open file limit errors #2104
Comments
I believe this issue has come to light because the user is using many (thousands) of
in 1.0, these fields were not getting translated, but that was changed in 1.1 with #1836 @phemmer do you have any ideas on how to solve this? A couple questions I have:
FWIW, I think that fixing #1665 would have prevented this from being hit in the first place, as the user wouldn't have needed to specify thousands of separate fields just to parallelize their agent collections. |
Actually it's because of the number of
The
I would vote no. It just complicates the code for a rather uncommon edge case. One that would become near impossible with the change I mention below.
#1665 might have prevented (the object-instance-per-agent thing) this yes as the config probably wouldn't have had so many The proper way to fix this I think is to share the |
I don't think this is happening in failure scenarios, because |
Ah, good call. Its init is called every gather because there is no explicit plugin initialization call. And since telegraf kicks them all off in parallel, the issue would recur over and over. The global snmptranslate data I think is enough of a fix for this. To where we wouldn't need to otherwise change the way the plugin initializes itself. |
@phemmer It sounds like you're on top of this, but let me know if you need a hand or eyes on a PR. We've got a few people who have asked about this, so we can probably get a 1.1.2 release lined up whenever this is fixed. Thanks for digging in! |
PR is up: #2115 |
Prevents the same data from being looked up multiple times. Also prevents multiple simultaneous lookups. closes influxdata#2115 closes influxdata#2104
Issue found in Telegraf v1.1.1
Config file loads approximately 390 inputs.snmp plugins - works well with Telegraf v1.0.1.
Telegraf tries to snmptranslate all OID's defined in
[[inputs.snmp.table.field]]
under[[inputs.snmp.table]]
.Multiple concurrent snmptranslate processes found to be running at the same time on the system (over 200 at a given point). The snmptranslate commands fail to translate the OID's, return code is 0 though.
The system eventually runs out of open file descriptors.
Sample output:
The text was updated successfully, but these errors were encountered: