-
Notifications
You must be signed in to change notification settings - Fork 4.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[metricbeat] Stack monitoring modules may ignore xpack
configuration on error
#30809
Comments
Pretty sure I've seen |
In 8.x the codepath that logs errors when This makes me think that status quo is acceptable for 8.x since all errors are available and queryable, and it is an improvement over the 7.x inconsistent behavior that logs the error in some cases. The questions left are whether a dedicated index like |
If we change the location at all, it'd be interesting to consider something like |
Hi! We're labeling this issue as |
Summary
Stack monitoring related modules have a specific configuration (
xpack.enabled: true
) that allows them to write events to.monitoring-{module}-*
indices instead of the usualmetricbeat-*
.In these modules, failing to generate metricsets in some code paths will send an event to the
metricbeat-*
indice regardless of theirxpack.enabled
configuration. This is aligned with the way metricbeat reports error in all its modules but stack monitoring modules are different beasts since they allow an override of the destination indice for the regular events. Considering that, it can be counter-intuitive ux to ship regular and errors events to different indices and could hinder discoverability of the errors from a user perspective.7.x versions also have an inconsistent behavior in error handling where in some cases the errors will only be logged and not returned back to the metricbeat error reporting (example).
Questions:
Next steps
Ideally we would build a mechanism that ingests all errors generated when
xpack.enabled: true
. The first step would be to stop reporting them tometricbeat-*
and route them to the logger. As a next step we could use this mechanism to standardize and store these errors in a dedicated place (ie a new.monitoring-errors-mb
datastream ?). The idea is to enable easy consumptions of these values:The text was updated successfully, but these errors were encountered: