-
Notifications
You must be signed in to change notification settings - Fork 8.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Stack Monitoring] [Test Scenario] Out of the box alerting #93072
Comments
Pinging @elastic/stack-monitoring (Team:Monitoring) |
What would this minimum set of permissions be? Only the |
Is it a known issue that saving a changed alert and clicking "edit" again immediately after the success toast shows the second-to-last state? |
This might be stupid question, but can the dedicated monitoring cluster monitor itself in addition to the remote clusters? |
Yes, it's being changed now -> #94167. We actually never updated the test for this change, but we are reverting it so it's accurate now and should be available to test soon. Correct. It is It is not. Worth filing a ticket for sure Absolutely not stupid! Yes that's fine as well. |
It seems a user with only But GET /_security/user/_has_privileges
{
"index": [
{
"names": [
".monitoring-*-6-*,.monitoring-*-7-*"
],
"privileges": [
"read"
]
}
]
} yields {
"username" : "monitoring-only",
"has_all_requested" : true,
"cluster" : { },
"index" : {
".monitoring-*-6-*,.monitoring-*-7-*" : {
"read" : true
}
},
"application" : { }
}
Is that expected? |
What is the interaction between stack monitoring alerting and CCS? When does stack monitoring access monitoring data via CCS? |
🪲 I filed "Alert editing fly-out shows outdated alert parameters after editing" (#94440) |
Yes this is actually! Setup mode requires additional permissions that are not available to the basic permission set. See #50421 Apologies for the confusion. There are three ways in which the Stack Monitoring UI can find monitoring data.
The intention of the bullet in this test scenario is to ensure that when looking at monitoring data through a CCS "way", the alerts work as expected. We've seen a few regressions around this "way" which is why I added it. |
Thanks for the clarification. So how do I configure it to access the monitoring data from a remote cluster via CCS? |
Now it's setup with CCS |
Ah, so it's automatically looking at all the remote clusters. Thanks! |
Whenever I visit the stack monitoring app on my cloud test cluster, a toast is displayed that informs me about newly created alerts. All 14 alerting rules were already present before, though, and no new alerting rules show up. Would that warrant a bug report or would you consider this a non-issue since the toast is just informative? |
🪲 I filed "Alert grouping toggle never renders as enabled" (#94556) |
Are you hard refreshing the page? If so, that's expected. It shouldn't show up again in the same session (in that you don't ever hard refresh). The logic to avoid showing it is only an in-memory variable. |
Yes, because without the refresh switching to a cluster takes a long time. Thanks for the clarification. |
This seems like a problem too. Is this in a cloud environment I can see? |
https://p.elstc.co/paste/aPPgTbob#UNKkP9yx2rmOX6kOZYd4nXDeiol-wVXH2orboTaykYI Sometimes switching to a cluster view takes 10 seconds or longer. And switching back to the list doesn't work reliably either. I think I already filed an issue about that during last release's test phase, though. |
@weltenwort That's fine. I think the slowness is a separate issue and we can follow up later. |
I somehow can't get the license expiration to trigger. I use an ingest processor that forces the
This looks like the |
I went through every item, but (as noted in the description) was unable to produce the necessary conditions for some. I'd appreciate it if someone with more in-depth knowledge could double-check those. I linked the issues I created underneath the corresponding item in the description. |
@weltenwort Here is the pipeline I use for license expiration: https://gist.github.com/chrisronline/9d4d3d740e535d3c01410cac2cc74653 Does this match what you have? |
Yes, that's what I used. The resulting document looks fine, but it doesn't trigger the alert somehow. |
@weltenwort Are you trying on cloud? I imagine you are hitting against this work: #84361 There is a config option to hide the license expiration logic from the SM UI and I think cloud sets it by default. I'll make sure to note this in the test in the future. Apologies for the confusion. |
Yes, I was testing mostly on cloud. The alert is visible and enabled in the UI, but never fires. |
I opened #95090 to address that issue |
@weltenwort Testing thread pool rejections is a bit cumbersome. The only viable way is to set your max queue size to 0 via: https://www.elastic.co/guide/en/elasticsearch/reference/current/modules-threadpool.html. Though I was only able to achieve this locally on a bare-bone/throw-away cluster. For some reason this did not work on cloud, I imagine it's because the rejections are collected from the container and not the machine itself. The other way is to create a pipeline that accumulates those counts (since it's a derivative) in incoming |
Summary
Stack Monitoring provides a set of out-of-the-box alerts, created by simply loading the Stack Monitoring UI within Kibana. The default action for each alert is a server log and the action messaging is controlled by the Stack Monitoring UI code directly.
PRs
Original, and CPU alert: #68805
Disk usage alert: #75419
JVM memory usage alert: #79039
Missing monitoring data alert: #78208
Threadpool rejections alert: #79433
Testing
Creation
Management
UX
Specific alerts
Information in reproducting legacy alerts -> #87377
Edge cases
Previous issue: #85841
The text was updated successfully, but these errors were encountered: