-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Managed alertmanager no longer running in clusters #990
Comments
The fact that you're observing this only in GKE Autopilot clusters is likely related to the timing of our release rollouts. We introduced a change in #691 that scales alertmanager to zero when rules are not configured using our Can you provide a bit more information about how you are using Alertmanager? Do you have any of those |
We manage alert rules through Grafana, and so we don't have any of the |
It may also be worth checking if the StatefulSet that manages the alertmanager pods still exists: |
The
I just applied the
I'll try to craft a rule that doesn't actually alert us just to hang around and make sure the alertmanager doesn't scale to 0, unless you have another option I can try |
Something like our example rule should be a good starting point. That will be the best workaround for now. I'll discuss with the team whether it make sense for us to implement another solution for future releases. |
Not sure if this is the right place to report such a bug, but we've been using managed alertmanager for a year now, and it seems that in the last few days, the pod has disappeared from a few of our clusters, all GKE Autopilot clusters (if that matters).
Config Secret is still there and unchanged
alertmanager pod is gone
On another non-autopilot cluster:
I also see that the namespace for gmp pods is not the same on an autopilot vs regular cluster, not sure if that has anything to do with it.
The text was updated successfully, but these errors were encountered: