Prevent alerts from inhibiting themselves #1017

alin-amana · 2017-10-02T12:38:58Z

This resolves #535.

Simply checks for the fingerprint of the potentially inhibiting alert not being equal to the inhibited alert.

brian-brazil · 2017-10-02T12:43:26Z

We've already discussed this elsewhere and decided that this is not worth making an exception for. The desired effect can be achieved by an appropriate label layout.

alin-amana · 2017-10-02T12:49:06Z

Not trying to be obnoxious, but I don't understand why #666 is still open in this case. I.e. if this is the intended behavior and it is documented as such.

brian-brazil · 2017-10-02T12:50:53Z

It's still open pending documentation to clarify that this is the behaviour.

alin-amana · 2017-10-02T12:58:49Z

Well, AFAICT https://prometheus.io/docs/alerting/configuration/#<inhibit_rule> already has this note.

alin-amana · 2017-10-04T08:11:01Z

Since there seem to be no takers, I'm closing this. :o)

beorn7 · 2017-10-04T10:22:13Z

We've already discussed this elsewhere and decided that this is not worth making an exception for. The desired effect can be achieved by an appropriate label layout.

IIRC the outcome was that the desired effect cannot be achieved by an appropriate label layout.

The stated reason to not prevent alerts from inhibiting itself was that the implementation would be too tricky. From the user's perspective, it does never make sense for an alert to inhibit itself.

I would at least like to have one of the Alertmanager people check if this attempt pulls it off. @stuartnelson3 @brancz @fabxc

brian-brazil · 2017-10-04T11:14:55Z

IIRC the outcome was that the desired effect cannot be achieved by an appropriate label layout.

I don't see that outcome, merely that one specific way of doing it doesn't work. Have you tried something along the lines of severity=page and severity=page-nodedown?

beorn7 · 2017-10-04T11:16:28Z

Our intention is to not bloat our severity levels.

brian-brazil · 2017-10-04T11:28:34Z

The proposal to change the inhibition bloats the alertmanger config with knowledge of alertnames, and I expect will be more fragile overall to keep in sync. The expected way to use inhibition is via severity labels and equivalents, I'm generally against changing things merely because a user doesn't like the recommended way of doing things.

alin-amana · 2017-10-04T11:39:44Z

For context, I realized after I submitted this PR that it is not a complete fix.

E.g. if you have 2 ALERT metrics that match source_match/source_match_re they will not inhibit themselves, but they will inhibit each other. It would be slightly more involved (but not impossible) to prevent this from occurring.

beorn7 · 2017-10-04T13:07:49Z

E.g. if you have 2 ALERT metrics that match source_match/source_match_re they will not inhibit themselves, but they will inhibit each other.

Isn't that the intended behavior? Could you give a full example (with the full inhibition rule) so that I can understand what's the problem with that?

brian-brazil · 2017-10-04T13:20:17Z

Off the top of my head, say you had NodeDownReasonFoo and NodeDownReasonBar which you wanted either to inhibit all other alerts from a machine. Even if self-inhibition was stopped, if both were firing for a node then both would get inhibited and you'd get no notifications.

My gut feeling is that this isn't solvable in general, though I haven't thought through it fully.

beorn7 · 2017-10-04T14:18:41Z

How about this take: Any alert that is matched by the source matchers of a given inhibition rules is automatically prevented from matching the target side of the same inhibition rule.

brian-brazil · 2017-10-04T14:25:12Z

At a first pass that'd seem to do the right thing, I'm still not convinced the setups it enables are a good idea though.

fabxc · 2017-10-04T14:44:47Z

An alert inhibiting so far has been unexpected and unintuitive behavior 100% of the time so far. I see little reason to keep it around just for the sake of it. If there's an intuitive and correct solution, I'd be in favor of it.

alin-amana · 2017-11-02T09:28:38Z

I've taken @beorn7's suggestion and came up with a one-line change that prevents alerts that match both the source and target filters from inhibiting themselves.

You can still set up multiple inhibition rules (such as A inhibits B, B inhibits A) that will essentially end up inhibiting all alerts, but you can't do it with a single '<inhibit_rule>' anymore. Progress. :o)

stuartnelson3 · 2017-11-02T10:28:27Z

Seems like a sound solution to me, but I've not been involved in the discussion so far, so I leave it to the others to decide. The circular inhibition issue already exists, so 🤷‍♀️ from me.

beorn7 · 2017-11-02T10:55:26Z

I'll have a look ASAP…

alin-amana · 2017-11-03T14:01:21Z

ASAP

I do not think it means what you think it means. Sorry, I just had to take that. Low hanging fruit and all... :o)

Any chance of looking at it though? I've added a comment to get CircleCI to retry so it's now a 2 line change, but only one is code.

beorn7 · 2017-11-06T14:29:49Z

ASAP

I do not think it means what you think it means. Sorry, I just had to take that. Low hanging fruit and all... :o)

I know exactly what it means, sadly…

beorn7 · 2017-11-06T14:30:34Z

I think this semantics makes sense. 👍 from my side.

beorn7 · 2017-11-07T10:26:12Z

@stuartnelson3 I'm not sure if your last comment here including that emoji (purple angel?) means approval or not. If you agree with this change, please simply merge.

stuartnelson3 · 2017-11-07T10:30:03Z

Sorry for the confusion, it was supposed to be the shrugging emoji. I'll go ahead and merge, thanks for reviewing this.

alin-amana · 2017-11-07T13:04:18Z

Whee! Thanks!

Signed-off-by: Dan Fredell <Dan.Fredell@gmail.com>

kelein

.

Prevent alerts from inhibiting themselves.

8666d3e

alin-amana closed this Oct 4, 2017

alin-amana deleted the dont_inhibit_self branch October 4, 2017 08:11

beorn7 mentioned this pull request Oct 4, 2017

Alerts can inhibit themselves #666

Closed

alin-amana added 3 commits November 1, 2017 15:30

Merge remote-tracking branch 'upstream/master' into dont_inhibit_self

110e6a8

Merge remote-tracking branch 'upstream/master' into dont_inhibit_self

4e9cab9

Don't inhibit alerts that match the source filter.

be864e9

alin-amana reopened this Nov 2, 2017

No-op, nudge CircleCI.

e103d01

stuartnelson3 approved these changes Nov 7, 2017

View reviewed changes

stuartnelson3 merged commit dc3c78e into prometheus:master Nov 7, 2017

hh pushed a commit to ii/alertmanager that referenced this pull request Sep 5, 2018

Fix SmartOS build prometheus#1017 (prometheus#1018)

c52e0d3

Signed-off-by: Dan Fredell <Dan.Fredell@gmail.com>

tbregolin mentioned this pull request Jan 28, 2019

Clarify inhibition heuristics with equal labels prometheus/docs#1269

Merged

kelein reviewed Jul 5, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prevent alerts from inhibiting themselves #1017

Prevent alerts from inhibiting themselves #1017

alin-amana commented Oct 2, 2017

brian-brazil commented Oct 2, 2017

alin-amana commented Oct 2, 2017

brian-brazil commented Oct 2, 2017

alin-amana commented Oct 2, 2017

alin-amana commented Oct 4, 2017

beorn7 commented Oct 4, 2017

brian-brazil commented Oct 4, 2017 •

edited

Loading

beorn7 commented Oct 4, 2017

brian-brazil commented Oct 4, 2017

alin-amana commented Oct 4, 2017

beorn7 commented Oct 4, 2017

brian-brazil commented Oct 4, 2017

beorn7 commented Oct 4, 2017

brian-brazil commented Oct 4, 2017

fabxc commented Oct 4, 2017

alin-amana commented Nov 2, 2017 •

edited

Loading

stuartnelson3 commented Nov 2, 2017

beorn7 commented Nov 2, 2017

alin-amana commented Nov 3, 2017

beorn7 commented Nov 6, 2017

beorn7 commented Nov 6, 2017

beorn7 commented Nov 7, 2017

stuartnelson3 commented Nov 7, 2017

alin-amana commented Nov 7, 2017

kelein left a comment

Prevent alerts from inhibiting themselves #1017

Prevent alerts from inhibiting themselves #1017

Conversation

alin-amana commented Oct 2, 2017

brian-brazil commented Oct 2, 2017

alin-amana commented Oct 2, 2017

brian-brazil commented Oct 2, 2017

alin-amana commented Oct 2, 2017

alin-amana commented Oct 4, 2017

beorn7 commented Oct 4, 2017

brian-brazil commented Oct 4, 2017 • edited Loading

beorn7 commented Oct 4, 2017

brian-brazil commented Oct 4, 2017

alin-amana commented Oct 4, 2017

beorn7 commented Oct 4, 2017

brian-brazil commented Oct 4, 2017

beorn7 commented Oct 4, 2017

brian-brazil commented Oct 4, 2017

fabxc commented Oct 4, 2017

alin-amana commented Nov 2, 2017 • edited Loading

stuartnelson3 commented Nov 2, 2017

beorn7 commented Nov 2, 2017

alin-amana commented Nov 3, 2017

beorn7 commented Nov 6, 2017

beorn7 commented Nov 6, 2017

beorn7 commented Nov 7, 2017

stuartnelson3 commented Nov 7, 2017

alin-amana commented Nov 7, 2017

kelein left a comment

Choose a reason for hiding this comment

brian-brazil commented Oct 4, 2017 •

edited

Loading

alin-amana commented Nov 2, 2017 •

edited

Loading