subject:"\[prometheus\-users\] Alert rules"

Re: [prometheus-users] alert rules: multiple rules for general & special cases

2023-01-18 Thread Brian Candler

If you refactor the rules a bit, you may find them easier to maintain: alert1: expr: probe_success{somelabel="XYZ"} == 0 labels: someswitch: foo alert2: expr: probe_success{somelabel="ABC"} == 0 labels: someswitch: bar alert3: expr: | probe_success == 0 unless probe_suc

Re: [prometheus-users] alert rules: multiple rules for general & special cases

2023-01-18 Thread Julius Volz

If your special cases have a *longer* "for" duration than the general ones, then I guess they won't be useful for inhibiting the general ones, since the special cases will start firing too late relative to the general ones to inhibit them. I guess you could introduce a copy of each special case ale

[prometheus-users] alert rules: multiple rules for general & special cases

2023-01-18 Thread Mario Cornaccini

hi, for the same metric, i want to have multiple rules in alert manager, to have longer FOR: times for some special cases. the way i do this now is: alert1 # general probe_succes{ somelabel !~"specialcase1|specialcase2"} alert2 # special probe_success{somelabel =~"specialcase1|specialcase2"} ..

[prometheus-users] alert rules between regions - to avoid triggering absent metric

2021-10-03 Thread Dan S

Hi, Looking for some general advice about shared prom alert rules between regions. We currently push the same alert rules to all regions, and sometimes we run into situations where we have a specific job in region X but not Y. This is fine for basic cases, such as *up{job="jenkins"} == 0* whi

[prometheus-users] Alert rules

2021-02-01 Thread Антон Зубков

Hi, I'm new in Prometheus. Now I try configure alerts and have problem. Have 2 rules: alert.rules1.yml and alert.rules2.yml *alert.rules1.yml* *groups:* * - name: iDrac* *rules:* *# ICMP ##* *- alert: host_is_not_available_via_icmp* * expr: probe_success{job="idrac-

[prometheus-users] Alert Rules expression value not getting changed to expected value after evaluation_interval

2020-06-04 Thread Aarti Nagdev

I am trying to do scaling operation with Prometheus Alert Manager. I have configured a webhook receiver in Alert Manager to execute my ScaleUp and ScaleDown action.I have the following things set in configuration: evaluation_interval: 1m scrape_interval: 1s There is no grouping in Alert Manager.B

Re: [prometheus-users] alert rules: multiple rules for general & special cases

Re: [prometheus-users] alert rules: multiple rules for general & special cases

[prometheus-users] alert rules: multiple rules for general & special cases

[prometheus-users] alert rules between regions - to avoid triggering absent metric

[prometheus-users] Alert rules

[prometheus-users] Alert Rules expression value not getting changed to expected value after evaluation_interval

6 matches

Site Navigation

Mail list logo

Footer information