Skip to content

Commit c9be4e4

Browse files
committed
Redfish exporter: Decrease sensitivity of alert
The redfish exporter is prone to failed scrapes. Lets wait for mulitple failed scrapes before triggering an alert. This should still catch the case where it is completely dead, but reduce the false positives from failed scrapes.
1 parent 978d7b1 commit c9be4e4

File tree

2 files changed

+16
-1
lines changed

2 files changed

+16
-1
lines changed

etc/kayobe/kolla/config/prometheus/prometheus.rules

Lines changed: 10 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,14 +7,23 @@ groups:
77
rules:
88

99
- alert: PrometheusTargetMissing
10-
expr: up == 0
10+
expr: up{job!="redfish-exporter-seed"} == 0
1111
for: 5m
1212
labels:
1313
severity: critical
1414
annotations:
1515
summary: "Prometheus target missing (instance {{ $labels.instance }})"
1616
description: "A Prometheus target has disappeared. An exporter might have crashed."
1717

18+
- alert: PrometheusTargetMissing
19+
expr: up{job="redfish-exporter-seed"} == 0
20+
for: 15m
21+
labels:
22+
severity: critical
23+
annotations:
24+
summary: "Prometheus target missing (instance {{ $labels.instance }})"
25+
description: "A Prometheus target has disappeared. An exporter might have crashed."
26+
1827
- alert: PrometheusAllTargetsMissing
1928
expr: count by (job) (up) == 0
2029
for: 1m
Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
---
2+
fixes:
3+
- |
4+
Changes the duration for which redfish exporter must continually fail
5+
scrapes before triggering an alert to 15 minutes. This should hopefully
6+
reduce some alert spam.

0 commit comments

Comments
 (0)