Commit ab444c9

Add DWPD alerts
1 parent 30467e1 commit ab444c9

1 file changed: +17 -1 lines changed


etc/kayobe/kolla/config/prometheus/smart.rules

Lines changed: 17 additions & 1 deletion
@@ -13,4 +13,20 @@ groups:
       summary: "SMART monitor reports bad disk on (instance {{ $labels.instance }})"
       description: "{{ $labels.instance }} is reporting unhealthy for the disk at {{ $labels.disk }}. Disk serial number is: {{ $labels.serial_number }}"
 
-{% endraw %}
+  - alert: DWPDTooHigh
+    expr: (delta(nvme_data_units_written_total[30d])*512000 / nvme_physical_size_bytes) / 30 > 1
+    labels:
+      severity: alert
+    annotations:
+      summary: "High 30-Day Average DWPD for {{ $labels.instance }}"
+      description: "The 30-Day average for Disk Writes Per Day for disk {{ $labels.device }} on {{ $labels.instance }} exceeds 1 DWPD"
+
+  - alert: DWPDTooHighWarning
+    expr: (delta(nvme_data_units_written_total[7d])*512000 / nvme_physical_size_bytes) / 7 > 1
+    labels:
+      severity: warning
+    annotations:
+      summary: "High 7-Day Average DWPD for {{ $labels.instance }}"
+      description: "The 7-day average for Disk Writes Per Day for disk {{ $labels.device }} on {{ $labels.instance }} exceeds 1 DWPD"
+
+{% endraw %}
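
As a rough check of the arithmetic in the two expressions above, here is a minimal Python sketch of the same calculation. It assumes one NVMe data unit is 512,000 bytes (1000 blocks of 512 bytes), which is what the 512000 factor in the rules implies. The function name, the 1 TB capacity and the sample counter values are hypothetical and are not taken from the commit.

```python
# Bytes per NVMe "data unit" (assumption: 1000 blocks of 512 bytes,
# matching the 512000 factor used in the alert expressions).
BYTES_PER_DATA_UNIT = 512_000


def average_dwpd(data_units_written, capacity_bytes, window_days):
    """Return average drive writes per day over a window.

    data_units_written: growth of the data-units-written counter in the window
    capacity_bytes:     physical size of the drive in bytes
    window_days:        length of the window in days (30 or 7 in the rules)
    """
    if capacity_bytes <= 0 or window_days <= 0:
        raise ValueError("capacity_bytes and window_days must be positive")
    bytes_written = data_units_written * BYTES_PER_DATA_UNIT
    return bytes_written / capacity_bytes / window_days


# Hypothetical 1 TB drive.
capacity = 1_000_000_000_000

# 70 million data units in 30 days is 35.84 TB written,
# about 1.19 drive writes per day, so the "> 1" condition would hold.
heavy = average_dwpd(70_000_000, capacity, 30)

# 50 million data units in 30 days is 25.6 TB written,
# about 0.85 drive writes per day, so the condition would not hold.
light = average_dwpd(50_000_000, capacity, 30)

print(f"heavy: {heavy:.2f} DWPD, fires: {heavy > 1}")
print(f"light: {light:.2f} DWPD, fires: {light > 1}")
```

The 7-day rule is the same calculation with a shorter window, so it reacts faster to a burst of writes while the 30-day rule tracks the sustained average.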
