Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion etc/kayobe/kolla/config/prometheus/rabbitmq.rules
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ groups:
- name: rabbitmq.rules
rules:
- alert: RabbitMQNodeDown
expr: sum(rabbitmq_build_info{instance!=""}) < 3
expr: sum(rabbitmq_build_info{instance!=""}) < {% endraw %}{{ alertmanager_number_of_rabbitmq_nodes }}{% raw %}
for: 30m
labels:
severity: critical
Expand Down
3 changes: 3 additions & 0 deletions etc/kayobe/stackhpc-monitoring.yml
Original file line number Diff line number Diff line change
Expand Up @@ -26,6 +26,9 @@ alertmanager_packet_drop_threshold: 1
# packets/s averaged over 5 minutes.
alertmanager_packet_errors_threshold: 1

# Number of RabbitMQ nodes in the cluster.
alertmanager_number_of_rabbitmq_nodes: "{{ groups['controllers'] | length }}"

###############################################################################
# Exporter configuration

Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,8 @@
---
features:
- |
Allow for easy customisation of the number of expected `RabbitMQ`
nodes when evaluating the alert `RabbitMQNodeDown`. It is set by
the `alertmanager_number_of_rabbitmq_nodes` which defaults to the
number of `controllers`. This is benefical for deployments that
do not use a standard three node setup.
Loading