Commit ab4a1dd

monitoring: add alert NVMeoFMaxGatewayGroups
Add alert NVMeoFMaxGatewayGroups to prometheus_alerts.yml and prometheus_alerts.libsonnet. This alert fires when the number of NVMeoF gateway groups in a cluster exceeds the tested maximum.

Signed-off-by: Vallari Agrawal <[email protected]>
1 parent c5c4b10 commit ab4a1dd

2 files changed: 19 additions, 0 deletions
monitoring/ceph-mixin/prometheus_alerts.libsonnet

Lines changed: 10 additions & 0 deletions
@@ -885,6 +885,16 @@
               description: 'You may create many gateways in a gateway group, but %(NVMeoFMaxGatewaysPerGroup)d is the tested limit' % $._config,
             },
           },
+          {
+            alert: 'NVMeoFMaxGatewayGroups',
+            'for': '1m',
+            expr: 'count(count by (group, cluster) (ceph_nvmeof_gateway_info)) by (cluster) > %.2f' % [$._config.NVMeoFMaxGatewayGroups],
+            labels: { severity: 'warning', type: 'ceph_default' },
+            annotations: {
+              summary: 'Max gateway groups exceeded%(cluster)s' % $.MultiClusterSummary(),
+              description: 'You may create many gateway groups, but %(NVMeoFMaxGatewayGroups)d is the tested limit' % $._config,
+            },
+          },
           {
             alert: 'NVMeoFSingleGatewayGroup',
             'for': '5m',

monitoring/ceph-mixin/prometheus_alerts.yml

Lines changed: 9 additions & 0 deletions
@@ -792,6 +792,15 @@ groups:
         labels:
           severity: "warning"
           type: "ceph_default"
+      - alert: "NVMeoFMaxGatewayGroups"
+        annotations:
+          description: "You may create many gateway groups, but 4 is the tested limit"
+          summary: "Max gateway groups exceeded on cluster {{ $labels.cluster }}"
+        expr: "count(count by (group, cluster) (ceph_nvmeof_gateway_info)) by (cluster) > 4.00"
+        for: "1m"
+        labels:
+          severity: "warning"
+          type: "ceph_default"
       - alert: "NVMeoFSingleGatewayGroup"
         annotations:
           description: "Although a single member gateway group is valid, it should only be used for test purposes"
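
The new rule could be exercised with a promtool unit test along the following lines. This is only a sketch and not part of the commit. The rule file path, the cluster name `mycluster`, the group names, and the `hostname` label values are assumptions chosen for illustration. The threshold of 4 and the expected annotation text are taken from the rendered rule in prometheus_alerts.yml above.

```yaml
# Hypothetical promtool unit test; run with: promtool test rules <this file>
rule_files:
  - prometheus_alerts.yml   # assumed path, relative to this test file

evaluation_interval: 1m

tests:
  - interval: 1m
    # Five distinct gateway groups in one cluster: one more than the tested limit of 4.
    # The label values below are illustrative, not real deployment data.
    input_series:
      - series: 'ceph_nvmeof_gateway_info{cluster="mycluster",group="group-1",hostname="node-1"}'
        values: '1+0x10'
      - series: 'ceph_nvmeof_gateway_info{cluster="mycluster",group="group-2",hostname="node-2"}'
        values: '1+0x10'
      - series: 'ceph_nvmeof_gateway_info{cluster="mycluster",group="group-3",hostname="node-3"}'
        values: '1+0x10'
      - series: 'ceph_nvmeof_gateway_info{cluster="mycluster",group="group-4",hostname="node-4"}'
        values: '1+0x10'
      - series: 'ceph_nvmeof_gateway_info{cluster="mycluster",group="group-5",hostname="node-5"}'
        values: '1+0x10'

    # The inner count collapses gateways into one series per (group, cluster);
    # the outer count then counts those groups per cluster.
    promql_expr_test:
      - expr: count(count by (group, cluster) (ceph_nvmeof_gateway_info)) by (cluster) > 4.00
        eval_time: 1m
        exp_samples:
          - labels: '{cluster="mycluster"}'
            value: 5

    # Evaluated well past the 1m "for" window so the alert is firing, not pending.
    alert_rule_test:
      - eval_time: 5m
        alertname: NVMeoFMaxGatewayGroups
        exp_alerts:
          - exp_labels:
              cluster: mycluster
              severity: warning
              type: ceph_default
            exp_annotations:
              summary: "Max gateway groups exceeded on cluster mycluster"
              description: "You may create many gateway groups, but 4 is the tested limit"
```

With exactly four groups the comparison `> 4.00` returns no samples, so the same test with one input series removed should expect an empty `exp_alerts` list.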
