Commit 4a7866a

monitoring: fix NVMeoFSubsystemNamespaceLimit
Alert is not triggered as expected, change the query to fix that.

BZ: https://bugzilla.redhat.com/show_bug.cgi?id=2282348
Signed-off-by: Vallari Agrawal <[email protected]>
1 parent 9fa163d commit 4a7866a
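
A rough Python sketch of why the old expression probably never matched. It is a toy model of PromQL label matching, not Prometheus code. It assumes that `ceph_nvmeof_subsystem_namespace_limit` carries an `instance` label (the new `on(nqn, instance)` clause suggests this, but the commit does not list the metric's labels). All label values are hypothetical, and the many-to-one `group_right(cluster)` part is simplified to a one-to-one pairing.

```python
# Toy model of PromQL label matching for a filtering comparison (>=).
# A sample is (labels_dict, value). All label values below are hypothetical.

def match_key(labels, on=None):
    """Labels used to pair samples: every label by default, or only those in on(...)."""
    if on is None:
        return frozenset(labels.items())
    return frozenset((name, labels.get(name, "")) for name in on)


def filter_ge(lhs, rhs, on=None):
    """Keep each left-hand sample whose value is >= the right-hand sample with the same key."""
    rhs_by_key = {match_key(labels, on): value for labels, value in rhs}
    kept = []
    for labels, value in lhs:
        key = match_key(labels, on)
        if key in rhs_by_key and value >= rhs_by_key[key]:
            kept.append((labels, value))
    return kept


# Assumed shape of the limit metric: it has an `instance` label.
limit = [({"nqn": "nqn.example", "cluster": "c1", "instance": "gw1"}, 2)]

# Old query: count by(nqn, cluster) drops `instance`, so the label sets differ.
old_count = [({"nqn": "nqn.example", "cluster": "c1"}, 2)]
old_result = filter_ge(old_count, limit)

# New query: count by(nqn, cluster, instance), matched on(nqn, instance).
new_count = [({"nqn": "nqn.example", "cluster": "c1", "instance": "gw1"}, 2)]
new_result = filter_ge(new_count, limit, on=("nqn", "instance"))

print(old_result)       # nothing pairs up, so the alert cannot fire
print(len(new_result))  # one sample at the limit
```

Under these assumptions the old comparison returns an empty result even though the subsystem is at its limit, while the new one returns the sample that should trigger the alert.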

2 files changed: +2 -2 lines changed

monitoring/ceph-mixin/prometheus_alerts.libsonnet

Lines changed: 1 addition & 1 deletion
@@ -848,7 +848,7 @@
 {
   alert: 'NVMeoFSubsystemNamespaceLimit',
   'for': '1m',
-  expr: '(count by(nqn, cluster) (ceph_nvmeof_subsystem_namespace_metadata)) >= ceph_nvmeof_subsystem_namespace_limit',
+  expr: '(count by(nqn, cluster, instance) (ceph_nvmeof_subsystem_namespace_metadata)) >= on(nqn, instance) group_right(cluster) ceph_nvmeof_subsystem_namespace_limit',
   labels: { severity: 'warning', type: 'ceph_default' },
   annotations: {
     summary: '{{ $labels.nqn }} subsystem has reached its maximum number of namespaces%(cluster)s' % $.MultiClusterSummary(),

monitoring/ceph-mixin/prometheus_alerts.yml

Lines changed: 1 addition & 1 deletion
@@ -760,7 +760,7 @@ groups:
 annotations:
   description: "Subsystems have a max namespace limit defined at creation time. This alert means that no more namespaces can be added to {{ $labels.nqn }}"
   summary: "{{ $labels.nqn }} subsystem has reached its maximum number of namespaces on cluster {{ $labels.cluster }}"
-expr: "(count by(nqn, cluster) (ceph_nvmeof_subsystem_namespace_metadata)) >= ceph_nvmeof_subsystem_namespace_limit"
+expr: "(count by(nqn, cluster, instance) (ceph_nvmeof_subsystem_namespace_metadata)) >= on(nqn, instance) group_right(cluster) ceph_nvmeof_subsystem_namespace_limit"
 for: "1m"
 labels:
   severity: "warning"
