Commit a5248f5

doc/rados/operations/health-checks: Add MON_NETSPLIT Warning
Fixes: https://tracker.ceph.com/issues/67371
Signed-off-by: Kamoltat Sirivadhna <[email protected]>
1 parent 96f15ce commit a5248f5

1 file changed: +18 -0 lines changed

doc/rados/operations/health-checks.rst

Lines changed: 18 additions & 0 deletions
@@ -153,6 +153,24 @@ To adjust the warning threshold, run the following command:

 ceph config set global mon_data_size_warn <size>

+MON_NETSPLIT
+____________
+
+A network partition has occurred among Ceph Monitors. This health check is
+raised when one or more monitors detect that at least two Ceph Monitors have
+lost connectivity or reachability, based on their individual connection scores,
+which are updated frequently. This warning appears only when the cluster is
+provisioned with at least three Ceph Monitors and is using the
+``connectivity`` election strategy.
+
+Network partitions are reported in two ways:
+- As location-level netsplits (e.g., "Netsplit detected between dc1 and dc2") when
+  all monitors in one location cannot communicate with all monitors in another location
+- As individual monitor netsplits (e.g., "Netsplit detected between mon.a and mon.d")
+  when only specific monitors are disconnected across locations
+
+The system prioritizes reporting at the highest topology level (``datacenter``, ``rack``, etc.)
+when possible, to better help operators identify infrastructure-level network issues.

 AUTH_INSECURE_GLOBAL_ID_RECLAIM
 _______________________________
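
The two reporting modes described in the added text can be illustrated with a small Python sketch. This is not the Ceph Monitor implementation: the function name `report_netsplits`, its parameters, and the input shapes (a monitor-to-location mapping and a list of disconnected monitor pairs) are hypothetical and chosen only for illustration. It assumes a single topology level and treats any disconnected pair not covered by a location-level report as an individual monitor netsplit.

```python
from itertools import combinations


def report_netsplits(locations, broken_links, election_strategy="connectivity"):
    """Illustrative only: turn disconnected monitor pairs into netsplit messages.

    locations:    dict mapping monitor name -> location name (hypothetical input)
    broken_links: iterable of (mon, mon) pairs that lost connectivity
    """
    # Per the documentation, the warning applies only with at least three
    # monitors and the connectivity election strategy.
    if len(locations) < 3 or election_strategy != "connectivity":
        return []

    # Normalize pairs so ("a", "c") and ("c", "a") are the same link.
    broken = {frozenset(pair) for pair in broken_links if pair[0] != pair[1]}

    # Group monitors by location.
    by_location = {}
    for mon, loc in locations.items():
        by_location.setdefault(loc, []).append(mon)

    messages = []
    covered = set()

    # Location-level: every monitor in one location is cut off from every
    # monitor in the other location.
    for loc_a, loc_b in combinations(sorted(by_location), 2):
        pairs = {
            frozenset((a, b))
            for a in by_location[loc_a]
            for b in by_location[loc_b]
        }
        if pairs <= broken:
            messages.append(f"Netsplit detected between {loc_a} and {loc_b}")
            covered |= pairs

    # Individual-level: remaining disconnected pairs are reported per monitor.
    for pair in sorted(broken - covered, key=sorted):
        a, b = sorted(pair)
        messages.append(f"Netsplit detected between mon.{a} and mon.{b}")

    return messages


if __name__ == "__main__":
    mons = {"a": "dc1", "b": "dc1", "c": "dc2", "d": "dc2", "e": "dc3"}
    print(report_netsplits(mons, [("a", "d")]))
```

Reporting the location-level message first and suppressing the per-monitor messages it covers mirrors the stated preference for the highest topology level; how the real monitors derive disconnections from connection scores is not modeled here.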
