@@ -213,31 +213,38 @@ you should be seeing something similar to::
213213 troubleshooting the monitor, so check you ``ceph status `` again just to make
214214 sure. Proceed if the monitor is not yet in the quorum.
215215
216- **What if the state is ``probing``? **
217-
218- This means the monitor is still looking for the other monitors. Every time
219- you start a monitor, the monitor will stay in this state for some time while
220- trying to connect the rest of the monitors specified in the ``monmap ``. The
221- time a monitor will spend in this state can vary. For instance, when on a
222- single-monitor cluster (never do this in production), the monitor will pass
223- through the probing state almost instantaneously. In a multi-monitor
224- cluster, the monitors will stay in this state until they find enough monitors
225- to form a quorum |--- | this means that if you have 2 out of 3 monitors down, the
226- one remaining monitor will stay in this state indefinitely until you bring
227- one of the other monitors up.
228-
229- If you have a quorum the starting daemon should be able to find the
230- other monitors quickly, as long as they can be reached. If your
231- monitor is stuck probing and you have gone through with all the communication
232- troubleshooting, then there is a fair chance that the monitor is trying
233- to reach the other monitors on a wrong address. ``mon_status `` outputs the
234- ``monmap `` known to the monitor: check if the other monitor's locations
235- match reality. If they don't, jump to
236- `Recovering a Monitor's Broken monmap `_; if they do, then it may be related
237- to severe clock skews amongst the monitor nodes and you should refer to
238- `Clock Skews `_ first, but if that doesn't solve your problem then it is
239- the time to prepare some logs and reach out to the community (please refer
240- to `Preparing your logs `_ on how to best prepare your logs).
216+ **What does it mean if a Monitor's state is ``probing``? **
217+
218+ If ``ceph health detail `` shows that a Monitor's state is
219+ ``probing ``, then the Monitor is still looking for the other Monitors. Every
220+ Monitor remains in this state for some time when it is started. When a
221+ Monitor has connected to the other Monitors specified in the ``monmap ``, it
222+ ceases to be in the ``probing `` state. The amount of time that a Monitor is
223+ in the ``probing `` state depends upon the parameters of the cluster of which
224+ it is a part. For example, when a Monitor is a part of a single-monitor
225+ cluster (never do this in production), the monitor passes through the probing
226+ state almost instantaneously. In a multi-monitor cluster, the Monitors stay
227+ in the ``probing `` state until they find enough monitors to form a quorum
228+ |--- | this means that if two out of three Monitors in the cluster are
229+ ``down ``, the one remaining Monitor stays in the ``probing `` state
230+ indefinitely until you bring one of the other monitors up.
231+
232+ If quorum has been established, then the Monitor daemon should be able to
233+ find the other Monitors quickly, as long as they can be reached. If a Monitor
234+ is stuck in the ``probing `` state and you have exhausted the procedures above
235+ that describe the troubleshooting of communications between the Monitors,
236+ then it is possible that the problem Monitor is trying to reach the other
237+ Monitors at a wrong address. ``mon_status `` outputs the ``monmap `` that is
238+ known to the monitor: determine whether the other Monitors' locations as
239+ specified in the ``monmap `` match the locations of the Monitors in the
240+ network. If they do not, see `Recovering a Monitor's Broken monmap `_.
241+ If the locations of the Monitors as specified in the ``monmap `` match the
242+ locations of the Monitors in the network, then the persistent
243+ ``probing `` state could be related to severe clock skews amongst the monitor
244+ nodes. See `Clock Skews `_. If the information in `Clock Skews `_ does not
245+ bring the Monitor out of the ``probing `` state, then prepare your system logs
246+ and ask the Ceph community for help. See `Preparing your logs `_ for
247+ information about the proper preparation of logs.
241248
242249
243250**What if state is ``electing``? **
0 commit comments