Skip to content

Commit 48a0613

Browse files
authored
Merge pull request ceph#64869 from zdover23/wip-doc-2025-08-07-cephfs-troubleshooting
doc/cephfs: edit troubleshooting.rst Reviewed-by: Anthony D'Atri <[email protected]>
2 parents ebf66bf + 7c8e381 commit 48a0613

File tree

1 file changed

+14
-12
lines changed

1 file changed

+14
-12
lines changed

doc/cephfs/troubleshooting.rst

Lines changed: 14 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -27,15 +27,16 @@ Stuck during recovery
2727
Stuck in up:replay
2828
------------------
2929

30-
If your MDS is stuck in the ``up:replay`` state, then it is likely that the
31-
journal is very long. Did you see ``MDS_HEALTH_TRIM`` cluster warnings saying
32-
the MDS is behind on trimming its journal? Very large journals can take hours
33-
to read. There is no working around this but there are things you can do to
34-
speed things along:
30+
If your MDS is stuck in the ``up:replay`` state, then the journal is probably
31+
very long. The presence of ``MDS_HEALTH_TRIM`` cluster warnings can indicate
32+
that the MDS has not yet caught up while trimming its journal. Very large
33+
journals can take hours to process. There is no working around this, but there
34+
are things you can do to speed up the process:
3535

36-
Reduce MDS debugging to 0. Even with the default settings, the MDS logs a few
37-
messages to memory for dumping in case a fatal error is encountered. You can
38-
turn off all logging by running the following commands:
36+
Temporarily disable MDS debug logs by reducing MDS debugging to ``0``. Even
37+
with the default settings, the MDS logs a few messages to memory for dumping in
38+
case a fatal error is encountered. You can turn off all logging by running the
39+
following commands:
3940

4041
.. prompt:: bash #
4142

@@ -44,17 +45,18 @@ turn off all logging by running the following commands:
4445
ceph config set mds debug_monc 0
4546

4647
Remember that when you set ``debug_mds``, ``debug_ms``, and ``debug_monc`` to
47-
``0``, Note if the MDS fails then there will be no information to determine why
48-
fatal errors occurred. If you can calculate when ``up:replay`` will complete,
49-
you should restore these configs just prior to entering the next state:
48+
``0``, if the MDS fails then there will be no debugging information that can be
49+
used to determine why fatal errors occurred. If you can calculate when
50+
``up:replay`` will complete, restore these configurations just prior to
51+
entering the next state:
5052

5153
.. code:: bash
5254
5355
ceph config rm mds debug_mds
5456
ceph config rm mds debug_ms
5557
ceph config rm mds debug_monc
5658
57-
After replay has been speeded up, calculate when the MDS will complete the
59+
After replay has been expedited, calculate when the MDS will complete the
5860
replay. Examine the journal replay status:
5961

6062
.. code:: bash

0 commit comments

Comments
 (0)