Commit 8300186

Merge pull request ceph#54664 from zdover23/wip-doc-2023-11-27-rados-troubleshooting-troubleshooting-mon-everything-failed
doc/rados: edit "Everything Failed! Now What?"

Reviewed-by: Antony D'Atri <[email protected]>
2 parents e577734 + d8cd492 commit 8300186

1 file changed: +35 -28 lines changed

doc/rados/troubleshooting/troubleshooting-mon.rst

Lines changed: 35 additions & 28 deletions
@@ -616,7 +616,6 @@ The above recovery tool is unable to recover the following information:

 - **MDS Maps**: The MDS maps are lost.

-
 Everything Failed! Now What?
 ============================

@@ -628,16 +627,20 @@ irc.oftc.net), or at ``[email protected]`` and ``[email protected]``. Make
 sure that you have prepared your logs and that you have them ready upon
 request.

-See https://ceph.io/en/community/connect/ for current (as of October 2023)
-information on getting in contact with the upstream Ceph community.
+The upstream Ceph Slack workspace can be joined at this address:
+https://ceph-storage.slack.com/

+See https://ceph.io/en/community/connect/ for current (as of December 2023)
+information on getting in contact with the upstream Ceph community.

 Preparing your logs
 -------------------

-The default location for monitor logs is ``/var/log/ceph/ceph-mon.FOO.log*``.
-However, if they are not there, you can find their current location by running
-the following command:
+The default location for Monitor logs is ``/var/log/ceph/ceph-mon.FOO.log*``.
+It is possible that the location of the Monitor logs has been changed from the
+default. If the location of the Monitor logs has been changed from the default
+location, find the location of the Monitor logs by running the following
+command:

 .. prompt:: bash

@@ -648,32 +651,34 @@ cluster's configuration files. If Ceph is using the default debug levels, then
 your logs might be missing important information that would help the upstream
 Ceph community address your issue.

-To make sure your monitor logs contain relevant information, you can raise
-debug levels. Here we are interested in information from the monitors. As with
-other components, the monitors have different parts that output their debug
+Raise debug levels to make sure that your Monitor logs contain relevant
+information. Here we are interested in information from the Monitors. As with
+other components, the Monitors have different parts that output their debug
 information on different subsystems.

 If you are an experienced Ceph troubleshooter, we recommend raising the debug
-levels of the most relevant subsystems. Of course, this approach might not be
-easy for beginners. In most cases, however, enough information to address the
-issue will be secured if the following debug levels are entered::
+levels of the most relevant subsystems. This approach might not be easy for
+beginners. In most cases, however, enough information to address the issue will
+be logged if the following debug levels are entered::

     debug_mon = 10
     debug_ms = 1

 Sometimes these debug levels do not yield enough information. In such cases,
-members of the upstream Ceph community might ask you to make additional changes
+members of the upstream Ceph community will ask you to make additional changes
 to these or to other debug levels. In any case, it is better for us to receive
 at least some useful information than to receive an empty log.


 Do I need to restart a monitor to adjust debug levels?
 ------------------------------------------------------

-No, restarting a monitor is not necessary. Debug levels may be adjusted by
-using two different methods, depending on whether or not there is a quorum:
+No. It is not necessary to restart a Monitor when adjusting its debug levels.
+
+There are two different methods for adjusting debug levels. One method is used
+when there is quorum. The other is used when there is no quorum.

-**If there is a quorum**
+**Adjusting debug levels when there is a quorum**

 Either inject the debug option into the specific monitor that needs to
 be debugged::
@@ -685,17 +690,19 @@ using two different methods, depending on whether or not there is a quorum:
     ceph tell mon.* config set debug_mon 10/10


-**If there is no quorum**
+**Adjusting debug levels when there is no quorum**

 Use the admin socket of the specific monitor that needs to be debugged
 and directly adjust the monitor's configuration options::

     ceph daemon mon.FOO config set debug_mon 10/10

+**Returning debug levels to their default values**

 To return the debug levels to their default values, run the above commands
-using the debug level ``1/10`` rather than ``10/10``. To check a monitor's
-current values, use the admin socket and run either of the following commands:
+using the debug level ``1/10`` rather than the debug level ``10/10``. To check
+a Monitor's current values, use the admin socket and run either of the
+following commands:

 .. prompt:: bash

@@ -712,17 +719,17 @@ or:
 I Reproduced the problem with appropriate debug levels. Now what?
 -----------------------------------------------------------------

-We prefer that you send us only the portions of your logs that are relevant to
-your monitor problems. Of course, it might not be easy for you to determine
-which portions are relevant so we are willing to accept complete and
-unabridged logs. However, we request that you avoid sending logs containing
-hundreds of thousands of lines with no additional clarifying information. One
-common-sense way of making our task easier is to write down the current time
-and date when you are reproducing the problem and then extract portions of your
+Send the upstream Ceph community only the portions of your logs that are
+relevant to your Monitor problems. Because it might not be easy for you to
+determine which portions are relevant, the upstream Ceph community accepts
+complete and unabridged logs. But don't send logs containing hundreds of
+thousands of lines with no additional clarifying information. One common-sense
+way to help the Ceph community help you is to write down the current time and
+date when you are reproducing the problem and then extract portions of your
 logs based on that information.

-Finally, reach out to us on the mailing lists or IRC or Slack, or by filing a
-new issue on the `tracker`_.
+Contact the upstream Ceph community on the mailing lists or IRC or Slack, or by
+filing a new issue on the `tracker`_.

 .. _tracker: http://tracker.ceph.com/projects/ceph/issues/new

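
The edited text advises writing down the time at which the problem was reproduced and then extracting the matching portion of the Monitor log. The following is a minimal sketch of that idea and is not part of this commit or of Ceph. It assumes that each log line begins with an ISO 8601 timestamp such as ``2023-11-27T10:15:30.123+0000``; older releases may use a different layout, in which case ``TIMESTAMP_FORMAT`` would need adjusting. The names ``parse_timestamp`` and ``extract_window`` are hypothetical helpers introduced only for illustration.

```python
from datetime import datetime, timedelta

# Assumed timestamp layout at the start of each log line, for example
# "2023-11-27T10:15:30.123+0000". Adjust this if your logs differ.
TIMESTAMP_FORMAT = "%Y-%m-%dT%H:%M:%S.%f%z"


def parse_timestamp(line):
    """Return the datetime at the start of a log line, or None if absent."""
    token = line.split(" ", 1)[0].strip()
    try:
        return datetime.strptime(token, TIMESTAMP_FORMAT)
    except ValueError:
        return None


def extract_window(lines, noted_time, margin=timedelta(minutes=5)):
    """Keep lines stamped within `margin` of `noted_time`.

    `noted_time` must be timezone-aware so that it can be compared with
    the parsed timestamps. Lines without a timestamp (continuations such
    as stack traces) follow the decision made for the most recent
    stamped line.
    """
    start, end = noted_time - margin, noted_time + margin
    keep = False
    selected = []
    for line in lines:
        stamp = parse_timestamp(line)
        if stamp is not None:
            keep = start <= stamp <= end
        if keep:
            selected.append(line)
    return selected
```

To use the sketch, read the Monitor log (by default ``/var/log/ceph/ceph-mon.FOO.log``) into a list of lines, pass it to ``extract_window`` together with the noted time, and write the result to a new file. The five-minute margin is an arbitrary default. Widen it if the problem develops slowly.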
