Skip to content

Commit c897107

Browse files
committed
doc/cephfs: edit troubleshooting.rst
Edit the section "Disconnected+Remounted FS" in the file doc/cephfs/troubleshooting.rst. Signed-off-by: Zac Dover <[email protected]>
1 parent 416c794 commit c897107

File tree

1 file changed

+15
-13
lines changed

1 file changed

+15
-13
lines changed

doc/cephfs/troubleshooting.rst

Lines changed: 15 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -320,16 +320,17 @@ will switch to doing writes synchronously, which is quite slow.
320320

321321
Disconnected+Remounted FS
322322
=========================
323-
Because CephFS has a "consistent cache", if your network connection is
324-
disrupted for a long enough time, the client will be forcibly
325-
disconnected from the system. At this point, the kernel client is in
326-
a bind: it cannot safely write back dirty data, and many applications
327-
do not handle IO errors correctly on close().
328-
At the moment, the kernel client will remount the FS, but outstanding file system
329-
IO may or may not be satisfied. In these cases, you may need to reboot your
323+
324+
Because CephFS has a "consistent cache", your client is forcibly disconnected
325+
from the cluster when the network connection has been disrupted for a long
326+
time. When this happens, the kernel client cannot safely write back dirty data
327+
and many applications will not handle IO errors correctly on ``close()``.
328+
Currently, the kernel client will remount the file system, but any outstanding
329+
file-system IO may not be properly handled. If this is the case, reboot the
330330
client system.
331331

332-
You can identify you are in this situation if dmesg/kern.log report something like::
332+
You are in this situation if the output of ``dmesg/kern.log`` contains
333+
something like the following::
333334

334335
Jul 20 08:14:38 teuthology kernel: [3677601.123718] ceph: mds0 closed our session
335336
Jul 20 08:14:38 teuthology kernel: [3677601.128019] ceph: mds0 reconnect start
@@ -340,11 +341,12 @@ You can identify you are in this situation if dmesg/kern.log report something li
340341
Jul 20 08:14:40 teuthology kernel: [3677603.126214] libceph: mds0 172.21.5.114:6812 connection reset
341342
Jul 20 08:14:40 teuthology kernel: [3677603.132176] libceph: reset on mds0
342343

343-
This is an area of ongoing work to improve the behavior. Kernels will soon
344-
be reliably issuing error codes to in-progress IO, although your application(s)
345-
may not deal with them well. In the longer-term, we hope to allow reconnect
346-
and reclaim of data in cases where it won't violate POSIX semantics (generally,
347-
data which hasn't been accessed or modified by other clients).
344+
This is an area of ongoing work to improve the behavior. Kernels will soon be
345+
reliably issuing error codes to in-progress IO, although your application(s)
346+
may not deal with them well. In the longer term, we hope to allow reconnection
347+
and reclamation of data in cases where doing so does not violate POSIX
348+
semantics (generally, data which hasn't been accessed or modified by other
349+
clients).
348350

349351
Mounting
350352
========

0 commit comments

Comments
 (0)