Conversation

@tahliar
Contributor

@tahliar tahliar commented Jan 13, 2026

PR creator: Description

Now that bsc#1248874 is fixed, I've re-tested and completed the article for adding diskless SBD to a running cluster.

I also edited the abstracts of similar articles to be more like this one.

PDF:
HA-sbd-configuring-diskless_en.pdf

PR creator: Are there any relevant issues/feature requests?

  • jsc#DOCTEAM-1985

PR reviewer: Checklist for editorial review

Apart from the usual checks, please double-check also the following:

</screen>
<para>
The output of this command shows the enabled settings in the
<filename>/etc/sysconfig/sbd</filename> file and the &sbd;-related cluster settings.

Since SLE 16.1 (crmsh upstream PR ClusterLabs/crmsh#2003), crmsh includes a new interface, crm cluster health sbd, which checks whether the timeout-related SBD settings and cluster properties are configured correctly, for both disk-based and diskless SBD.

This check is also invoked automatically when running crm sbd configure show and crm sbd status. If everything is configured correctly, the output ends with this line:

INFO: SBD: Check sbd timeout configuration: OK.

Otherwise, warnings and errors are shown.

Example:

# crm sbd configure show
...
<normal output of sbd configure show>

ERROR: It's recommended that SBD_DELAY_START is set to 71, now is 40
WARNING: It's recommended that stonith-timeout is set to 71, now is 100
INFO: Please run "crm cluster health sbd --fix" to fix the above error on the running cluster
ERROR: SBD: Check sbd timeout configuration: FAIL.

In this documentation, I suggest mentioning that crmsh checks the health of the SBD configuration when running crm sbd configure show and crm sbd status, and displays warnings and errors if something is configured incorrectly.

Alternatively, users can run crm cluster health sbd directly to obtain the same results.

Example:

# crm cluster health sbd
ERROR: It's recommended that SBD_DELAY_START is set to 71, now is 40
WARNING: It's recommended that stonith-timeout is set to 71, now is 100
INFO: Please run "crm cluster health sbd --fix" to fix the above error on the running cluster
ERROR: SBD: Check sbd timeout configuration: FAIL.
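
For anyone who wants to script against this check, here is a rough Python sketch that parses output shaped like the example above. The message formats are inferred only from the examples in this comment and may differ in real crmsh output, and `parse_sbd_health` is a hypothetical helper name, not part of crmsh.

```python
import re

# Sample text copied from the example above; real crmsh output may differ.
SAMPLE = """\
ERROR: It's recommended that SBD_DELAY_START is set to 71, now is 40
WARNING: It's recommended that stonith-timeout is set to 71, now is 100
INFO: Please run "crm cluster health sbd --fix" to fix the above error on the running cluster
ERROR: SBD: Check sbd timeout configuration: FAIL.
"""

# Assumed message shapes, inferred only from the examples in this thread.
RECOMMEND_RE = re.compile(
    r"^(ERROR|WARNING): It's recommended that (\S+) is set to (\d+), now is (\d+)$"
)
SUMMARY_RE = re.compile(
    r"^(?:INFO|ERROR): SBD: Check sbd timeout configuration: (OK|FAIL)\.$"
)


def parse_sbd_health(text):
    """Return (passed, findings) for text shaped like the SBD health check output.

    passed is True/False from the final summary line, or None if no summary
    line was found. findings lists each "recommended vs. current" message.
    """
    passed = None
    findings = []
    for raw_line in text.splitlines():
        line = raw_line.strip()
        match = RECOMMEND_RE.match(line)
        if match:
            level, name, recommended, current = match.groups()
            findings.append(
                {
                    "level": level,
                    "name": name,
                    "recommended": int(recommended),
                    "current": int(current),
                }
            )
            continue
        match = SUMMARY_RE.match(line)
        if match:
            passed = match.group(1) == "OK"
    return passed, findings


if __name__ == "__main__":
    ok, issues = parse_sbd_health(SAMPLE)
    print("check passed:", ok)
    for issue in issues:
        print(issue)
```

Unrelated lines, such as the normal output of crm sbd configure show, are simply ignored, so the same function could be fed the full output of that command.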

And if you mention this in the diskless SBD doc, please remember to add the same description to the disk-based SBD doc.

Or should this part be handled in a separate PED task for the documentation?

@zzhou1

Contributor Author

Thanks @liangxin1300, that's a cool feature! I agree that it should be a separate doc task for 16.1, as I'll be backporting this PR to 16.0 as well.

Or should this part be handled in a separate PED task for the documentation?
@zzhou1

It makes sense for HA 16.1. Here you go: https://jira.suse.com/browse/PED-15185

Contributor Author

Thank you @zzhou1 :)

@liangxin1300

The other parts look good to me.

Contributor

@dariavladykina dariavladykina left a comment

LGTM!

Contributor

@lvicoun lvicoun left a comment

Hi Tahlia,
LGTM. Thanks!

@tahliar tahliar merged commit 9afc687 into main Jan 16, 2026
11 checks passed
@tahliar tahliar deleted the tahliar/DOCTEAM-1985-diskless-sbd branch January 16, 2026 05:42
tahliar added a commit that referenced this pull request Jan 16, 2026
* Test and complete diskless SBD article

jsc#DOCTEAM-1985

* Add back-up full version of SBD

* Update abstracts

* Add some conditions for diskless/diskbased
6 participants