Skip to content

Conversation

sjpb
Copy link
Collaborator

@sjpb sjpb commented May 13, 2025

Bumps slurm versions to fix CVE-2025-43904.

Caution

This is a Slurm major version update for RockyLinux 9 (= OpenHPC v3 clusters).

These clusters will perform a Slurm database upgrade on slurmdbd startup. The startup timeout for that service has been increased to 45 minutes to allow for that. However it is recommended that this database (in /var/lib/state/mysql on the control node) is backed-up before starting slurmdbd, for example by snapshotting the $CLUSTER_NAME-state volume after the reimage (so the service is stopped) but before running the site.yml playbook.

Note non-upgrade OpenHPC repos have no new snapshots.

@sjpb
Copy link
Collaborator Author

sjpb commented May 13, 2025

@sjpb
Copy link
Collaborator Author

sjpb commented May 13, 2025

Above failures are caused by the ondemand-web repos being down.

@sjpb sjpb marked this pull request as ready for review May 14, 2025 09:18
@sjpb sjpb requested a review from a team as a code owner May 14, 2025 09:18
Copy link
Collaborator

@m-bull m-bull left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good - quick var rename to reflect the stringy nature of the variable

@sjpb sjpb requested a review from m-bull May 14, 2025 09:57
Copy link
Collaborator

@m-bull m-bull left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM - Vars will be removed shortly anyway

@sjpb sjpb merged commit 5a7608b into main May 14, 2025
7 checks passed
@sjpb sjpb deleted the feat/ohpc-3.3.1 branch May 14, 2025 10:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants