Skip to content

Commit 34c2037

Browse files
committed
increase default reconfiguration period
1 parent d4e7e55 commit 34c2037

File tree

6 files changed

+6
-6
lines changed

6 files changed

+6
-6
lines changed

api/v1/slurmcluster_types.go

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -343,7 +343,7 @@ type SConfigController struct {
343343
// ReconfigureWaitTimeout defines the maximum time to wait for all nodes to restart during reconfiguration.
344344
// Must be greater than ReconfigurePollInterval. Defaults to 1m
345345
// +kubebuilder:validation:Optional
346-
// +kubebuilder:default="1m"
346+
// +kubebuilder:default="5m"
347347
ReconfigureWaitTimeout *string `json:"reconfigureWaitTimeout,omitempty"`
348348

349349
// HostUsers controls if the pod containers can use the host user namespace

config/crd/bases/slurm.nebius.ai_slurmclusters.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3373,7 +3373,7 @@ spec:
33733373
Defaults to 20s
33743374
type: string
33753375
reconfigureWaitTimeout:
3376-
default: 1m
3376+
default: 5m
33773377
description: |-
33783378
ReconfigureWaitTimeout defines the maximum time to wait for all nodes to restart during reconfiguration.
33793379
Must be greater than ReconfigurePollInterval. Defaults to 1m

helm/slurm-cluster/values.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -525,7 +525,7 @@ sConfigController:
525525
runAsUid: 1001
526526
runAsGid: 1001
527527
reconfigurePollInterval: "20s"
528-
reconfigureWaitTimeout: "1m"
528+
reconfigureWaitTimeout: "5m"
529529
serviceMonitor:
530530
enabled: true
531531
jobLabel: "sconfigcontroller"

helm/soperator-crds/templates/slurmcluster-crd.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -29175,7 +29175,7 @@ spec:
2917529175
Defaults to 20s
2917629176
type: string
2917729177
reconfigureWaitTimeout:
29178-
default: 1m
29178+
default: 5m
2917929179
description: |-
2918029180
ReconfigureWaitTimeout defines the maximum time to wait for all nodes to restart during reconfiguration.
2918129181
Must be greater than ReconfigurePollInterval. Defaults to 1m

helm/soperator/crds/slurmcluster-crd.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -29175,7 +29175,7 @@ spec:
2917529175
Defaults to 20s
2917629176
type: string
2917729177
reconfigureWaitTimeout:
29178-
default: 1m
29178+
default: 5m
2917929179
description: |-
2918029180
ReconfigureWaitTimeout defines the maximum time to wait for all nodes to restart during reconfiguration.
2918129181
Must be greater than ReconfigurePollInterval. Defaults to 1m

internal/controller/sconfigcontroller/jailedconfig_controller.go

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -53,7 +53,7 @@ import (
5353
const (
5454
configMapField = ".spec.configMap.name"
5555

56-
defaultReconfigureWaitTimeout = 1 * time.Minute
56+
defaultReconfigureWaitTimeout = 5 * time.Minute
5757
defaultReconfigurePollInterval = 20 * time.Second
5858
)
5959

0 commit comments

Comments
 (0)