Commit 47ccb91

[DFSM] Retry the cluster readiness check with more retries and a shorter delay.
In particular: 10 retries with a 90s delay rather than 5 retries with a 180s delay.
Signed-off-by: Giacomo Marciani <[email protected]>
1 parent b82c8d9 commit 47ccb91
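
The arithmetic behind this change can be sketched in plain Ruby. This is an illustrative model only, not code from the cookbook: `RetryPolicy` and its methods are hypothetical names. It assumes that `retries N` allows N retries after the first attempt (N + 1 attempts in total) and that `retry_delay` is the sleep between attempts. Under those assumptions both settings sleep for the same total time, but the new one polls twice as often, so it should notice a ready cluster sooner.

```ruby
# Illustrative model only; RetryPolicy is a hypothetical helper, not cookbook code.
# Assumption: `retries N` means N retries after the first attempt (N + 1 attempts),
# and `retry_delay` is the number of seconds slept between attempts.
RetryPolicy = Struct.new(:retries, :retry_delay, :timeout) do
  # Total seconds spent sleeping between attempts if every attempt fails.
  def total_delay
    retries * retry_delay
  end

  # Upper bound on elapsed seconds if every attempt also runs to its timeout.
  def worst_case_duration
    (retries + 1) * timeout + total_delay
  end

  # Start time (in seconds) of the first attempt at or after `ready_at`,
  # assuming each attempt returns immediately; nil if the retries run out first.
  def detection_time(ready_at)
    (0..retries).map { |i| i * retry_delay }.find { |t| t >= ready_at }
  end
end

old_policy = RetryPolicy.new(5, 180, 30)   # values before this commit
new_policy = RetryPolicy.new(10, 90, 30)   # values after this commit

puts old_policy.total_delay          # 900
puts new_policy.total_delay          # 900
puts old_policy.worst_case_duration  # 1080
puts new_policy.worst_case_duration  # 1230
puts old_policy.detection_time(200)  # 360
puts new_policy.detection_time(200)  # 270
```

In this model, a cluster that becomes ready 200 seconds after the first check is detected at 270 seconds instead of 360 seconds, at the cost of a slightly longer worst case when every attempt runs to its 30-second timeout.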

3 files changed, 6 additions and 6 deletions

cookbooks/aws-parallelcluster-slurm/libraries/helpers.rb

Lines changed: 2 additions & 2 deletions
@@ -173,8 +173,8 @@ def wait_cluster_ready
               " --config-version #{node['cluster']['cluster_config_version']}" \
               " --region #{node['cluster']['region']}"
     timeout 30
-    retries 5
-    retry_delay 180
+    retries 10
+    retry_delay 90
   end
 end

cookbooks/aws-parallelcluster-slurm/spec/unit/recipes/finalize_head_node_spec.rb

Lines changed: 2 additions & 2 deletions
@@ -48,8 +48,8 @@
       is_expected.to run_execute("Check cluster readiness").with(
         command: expected_command,
         timeout: 30,
-        retries: 5,
-        retry_delay: 180
+        retries: 10,
+        retry_delay: 90
       )
     end
   end

cookbooks/aws-parallelcluster-slurm/spec/unit/recipes/update_head_node_spec.rb

Lines changed: 2 additions & 2 deletions
@@ -41,8 +41,8 @@
       is_expected.to run_execute("Check cluster readiness").with(
         command: expected_command,
         timeout: 30,
-        retries: 5,
-        retry_delay: 180
+        retries: 10,
+        retry_delay: 90
       )
     end
   end
