@@ -7,11 +7,14 @@ This file is used to list changes made in each version of the AWS ParallelCluste
77------
88
99** ENHANCEMENTS**
10- - Remove UnkillableStepTimeout from slurm.conf and let slurm set this value.
10+ - Add support for p6e-gb200 instances via capacity blocks.
11+ - Remove ` UnkillableStepTimeout ` from slurm.conf and let slurm set this value.
1112- Add ` build-image ` support for kernel 6.12 of Amazon Linux 2023. The official ParallelCluster Amazon Linux 2023 AMIs use kernel 6.12.
1213
1314** CHANGES**
15+ - Install nvidia-imex for all OSs except AL2.
1416- Ubuntu 20.04 is no longer supported.
17+ - Remove ` berkshelf ` . All cookbooks are local and do not need ` berkshelf ` dependency management.
1518- Upgrade Slurm to version 24.11.6 (from 24.05.8).
1619- Upgrade EFA installer to 1.43.2 (from 1.41.0).
1720 - Efa-driver: efa-2.17.2-1
@@ -20,21 +23,18 @@ This file is used to list changes made in each version of the AWS ParallelCluste
2023 - Libfabric-aws: libfabric-aws-2.1.0-5
2124 - Rdma-core: rdma-core-58.0-1
2225 - Open MPI: openmpi40-aws-4.1.7-2 and openmpi50-aws-5.0.6-11
23- - Upgrade Cinc Client to version to 18.4.12 from 18.2.7.
26+ - Upgrade Cinc Client to version 18.4.12 ( from 18.2.7) .
2427- Upgrade NVIDIA driver to version 570.172.08 (from 570.86.15) for all OSs except AL2.
2528- Upgrade CUDA Toolkit to version 12.8.1 (from 12.8.0) for all OSs except AL2.
2629- Upgrade DCGM to version 4.2.3 (from 3.3.6) for all OSs except AL2.
2730- Upgrade Python to 3.12.11 (from 3.12.8) for all OSs except AL2.
2831- Upgrade Python to 3.9.23 (from 3.9.20) for AL2.
2932- Upgrade Intel MPI Library to 2021.16.0 (from 2021.13.1).
30- - Addressed cluster id mismatch known issue by deleting the file ` /var/spool/slurm.state/clustername ` before configuring Slurm accounting.
3133- Upgrade DCV to version 2024.0-19030.
32- - Remove ` berkshelf ` . All cookbooks are local and do not need ` berkshelf ` dependency management.
33- - Add support for GB200 instance types.
34- - Install nvidia-imex for all OSs except AL2.
3534
3635** BUG FIXES**
3736- Fix a race condition in CloudWatch Agent startup that could cause nodes bootstrap failures.
37+ - Fix cluster id mismatch known issue by deleting the file ` /var/spool/slurm.state/clustername ` before configuring Slurm accounting.
3838
39393.13.2
4040------
0 commit comments