CHANGELOG.md
Lines changed: 9 additions & 6 deletions
@@ -12,16 +12,19 @@ CHANGELOG
 - Add support for p6e-gb200 instances via capacity blocks.
 - Echo chef-client log when a node fails to bootstrap. This helps with investigating bootstrap failures in cases CloudWatch logs are not available.
 - Add `build-image` support for kernel 6.12 of Amazon Linux 2023. The official ParallelCluster Amazon Linux 2023 AMIs use kernel 6.12.
+- Support `prioritized` and `capacity-optimized-prioritized` Allocation Strategy. This allows users to prioritize subnets for instance placement to optimize costs and performance.
 
 **CHANGES**
+- Install nvidia-imex for all OSs except AL2.
 - Ubuntu 20.04 is no longer supported.
-- Support prioritized and capacity-optimized-prioritized Allocation Strategy
+- Remove `UnkillableStepTimeout` from slurm.conf and let slurm set this value.
 - Support DCV on Amazon Linux 2023.
 - Upgrade Python runtime used by Lambda functions to python3.12 (from python3.9).
 - Remove `berkshelf`. All cookbooks are local and do not need `berkshelf` dependency management.
-- The build-image command now deploys a global role that is used to automatically delete the build-image stack after images either succeed or fail the build.
-The role is meant to exists even after the stack has been deleted. This is to prevent build-image stack deletion failures, reported in https://github.com/aws/aws-parallelcluster/issues/5914
+- The build-image command now deploys a global role that is used to automatically delete the build-image stack after images either succeed or fail the build.
+The role is meant to exist even after the stack has been deleted. This is to prevent build-image stack deletion failures, reported in https://github.com/aws/aws-parallelcluster/issues/5914.
 - Add the configuration parameter `HeadNode/SharedStorageEfsSettings/Encrypted` to enable encryption on the EFS file system used for the head node internal shared storage.
+- Add validator that warns against using non GPU instances with DCV.
 - Upgrade Slurm to version 24.11.6 (from 24.05.8).
 - Upgrade EFA installer to 1.42.0 (from 1.41.0).
   - Efa-driver: efa-2.15.3-1
@@ -30,18 +33,18 @@ CHANGELOG
   - Libfabric-aws: libfabric-aws-2.1.0-3
   - Rdma-core: rdma-core-57.0-1
   - Open MPI: openmpi40-aws-4.1.7-2 and openmpi50-aws-5.0.6-11
-- Upgrade Cinc Client to version to 18.4.12 from 18.2.7.
+- Upgrade Cinc Client to version 18.4.12 (from 18.2.7).
 - Upgrade NVIDIA driver to version 570.172.08 (from 570.86.15) for all OSs except AL2.
 - Upgrade CUDA Toolkit to version 12.8.1 (from 12.8.0) for all OSs except AL2.
 - Upgrade DCGM to version 4.2.3 (from 3.3.6) for all OSs except AL2.
 - Upgrade Python to 3.12.11 (from 3.12.8) for all OSs except AL2.
 - Upgrade Intel MPI Library to 2021.16.0 (from 2021.13.1).
-- Add validator that warns against using non GPU instances with DCV.
 
 **BUG FIXES**
 - Fix an issue where Security Group validation failed when a rule contained both IPv4 ranges (IpRanges) and security group references (UserIdGroupPairs).
 - Fix `build-image` failure on Rocky 9, occurring when the parent image does not ship the latest kernel version on the latest Rocky minor version.
-- Fix AWS Batch cluster creation failures in China when the OS is Amazon Linux 2023.
+- Fix AWS Batch cluster creation failures in China when the OS is Amazon Linux 2023.
+- Fix cluster id mismatch issue by deleting the file `/var/spool/slurm.state/clustername` before configuring Slurm accounting.