Releases: aws/aws-parallelcluster
AWS ParallelCluster v2.11.7
We're excited to announce the release of AWS ParallelCluster 2.11.7
Upgrade
How to upgrade?
sudo pip install aws-parallelcluster==2.11.7
CHANGES
- Upgrade Slurm to version 20.11.9.
AWS ParallelCluster v3.1.4
We're excited to announce the release of AWS ParallelCluster 3.1.4
Upgrade
How to upgrade?
sudo pip install --upgrade aws-parallelcluster
ENHANCEMENTS
- Add validation for DirectoryService/PasswordSecretArn to fail in case the secret does not exist.
CHANGES
- Upgrade Slurm to version 21.08.8-2.
- Build Slurm with JWT support.
- Do not require PlacementGroup/Enabled to be set to true when passing an existing PlacementGroup/Id.
- Add lambda:TagsResource to ParallelClusterUserRole used by ParallelCluster API stack for cluster creation and image creation.
BUG FIXES
- Fix the ability to export cluster's logs when using export-cluster-logs command with the --filters option.
- Fix AWS Batch Docker entrypoint to use /home shared directory to coordinate Multi-node-Parallel job execution.
AWS ParallelCluster v2.11.6
We're excited to announce the release of AWS ParallelCluster 2.11.6
Upgrade
How to upgrade?
sudo pip install aws-parallelcluster==2.11.6
ENHANCEMENTS
- Improve exception management in case of missing networking.
CHANGES
- OS package updates and security fixes.
AWS ParallelCluster v3.1.3
We're excited to announce the release of AWS ParallelCluster 3.1.3
Upgrade
How to upgrade?
sudo pip install --upgrade aws-parallelcluster
ENHANCEMENTS
- Execute SSH key creation alongside the creation of the HOME directory, i.e. during SSH login, when switching to another user and when executing a command as another user.
- Add support for both FQDN and LDAP Distinguished Names in the configuration parameter DirectoryService/DomainName. The new validator now checks both syntaxes.
- New update_directory_service_password.sh script deployed on the head node supports the manual update of the Active Directory password in the SSSD configuration. The password is retrieved from AWS Secrets Manager, as specified in the cluster configuration.
- Add support to deploy API infrastructure in environments without a default VPC.
- Add validation for DirectoryService/AdditionalSssdConfigs to fail in case of invalid overrides.
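
The DirectoryService/DomainName enhancement above accepts two syntaxes. As a rough illustration only, a check that accepts either form could look like the sketch below. This is not ParallelCluster's validator: the function name and both regular expressions are assumptions made for this example, and the real validator may be stricter or looser.

```python
import re

# Hypothetical patterns for illustration only.
# FQDN: dot-separated labels of letters, digits and hyphens, e.g. corp.example.com
FQDN_RE = re.compile(
    r"^(?=.{1,253}$)([A-Za-z0-9]([A-Za-z0-9-]{0,61}[A-Za-z0-9])?\.)+[A-Za-z]{2,63}$"
)
# LDAP Distinguished Name built from domain components, e.g. DC=corp,DC=example,DC=com
LDAP_DN_RE = re.compile(
    r"^(DC=[A-Za-z0-9-]+)(\s*,\s*DC=[A-Za-z0-9-]+)*$", re.IGNORECASE
)


def classify_domain_name(value: str) -> str:
    """Return 'fqdn' or 'ldap_dn'; raise ValueError for any other syntax."""
    if FQDN_RE.match(value):
        return "fqdn"
    if LDAP_DN_RE.match(value):
        return "ldap_dn"
    raise ValueError(f"not an FQDN or LDAP Distinguished Name: {value!r}")
```

Under these assumed patterns, corp.example.com is classified as an FQDN and DC=corp,DC=example,DC=com as a Distinguished Name, while any other string is rejected.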
CHANGES
- Disable deeper C-States in x86_64 official AMIs and AMIs created through build-image command, to guarantee high performance and low latency.
- OS package updates and security fixes.
- Change Amazon Linux 2 base images to use AMIs with Kernel 5.10.
BUG FIXES
- Fix build-image stack ending in DELETE_FAILED after a successful image build, due to new EC2 ImageBuilder policies.
- Fix the conversion of the configuration parameter DirectoryService/DomainAddr to the ldap_uri SSSD property when it contains multiple domain addresses.
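
To make the DomainAddr fix above concrete: SSSD's ldap_uri property takes a comma-separated list of URIs, so a value holding several addresses has to be converted entry by entry. The sketch below is an assumption about what such a conversion might look like, not the code shipped in ParallelCluster. The function name is hypothetical, and so is the rule that entries without a scheme default to ldap://.

```python
def domain_addr_to_ldap_uri(domain_addr: str) -> str:
    """Build a comma-separated URI list from a comma-separated DomainAddr value.

    Assumption for this sketch: entries without an ldap:// or ldaps:// scheme
    are given the ldap:// scheme.
    """
    uris = []
    for address in domain_addr.split(","):
        address = address.strip()
        if not address:
            continue  # ignore empty entries such as a trailing comma
        if not address.startswith(("ldap://", "ldaps://")):
            address = "ldap://" + address
        uris.append(address)
    return ",".join(uris)
```

For example, an input of "ldaps://dc1.corp.example.com, 10.0.0.5" (made-up addresses) would yield one URI per entry, joined by commas with the surrounding whitespace removed.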
AWS ParallelCluster v2.11.5
We're excited to announce the release of AWS ParallelCluster 2.11.5
Upgrade
How to upgrade?
sudo pip install aws-parallelcluster==2.11.5
ENHANCEMENTS
- Add support for NEW_CHANGED_DELETED as value of FSx for Lustre AutoImportPolicy option.
CHANGES
- Drop support for SGE and Torque schedulers.
- Disable log4j-cve-2021-44228-hotpatch service on Amazon Linux to avoid potential performance degradation.
- Upgrade Intel MPI Library to 2021.4.0.441.
- Upgrade NVIDIA driver to version 470.103.01.
- Upgrade CUDA library to version 11.4.4.
- Upgrade NVIDIA Fabric manager to version 470.103.01.
- Extend head node creation timeout to 1h.
BUG FIXES
- Fix DCV connection through browsers.
- Fix YAML quoting to prevent custom Tags being parsed as numbers.
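
The YAML quoting fix above addresses a general YAML behaviour: an unquoted scalar such as 12345 is read as a number, whereas a quoted one stays a string. A minimal sketch of the idea follows, assuming a hypothetical helper name. It is not the code used by ParallelCluster, and it relies on the fact that a JSON string literal is also a valid YAML double-quoted scalar.

```python
import json


def quote_tag_value(value) -> str:
    """Render a tag value as a double-quoted scalar so YAML keeps it a string.

    Hypothetical helper for illustration: json.dumps produces a JSON string
    literal, which is also valid as a YAML double-quoted scalar.
    """
    return json.dumps(str(value))


# Build one YAML line for a made-up tag whose value looks like a number.
line = f"CostCenter: {quote_tag_value(12345)}"
```

Here `line` holds the text `CostCenter: "12345"`, which a YAML parser reads as the string "12345" rather than the integer 12345.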
AWS ParallelCluster v3.1.2
We're excited to announce the release of AWS ParallelCluster 3.1.2
Upgrade
How to upgrade?
sudo pip install --upgrade aws-parallelcluster
CHANGES
- Upgrade Slurm to version 21.08.6.
BUG FIXES
- Fix the update of /etc/hosts file on compute nodes when a cluster is deployed in subnets without internet access.
- Fix compute nodes bootstrap by waiting for ephemeral drives initialization before joining the cluster.
AWS ParallelCluster v3.1.1
We're excited to announce the release of AWS ParallelCluster 3.1.1
Upgrade
How to upgrade?
sudo pip install --upgrade aws-parallelcluster
ENHANCEMENTS
- Add support for multi-user cluster environments by integrating with Active Directory (AD) domains managed via AWS Directory Service.
- Enable cluster creation in subnets with no internet access.
- Add abbreviated flags for cluster-name (-n), region (-r), image-id (-i) and cluster-configuration/image-configuration (-c) to the CLI.
- Add support for multiple compute resources with same instance type per queue.
- Add support for UseEc2Hostnames in the cluster configuration file. When set to true, use EC2 default hostnames (e.g. ip-1-2-3-4) for compute nodes.
- Add support for GPU scheduling with Slurm on ARM instances with NVIDIA cards. Install NVIDIA drivers and CUDA library for ARM.
- Add parallelcluster:compute-resource-name tag to LaunchTemplates used by compute nodes.
- Add support for NEW_CHANGED_DELETED as value of FSx for Lustre AutoImportPolicy option.
- Explicitly set cloud-init datasource to be EC2. This saves boot time for Ubuntu and CentOS platforms.
- Improve Security Groups created within the cluster to allow inbound connections from custom security groups when SecurityGroups parameter is specified for head node and/or queues.
- Build Slurm with slurmrestd support.
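
The UseEc2Hostnames entry above gives ip-1-2-3-4 as an example of an EC2 default hostname. As a small illustration of that IP-based naming pattern, the sketch below derives the short hostname from a private IPv4 address. The function name is hypothetical, and the code only shows the naming convention, not how ParallelCluster assigns hostnames.

```python
import ipaddress


def ec2_default_hostname(private_ip: str) -> str:
    """Return the short IP-based hostname for a private IPv4 address."""
    # IPv4Address raises ValueError (AddressValueError) on malformed input.
    ip = ipaddress.IPv4Address(private_ip)
    return "ip-" + str(ip).replace(".", "-")
```

Calling `ec2_default_hostname("1.2.3.4")` returns `"ip-1-2-3-4"`, matching the example in the release note.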
CHANGES
- Upgrade Slurm to version 21.08.5.
- Upgrade NICE DCV to version 2021.3-11591.
- Upgrade NVIDIA driver to version 470.103.01.
- Upgrade CUDA library to version 11.4.4.
- Upgrade NVIDIA Fabric manager to version 470.103.01.
- Upgrade Intel MPI Library to 2021.4.0.441.
- Upgrade PMIx to version 3.2.3.
- Disable package update at instance launch time on Amazon Linux 2.
- Allow suppressing the SlurmQueues and ComputeResources length validators.
- Use compute resource name rather than instance type in compute fleet Launch Template name.
- Disable EC2 ImageBuilder enhanced image metadata when building ParallelCluster custom images.
- Remove dumping of failed compute nodes to /home/logs/compute. Compute nodes log files are available in CloudWatch and in EC2 console logs.
BUG FIXES
- Redirect stderr and stdout to CLI log file to prevent unwanted text from polluting the pcluster CLI output.
- Fix exporting of cluster logs when no prefix is specified; previously they were exported to a None prefix.
- Fix rollback not being performed in case of cluster update failure.
- Do not configure GPUs in Slurm when NVIDIA driver is not installed.
- Fix ecs:ListContainerInstances permission in BatchUserRole.
- Fix RootVolume schema for the HeadNode by raising an error if unsupported KmsKeyId is specified.
- Fix EfaSecurityGroupValidator. Previously, it could produce false failures when custom security groups were provided and EFA was enabled.
- Fix FSx metrics not displayed in CloudWatch Dashboard.
AWS ParallelCluster v3.0.3
We're excited to announce the release of AWS ParallelCluster 3.0.3
Upgrade
How to upgrade?
sudo pip install --upgrade aws-parallelcluster
CHANGES
- Disable log4j-cve-2021-44228-hotpatch service on Amazon Linux to avoid potential performance degradation.
AWS ParallelCluster v2.11.4
We're excited to announce the release of AWS ParallelCluster 2.11.4
Upgrade
How to upgrade?
sudo pip install aws-parallelcluster==2.11.4
CHANGES
- CentOS 8 is no longer supported (EOL on December 31st, 2021).
- Upgrade Slurm to version 20.11.8.
- Upgrade Cinc Client to version 17.2.29.
- Upgrade NICE DCV to version 2021.2-11190.
- Upgrade NVIDIA driver to version 470.82.01.
- Upgrade CUDA library to version 11.4.3.
- Upgrade NVIDIA Fabric manager to 470.82.01.
- Disable package updates at instance launch time on Amazon Linux 2.
- Disable unattended package updates on Ubuntu.
- Install Python 3 version of aws-cfn-bootstrap scripts on CentOS 7 and Ubuntu 18.04, aligning with Ubuntu 20.04 and Amazon Linux 2.
BUG FIXES
- Disable update of ec2_iam_role parameter.
- Fix CpuOptions configuration in LaunchTemplate for t2 instances.
AWS ParallelCluster v3.0.2
We're excited to announce the release of AWS ParallelCluster 3.0.2
Upgrade
How to upgrade?
sudo pip install --upgrade aws-parallelcluster
CHANGES
- Upgrade EFA installer to version 1.14.1. Thereafter, EFA enables GDR support by default on supported instance type(s). ParallelCluster does not reinstall EFA during node start. Previously, EFA was reinstalled if GdrSupport had been turned on in the configuration file. The GdrSupport parameter has no effect and should no longer be used.
  - EFA configuration: efa-config-1.9-1
  - EFA profile: efa-profile-1.5-1
  - EFA kernel module: efa-1.14.2
  - RDMA core: rdma-core-37.0
  - Libfabric: libfabric-1.13.2
  - Open MPI: openmpi40-aws-4.1.1-2
BUG FIXES
- Fix issue preventing cluster names from starting with the parallelcluster- prefix.