Commit 428430b

jianyinglangawsamanshanbhagKeitaW
authored
Update Llama3 70B and remove the neuron-top check in the update_neuron_sdk.sh script (aws-samples#530)
* Update the Neuron SDK to 2.21.0
* Update the Llama3-70B pretraining with the Neuron SDK 2.21
* Fix a typo
* Add --hw_backend trn1 in the convert_checkpoint command
* More updates
* Update the update_neuron_sdk.sh by removing the neuron-top check
* Keep enable_update_neuron_sdk as False by default
* Update automate-eks-cluster-creation.sh (aws-samples#529)
  Minor bug fix
* Update according to the review comments.
* Minor updates in doc

---------

Co-authored-by: Aman Shanbhag <[email protected]>
Co-authored-by: Keita Watanabe <[email protected]>
1 parent fbe200e commit 428430b

2 files changed: +174 -72 lines changed
  • 1.architectures/5.sagemaker-hyperpod/LifecycleScripts/base-config/utils
  • 3.test_cases/neuronx-distributed/llama3/slurm

Lines changed: 21 additions & 26 deletions
@@ -1,33 +1,28 @@
 #!/bin/bash
 set -euxo pipefail
-if neuron-top -v &> /dev/null
-then
-echo "Update to Neuron SDK Release 2.21.0"
-# Configure Linux for Neuron repository updates
-. /etc/os-release
 
-sudo echo "deb https://apt.repos.neuron.amazonaws.com ${VERSION_CODENAME} main" | sudo tee /etc/apt/sources.list.d/neuron.list
-wget -qO - https://apt.repos.neuron.amazonaws.com/GPG-PUB-KEY-AMAZON-AWS-NEURON.PUB | sudo apt-key add -
+echo "Update to Neuron SDK Release 2.21.0"
+# Configure Linux for Neuron repository updates
+. /etc/os-release
 
-# Update OS packages
-sudo apt-get update -y
+sudo echo "deb https://apt.repos.neuron.amazonaws.com ${VERSION_CODENAME} main" | sudo tee /etc/apt/sources.list.d/neuron.list
+wget -qO - https://apt.repos.neuron.amazonaws.com/GPG-PUB-KEY-AMAZON-AWS-NEURON.PUB | sudo apt-key add -
 
-# Install git
-sudo apt-get install git -y
+# Update OS packages
+sudo apt-get update -y
 
-# Remove preinstalled packages and Install Neuron Driver and Runtime
-sudo apt-get remove aws-neuron-dkms -y
-sudo apt-get remove aws-neuronx-dkms -y
-sudo apt-get remove aws-neuronx-oci-hook -y
-sudo apt-get remove aws-neuronx-runtime-lib -y
-sudo apt-get remove aws-neuronx-collectives -y
-sudo apt-get install aws-neuronx-dkms=2.19.64.0 -y
-sudo apt-get install aws-neuronx-oci-hook=2.6.36.0 -y
-sudo apt-get install aws-neuronx-runtime-lib=2.23.110.0* -y
-sudo apt-get install aws-neuronx-collectives=2.23.133.0* -y
+# Remove preinstalled packages and Install Neuron Driver and Runtime
+sudo apt-get remove aws-neuron-dkms -y
+sudo apt-get remove aws-neuronx-dkms -y
+sudo apt-get remove aws-neuronx-oci-hook -y
+sudo apt-get remove aws-neuronx-runtime-lib -y
+sudo apt-get remove aws-neuronx-collectives -y
+sudo apt-get install aws-neuronx-dkms=2.19.64.0 -y
+sudo apt-get install aws-neuronx-oci-hook=2.6.36.0 -y
+sudo apt-get install aws-neuronx-runtime-lib=2.23.110.0* -y
+sudo apt-get install aws-neuronx-collectives=2.23.133.0* -y
 
-# Remove pre-installed package and Install Neuron Tools
-sudo apt-get remove aws-neuron-tools -y
-sudo apt-get remove aws-neuronx-tools -y
-sudo apt-get install aws-neuronx-tools=2.19.0.0 -y
-fi
+# Remove pre-installed package and Install Neuron Tools
+sudo apt-get remove aws-neuron-tools -y
+sudo apt-get remove aws-neuronx-tools -y
+sudo apt-get install aws-neuronx-tools=2.20.204.0 -y
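
Since the script no longer gates on `neuron-top` and now pins `aws-neuronx-tools` to 2.20.204.0, it may be useful to confirm afterwards that the pinned packages actually landed. The following is a minimal sketch and is not part of this commit: the helper names `version_matches`, `installed_version`, and `check_pins` are hypothetical, and treating the trailing `*` in a pin as a shell-style prefix wildcard is an assumption made for illustration. It relies only on `dpkg-query -W -f='${Version}'`.

```shell
#!/bin/bash
# Hypothetical post-install check (not part of the commit): compares installed
# package versions against the pins used in update_neuron_sdk.sh.

# Return 0 when the installed version ($2) matches the pin ($1).
# Assumption: a trailing "*" in the pin is treated as a shell glob.
version_matches() {
    local pin="$1" installed="$2"
    case "$installed" in
        $pin) return 0 ;;   # unquoted on purpose so "*" acts as a wildcard
        *)    return 1 ;;
    esac
}

# Print the installed version of a package, or nothing if it is not installed.
installed_version() {
    dpkg-query -W -f='${Version}' "$1" 2>/dev/null || true
}

# Check each "package=pin" argument; return 1 if any package does not match.
check_pins() {
    local status=0 entry pkg pin have
    for entry in "$@"; do
        pkg="${entry%%=*}"
        pin="${entry#*=}"
        have="$(installed_version "$pkg")"
        if version_matches "$pin" "$have"; then
            echo "OK       $pkg $have"
        else
            echo "MISMATCH $pkg wanted=$pin installed=${have:-none}"
            status=1
        fi
    done
    return "$status"
}

# Example invocation with the pins from the updated script:
# check_pins aws-neuronx-dkms=2.19.64.0 aws-neuronx-oci-hook=2.6.36.0 \
#     'aws-neuronx-runtime-lib=2.23.110.0*' 'aws-neuronx-collectives=2.23.133.0*' \
#     aws-neuronx-tools=2.20.204.0
```

Keeping the check in a separate helper, rather than inside the lifecycle script, avoids changing the behavior of `set -euxo pipefail` during provisioning; a mismatch only shows up as a non-zero return from `check_pins`.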
