Commit 8d539d3

Patch PT ARM64 Inference and Training (#5095)
* Patch PT ARM64 Inference and Training
* fix
* test 2.6
* revert temp changes
1 parent bc28879 commit 8d539d3

2 files changed: +15 -1 lines changed

miscellaneous_scripts/dockerfile_patch_script.sh

Lines changed: 14 additions & 0 deletions
@@ -61,6 +61,20 @@ if [[ $LATEST_RELEASED_IMAGE_URI =~ ^763104351884\.dkr\.ecr\.us-west-2\.amazonaw
     chmod +x /usr/local/bin/start_cuda_compat.sh
 fi
 
+# For PT ARM64 GPU images, patch nvjpeg
+if [[ $LATEST_RELEASED_IMAGE_URI =~ ^763104351884\.dkr\.ecr\.us-west-2\.amazonaws\.com/pytorch-(inference|training)-arm64:2\.[5-7]\.[0-9]+-gpu(.+) ]]; then
+    mkdir -p /tmp/nvjpeg
+    cd /tmp/nvjpeg
+    wget https://developer.download.nvidia.com/compute/cuda/redist/libnvjpeg/linux-aarch64/libnvjpeg-linux-aarch64-12.4.0.76-archive.tar.xz
+    tar -xvf libnvjpeg-linux-aarch64-12.4.0.76-archive.tar.xz
+    rm -rf /usr/local/cuda/targets/sbsa-linux/lib/libnvjpeg*
+    rm -rf /usr/local/cuda/targets/sbsa-linux/include/nvjpeg.h
+    cp libnvjpeg-linux-aarch64-12.4.0.76-archive/lib/libnvjpeg* /usr/local/cuda/targets/sbsa-linux/lib/
+    cp libnvjpeg-linux-aarch64-12.4.0.76-archive/include/* /usr/local/cuda/targets/sbsa-linux/include/
+    cd /
+    rm -rf /tmp/nvjpeg
+fi
+
 # For all GPU images, remove cuobjdump and nvdisasm
 if [[ $LATEST_RELEASED_IMAGE_URI =~ ^763104351884\.dkr\.ecr\.us-west-2\.amazonaws\.com/(pytorch|tensorflow)(.+)gpu(.+) ]]; then
     rm -rf /usr/local/cuda/bin/cuobjdump*
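
Reviewer note: the added branch is gated entirely by the URI regex, so it may help to check which tags it selects before relying on it. The following is a minimal sketch, not part of the commit. The helper name `matches_arm64_gpu` and every sample tag suffix (`-py312`, `-py311`) are hypothetical and chosen only for illustration; the pattern itself is copied from the added `if` line.

```bash
#!/usr/bin/env bash
# Sketch: exercise the ARM64 GPU pattern from dockerfile_patch_script.sh
# against a few sample URIs. The tag suffixes below are hypothetical.

# Pattern copied verbatim from the added `if [[ ... =~ ... ]]` line.
ARM64_GPU_PATTERN='^763104351884\.dkr\.ecr\.us-west-2\.amazonaws\.com/pytorch-(inference|training)-arm64:2\.[5-7]\.[0-9]+-gpu(.+)'

# Hypothetical helper: succeeds when the URI would take the nvjpeg patch branch.
matches_arm64_gpu() {
    # The right-hand side is left unquoted so bash treats it as a regex.
    [[ $1 =~ $ARM64_GPU_PATTERN ]]
}

REGISTRY="763104351884.dkr.ecr.us-west-2.amazonaws.com"

for uri in \
    "$REGISTRY/pytorch-training-arm64:2.7.0-gpu-py312" \
    "$REGISTRY/pytorch-inference-arm64:2.5.1-gpu-py311" \
    "$REGISTRY/pytorch-training-arm64:2.7.0-cpu-py312" \
    "$REGISTRY/pytorch-training-arm64:2.4.0-gpu-py311" \
    "$REGISTRY/pytorch-training:2.7.0-gpu-py312" \
    "$REGISTRY/pytorch-training-arm64:2.7.0-gpu"
do
    if matches_arm64_gpu "$uri"; then
        echo "patch: $uri"
    else
        echo "skip:  $uri"
    fi
done
```

Under these assumptions, only the first two URIs take the patch branch. The last one is skipped because the trailing `(.+)` requires at least one character after `-gpu`, and minor versions outside 2.5 to 2.7 are excluded by `[5-7]`.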

pytorch/training/buildspec-arm64-2-7-ec2.yml

Lines changed: 1 addition & 1 deletion
@@ -5,7 +5,7 @@ framework: &FRAMEWORK pytorch
 version: &VERSION 2.7.0
 short_version: &SHORT_VERSION "2.7"
 arch_type: arm64
-autopatch_build: "False"
+autopatch_build: "True"
 
 repository_info:
   training_repository: &TRAINING_REPOSITORY
