Skip to content

Commit c934835

Browse files
bhanutejagkBhanu Teja GoshikondasirutBuasaijunpuf
authored
Post release process for TF2.19_inference_SM (#4971)
* Building inf image * added dockerd_ec2_entrypoint.sh file * fixed syntax * modified env variable with values directly * rearranged build_image creation line to top of file * added setup.sources.sh file to add NVIDIA's package & Tensorflow serving reporitories * removed libssl1.1 line and modified libreadline-gplv2-dev * removed libssl1.1 installation and modified libreadline-gplv2-dev to libreadline-dev in gpu file * changed TF_SERVING_VERSION_GIT_COMMIT value and removed dockerd_ec2_entrypoint file * added libssl3 since it is a dependency for nginx * changed nginx installation from Focal(20.04) to Ubuntu Jammy(22.04) * trying to debug error with libssl1.1 instead of libssl3 * removed libssl1.1 * installing libcudnn and nccl via dpkg after python installation to remove 3.10 conflict * installing python before installing packages using apt install * installing wget before installing python * insatalling curl, gnupg2, ca-certificates also before python installation * divided installtion of packages as needed for python compilation and not needed ones * installing libssl1.1 * added allowlist to remove vulnerability on gpu image * removed logic for build to only test * setup nvidia repositories through cuda-keyring and building image and also skipped telemetry tests * modified wget installation code after installing wget * added 2.19 to skip framework telemetry * checking with openssl/libssl1.1.1p to check for CVE's and removed skipping of bashrc and entrypoint telemtry * Upgraded OpenSSL to a newer version that doesn't have these vulnerabilities & checking only security tests * building the image * upgraded libssl1.1 to 1.1.1t * replace 1.1.1t with 1.1.1o for libssl1.1 * upgraded libssl1.1 version to 1.1.1f-1ubuntu2.24 * changed libssl version to libssl1.1_1.1.1-1ubuntu2.1~18.04.23_amd64.deb * upgraded libssl1.1 to 1.1.1-1ubuntu2.1~18.04.23 * removed libssl1.1 installation to check if it even needed to remove error * uninstalling libssl1.1 after installing cuda packages to check for errors * Add license file content test (#4890) * Add license file content test * use short verison * test no build * print string * enable build * fix allowlist * rebuild * buildtest ec2 * test arm * build test inference * disable arm64 mode * disable build * revert toml * update EFA to 1.41.0 vllm to 0.9.0.1 (#4898) * update EFA to 1.41.0 vllm to 0.9.0.1 * upgraded base image to cuda 12.2.2 to check for errors of openssl * changed cuda base image version to 12.2.0 from 12.2.2 * added allowlist.json file for the openssl CVE's * enabled some more tests * removed some commented lines * reverted back toml file * Removed all the commented lines for cleaner code * added tensorflow 2.19 to release_images_patches.yml * corrected the cuda version * updated available_images file with new tensorflow 2.19 sm images * adjusting spaces --------- Co-authored-by: Bhanu Teja Goshikonda <[email protected]> Co-authored-by: Sirut Buasai <[email protected]> Co-authored-by: Junpu Fan <[email protected]>
1 parent 5f2384f commit c934835

File tree

1 file changed

+4
-2
lines changed

1 file changed

+4
-2
lines changed

available_images.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -141,8 +141,8 @@ SageMaker Framework Containers (SM support only)
141141
| PyTorch 2.6.0 | inference | No | GPU | 3.12 (py312) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/pytorch-inference:2.6.0-gpu-py312-cu124-ubuntu22.04-sagemaker |
142142
| TensorFlow 2.18.0 | training | No | CPU | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/tensorflow-training:2.18.0-cpu-py310-ubuntu22.04-sagemaker |
143143
| TensorFlow 2.18.0 | training | No | GPU | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/tensorflow-training:2.18.0-gpu-py310-cu125-ubuntu22.04-sagemaker |
144-
| TensorFlow 2.18.0 | inference | No | CPU | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/tensorflow-inference:2.18.0-cpu-py310-ubuntu20.04-sagemaker |
145-
| TensorFlow 2.18.0 | inference | No | GPU | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/tensorflow-inference:2.18.0-gpu-py310-cu122-ubuntu20.04-sagemaker |
144+
| TensorFlow 2.19.0 | inference | No | CPU | 3.12 (py312) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/tensorflow-inference:2.19.0-cpu-py312-ubuntu22.04-sagemaker |
145+
| TensorFlow 2.19.0 | inference | No | GPU | 3.12 (py312) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/tensorflow-inference:2.19.0-gpu-py312-cu122-ubuntu22.04-sagemaker |
146146

147147

148148
EC2 Framework ARM64/Graviton Containers (EC2, ECS, and EKS support only)
@@ -373,6 +373,8 @@ Prior SageMaker Framework Container Versions
373373
| PyTorch 2.4.0 | training | No | GPU | 3.11 (py311) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/pytorch-training:2.4.0-gpu-py311-cu124-ubuntu22.04-sagemaker |
374374
| PyTorch 2.4.0 | inference | No | CPU | 3.11 (py311) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/pytorch-inference:2.4.0-cpu-py311-ubuntu22.04-sagemaker |
375375
| PyTorch 2.4.0 | inference | No | GPU | 3.11 (py311) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/pytorch-inference:2.4.0-gpu-py311-cu124-ubuntu22.04-sagemaker |
376+
| TensorFlow 2.18.0 | inference | No | CPU | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/tensorflow-inference:2.18.0-cpu-py310-ubuntu20.04-sagemaker |
377+
| TensorFlow 2.18.0 | inference | No | GPU | 3.10 (py310) | 763104351884.dkr.ecr.us-east-1.amazonaws.com/tensorflow-inference:2.18.0-gpu-py310-cu122-ubuntu20.04-sagemaker |
376378

377379
Prior EC2 Framework ARM64/Graviton Containers
378380
============================

0 commit comments

Comments
 (0)