Skip to content

Commit 8c0b2c5

Browse files
authored
Adds vLLM CPU and Sim Support to Release Script (#1029)
Signed-off-by: Daneyon Hansen <[email protected]>
1 parent b44e2e7 commit 8c0b2c5

File tree

2 files changed

+47
-15
lines changed

2 files changed

+47
-15
lines changed

.github/ISSUE_TEMPLATE/new-release.md

Lines changed: 10 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -35,10 +35,14 @@ This document defines the process for releasing Gateway API Inference Extension.
3535
export RC=1
3636
```
3737

38-
4. The vLLM image tag defaults to `v0.7.2` for a release. Set the `VLLM` environment variable if a newer [tag][vllm-tag] has been published. For example:
38+
4. Refer to the [release-quickstart script][release-quickstart] for the default image tags used
39+
by the vLLM deployment manifests. If a newer [GPU][vllm-gpu-tag], [CPU][vllm-cpu-tag], or [Simulator][vllm-sim-tag]
40+
tag has been published, set the appropriate environment variable or update the script. For example:
3941

4042
```shell
41-
export VLLM=0.7.3
43+
export VLLM_GPU=0.9.2
44+
export VLLM_CPU=0.9.3
45+
export VLLM_SIM=0.1.2
4246
```
4347

4448
## Release Process
@@ -159,4 +163,7 @@ Use the following steps to announce the release.
159163
[k8s.io]: https://github.com/kubernetes/k8s.io
160164
[yaml]: https://github.com/kubernetes/k8s.io/blob/main/registry.k8s.io/images/k8s-staging-gateway-api-inference-extension/images.yaml
161165
[issue]: https://github.com/kubernetes-sigs/gateway-api-inference-extension/issues/new/choose
162-
[vllm-tag]: https://hub.docker.com/r/vllm/vllm-openai/tags
166+
[vllm-gpu-tag]: https://hub.docker.com/r/vllm/vllm-openai/tags
167+
[vllm-cpu-tag]: https://gallery.ecr.aws/q9t5s3a7/vllm-cpu-release-repo
168+
[vllm-sim-tag]: https://github.com/llm-d/llm-d-inference-sim/pkgs/container/llm-d-inference-sim
169+
[release-quickstart]: https://github.com/kubernetes-sigs/gateway-api-inference-extension/blob/main/hack/release-quickstart.sh

hack/release-quickstart.sh

Lines changed: 37 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -29,11 +29,18 @@ else
2929
RELEASE_TAG="v${MAJOR}.${MINOR}.0-rc.${RC}"
3030
fi
3131

32-
# vLLM image version (default to 0.7.2 if not defined)
33-
VLLM="${VLLM:-0.7.2}"
32+
# The vLLM image versions
33+
# The GPU image is from https://hub.docker.com/layers/vllm/vllm-openai
34+
VLLM_GPU="${VLLM_GPU:-0.9.1}"
35+
# The CPU image is from https://gallery.ecr.aws/q9t5s3a7/vllm-cpu-release-repo
36+
VLLM_CPU="${VLLM_CPU:-0.9.1}"
37+
# The sim image is from https://github.com/llm-d/llm-d-inference-sim/pkgs/container/llm-d-inference-sim
38+
VLLM_SIM="${VLLM_SIM:-0.1.1}"
3439

3540
echo "Using release tag: ${RELEASE_TAG}"
36-
echo "Using vLLM image version: ${VLLM}"
41+
echo "Using vLLM GPU image version: ${VLLM_GPU}"
42+
echo "Using vLLM CPU image version: ${VLLM_CPU}"
43+
echo "Using vLLM Simulator image version: ${VLLM_SIM}"
3744

3845
# -----------------------------------------------------------------------------
3946
# Update pkg/README.md
@@ -64,30 +71,48 @@ sed -i.bak -E "s|(tag: )[^\"[:space:]]+|\1${RELEASE_TAG}|g" "$EPP_HELM"
6471
sed -i.bak -E "s|(tag: )[^\"[:space:]]+|\1${RELEASE_TAG}|g" "$BBR_HELM"
6572

6673
# Update the container image pull policy.
67-
sed -i.bak '/us-central1-docker.pkg.dev\/k8s-staging-images\/gateway-api-inference-extension\/epp/ { n; s/Always/IfNotPresent/ }' "$EPP"
74+
sed -i.bak '/us-central1-docker.pkg.dev\/k8s-staging-images\/gateway-api-inference-extension\/epp/{n;s/Always/IfNotPresent/;}' "$EPP"
6875

6976
# Update the container registry.
7077
sed -i.bak -E "s|us-central1-docker\.pkg\.dev/k8s-staging-images|registry.k8s.io|g" "$EPP"
7178
sed -i.bak -E "s|us-central1-docker\.pkg\.dev/k8s-staging-images|registry.k8s.io|g" "$EPP_HELM"
7279
sed -i.bak -E "s|us-central1-docker\.pkg\.dev/k8s-staging-images|registry.k8s.io|g" "$BBR_HELM"
7380

7481
# -----------------------------------------------------------------------------
75-
# Update config/manifests/vllm/gpu-deployment.yaml
82+
# Update vLLM deployment manifests
7683
# -----------------------------------------------------------------------------
77-
VLLM_DEPLOY="config/manifests/vllm/gpu-deployment.yaml"
78-
echo "Updating ${VLLM_DEPLOY} ..."
84+
VLLM_GPU_DEPLOY="config/manifests/vllm/gpu-deployment.yaml"
85+
echo "Updating ${VLLM_GPU_DEPLOY} ..."
7986

80-
# Update the vLLM image version
81-
sed -i.bak -E "s|(vllm/vllm-openai:)[^\"[:space:]]+|\1v${VLLM}|g" "$VLLM_DEPLOY"
87+
# Update the vLLM GPU image version
88+
sed -i.bak -E "s|(vllm/vllm-openai:)[^\"[:space:]]+|\1v${VLLM_GPU}|g" "$VLLM_GPU_DEPLOY"
8289

8390
# Also change the imagePullPolicy from Always to IfNotPresent on lines containing the vLLM image.
84-
sed -i.bak '/vllm\/vllm-openai/ { n; s/Always/IfNotPresent/ }' "$VLLM_DEPLOY"
91+
sed -i.bak '/vllm\/vllm-openai/{n;s/Always/IfNotPresent/;}' "$VLLM_GPU_DEPLOY"
92+
93+
VLLM_CPU_DEPLOY="config/manifests/vllm/cpu-deployment.yaml"
94+
echo "Updating ${VLLM_CPU_DEPLOY} ..."
95+
96+
# Update the vLLM CPU image version
97+
sed -i.bak -E "s|(q9t5s3a7/vllm-cpu-release-repo:)[^\"[:space:]]+|\1v${VLLM_CPU}|g" "$VLLM_CPU_DEPLOY"
98+
99+
# Also change the imagePullPolicy from Always to IfNotPresent on lines containing the vLLM CPU image.
100+
sed -i.bak '/q9t5s3a7\/vllm-cpu-release-repo/{n;s/Always/IfNotPresent/;}' "$VLLM_CPU_DEPLOY"
101+
102+
VLLM_SIM_DEPLOY="config/manifests/vllm/sim-deployment.yaml"
103+
echo "Updating ${VLLM_SIM_DEPLOY} ..."
104+
105+
# Update the vLLM Simulator image version
106+
sed -i.bak -E "s|(llm-d/llm-d-inference-sim:)[^\"[:space:]]+|\1v${VLLM_SIM}|g" "$VLLM_SIM_DEPLOY"
107+
108+
# Also change the imagePullPolicy from Always to IfNotPresent on lines containing the vLLM image.
109+
sed -i.bak '/llm-d\/llm-d-inference-sim/{n;s/Always/IfNotPresent/;}' "$VLLM_SIM_DEPLOY"
85110

86111
# -----------------------------------------------------------------------------
87112
# Stage the changes
88113
# -----------------------------------------------------------------------------
89-
echo "Staging $README $EPP $EPP_HELM $BBR_HELM $VLLM_DEPLOY files..."
90-
git add $README $EPP $EPP_HELM $BBR_HELM $VLLM_DEPLOY
114+
echo "Staging $README $EPP $EPP_HELM $BBR_HELM $VLLM_GPU_DEPLOY $VLLM_CPU_DEPLOY $VLLM_SIM_DEPLOY files..."
115+
git add $README $EPP $EPP_HELM $BBR_HELM $VLLM_GPU_DEPLOY $VLLM_CPU_DEPLOY $VLLM_SIM_DEPLOY
91116

92117
# -----------------------------------------------------------------------------
93118
# Cleanup backup files and finish

0 commit comments

Comments
 (0)