Skip to content

Commit 911a1e5

Browse files
authored
Updates artifacts for v0.1.0-rc1 release (#285)
Signed-off-by: Daneyon Hansen <[email protected]>
1 parent d5f5507 commit 911a1e5

File tree

3 files changed

+5
-4
lines changed

3 files changed

+5
-4
lines changed

pkg/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ This quickstart guide is intended for engineers familiar with k8s and model serv
2323
1. **Install the Inference Extension CRDs:**
2424

2525
```sh
26-
kubectl apply -k https://github.com/kubernetes-sigs/gateway-api-inference-extension/config/crd
26+
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/releases/download/v0.1.0-rc1/manifests.yaml
2727
```
2828

2929
1. **Deploy InferenceModel**

pkg/manifests/ext_proc.yaml

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -71,7 +71,8 @@ spec:
7171
spec:
7272
containers:
7373
- name: inference-gateway-ext-proc
74-
image: us-central1-docker.pkg.dev/k8s-staging-images/gateway-api-inference-extension/epp:main
74+
image: us-central1-docker.pkg.dev/k8s-staging-images/gateway-api-inference-extension/epp:v0.1.0-rc1
75+
imagePullPolicy: IfNotPresent
7576
args:
7677
- -poolName
7778
- "vllm-llama2-7b-pool"

pkg/manifests/vllm/deployment.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -14,8 +14,8 @@ spec:
1414
spec:
1515
containers:
1616
- name: lora
17-
image: "vllm/vllm-openai:latest"
18-
imagePullPolicy: Always
17+
image: "vllm/vllm-openai:v0.7.1"
18+
imagePullPolicy: IfNotPresent
1919
command: ["python3", "-m", "vllm.entrypoints.openai.api_server"]
2020
args:
2121
- "--model"

0 commit comments

Comments
 (0)