Skip to content

Commit 5d9d447

Browse files
committed
fixed comments
1 parent e934b58 commit 5d9d447

File tree

1 file changed

+10
-6
lines changed

1 file changed

+10
-6
lines changed

site-src/guides/ga-migration.md

Lines changed: 10 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -54,15 +54,19 @@ helm uninstall <helm_alpha_inferencepool_name>
5454
If you are not using Helm, you will need to manually delete all resources associated with your `v1alpha2` deployment. The key is to remove the `HTTPRoute`'s reference to the old `InferencePool` and then delete the `v1alpha2` resources themselves.
5555

5656
1. **Update or Delete the `HTTPRoute`**: Modify the `HTTPRoute` to remove the `backendRef` that points to the `v1alpha2` `InferencePool`.
57-
2. **Delete the `InferencePool` and associated resources**: You must delete the `v1alpha2` `InferencePool`, any `InferenceModel` resources that point to it, and the corresponding Endpoint Picker (EPP) Deployment and Service.
57+
2. **Delete the `InferencePool` and associated resources**: You must delete the `v1alpha2` `InferencePool`, any `InferenceModel` (or 'InferenceObjective') resources that point to it, and the corresponding Endpoint Picker (EPP) Deployment and Service.
5858
3. **Delete the `v1alpha2` CRDs**: Once all `v1alpha2` custom resources are deleted, you can remove the CRD definitions from your cluster.
5959
```bash
60-
kubectl delete -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/releases/download/v0.3.0/manifests.yaml
60+
# You can change the version to the one you installed `v1alpha2` CRDs
61+
export VERSION="v0.3.0"
62+
kubectl delete -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/releases/download/${VERSION}/manifests.yaml
6163
```
6264

6365
### 2. Install v1 Resources
6466

65-
After cleaning up the old resources, you can proceed with a fresh installation of the `v1` Inference Gateway. This involves installing the new `v1` CRDs, creating a new `v1` `InferencePool` and corresponding `InferenceObjective` resources, and creating a new `HTTPRoute` that directs traffic to your new `v1` `InferencePool`.
67+
After cleaning up the old resources, you can proceed with a fresh installation of the `v1` Inference Gateway.
68+
This involves deploying a new EPP image compatible with the `v1` API and installing the new `v1` CRDs.
69+
You can then create a new v1 InferencePool with its corresponding InferenceObjective resources, and a new HTTPRoute that directs traffic to your new `v1` InferencePool.
6670

6771

6872
### 3. Verify the Deployment
@@ -71,14 +75,14 @@ After a few minutes, verify that your new `v1` stack is correctly serving traffi
7175

7276
```bash
7377
❯ kubectl get gateway -o wide
74-
NAME CLASS ADDRESS PROGRAMMED AGE
75-
inference-gateway inference-gateway <IP_ADDRESS> True 10m
78+
NAME CLASS ADDRESS PROGRAMMED AGE
79+
<YOUR_INFERENCE_GATEWAY_NAME> inference-gateway <IP_ADDRESS> True 10m
7680
```
7781

7882
Curl the endpoint to make sure you are getting a successful response with a **200** response code.
7983

8084
```bash
81-
IP=$(kubectl get gateway/inference-gateway -o jsonpath='{.status.addresses[0].value}')
85+
IP=$(kubectl get gateway/<YOUR_INFERENCE_GATEWAY_NAME> -o jsonpath='{.status.addresses[0].value}')
8286
PORT=80
8387
8488
curl -i ${IP}:${PORT}/v1/completions -H 'Content-Type: application/json' -d '{

0 commit comments

Comments
 (0)