You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If you are not using Helm, you will need to manually delete all resources associated with your `v1alpha2` deployment. The key is to remove the `HTTPRoute`'s reference to the old `InferencePool` and then delete the `v1alpha2` resources themselves.
55
55
56
56
1.**Update or Delete the `HTTPRoute`**: Modify the `HTTPRoute` to remove the `backendRef` that points to the `v1alpha2``InferencePool`.
57
-
2.**Delete the `InferencePool` and associated resources**: You must delete the `v1alpha2``InferencePool`, any `InferenceModel` resources that point to it, and the corresponding Endpoint Picker (EPP) Deployment and Service.
57
+
2.**Delete the `InferencePool` and associated resources**: You must delete the `v1alpha2``InferencePool`, any `InferenceModel`(or 'InferenceObjective') resources that point to it, and the corresponding Endpoint Picker (EPP) Deployment and Service.
58
58
3.**Delete the `v1alpha2` CRDs**: Once all `v1alpha2` custom resources are deleted, you can remove the CRD definitions from your cluster.
After cleaning up the old resources, you can proceed with a fresh installation of the `v1` Inference Gateway. This involves installing the new `v1` CRDs, creating a new `v1``InferencePool` and corresponding `InferenceObjective` resources, and creating a new `HTTPRoute` that directs traffic to your new `v1``InferencePool`.
67
+
After cleaning up the old resources, you can proceed with a fresh installation of the `v1` Inference Gateway.
68
+
This involves deploying a new EPP image compatible with the `v1` API and installing the new `v1` CRDs.
69
+
You can then create a new v1 InferencePool with its corresponding InferenceObjective resources, and a new HTTPRoute that directs traffic to your new `v1` InferencePool.
66
70
67
71
68
72
### 3. Verify the Deployment
@@ -71,14 +75,14 @@ After a few minutes, verify that your new `v1` stack is correctly serving traffi
0 commit comments