Skip to content

Commit 0245c2f

Browse files
committed
Add describe for EPP
1 parent b0d5671 commit 0245c2f

File tree

1 file changed

+7
-1
lines changed

1 file changed

+7
-1
lines changed

content/ngf/how-to/gateway-api-inference-extension.md

Lines changed: 7 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -59,7 +59,7 @@ kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extens
5959

6060
## Deploy the InferencePool and Endpoint Picker Extension
6161

62-
The InferencePool is a Gateway API Inference Extension resource that represents a set of Infernece-focused Pods. With InferencePool, you can configure a routing extension as well as inference-specific routing optimizations. For more information on this resource, refer to the Gateway API Inference Extension [InferencePool documentation](https://gateway-api-inference-extension.sigs.k8s.io/api-types/inferencepool/).
62+
The InferencePool is a Gateway API Inference Extension resource that represents a set of Inference-focused Pods. With InferencePool, you can configure a routing extension as well as inference-specific routing optimizations. For more information on this resource, refer to the Gateway API Inference Extension [InferencePool documentation](https://gateway-api-inference-extension.sigs.k8s.io/api-types/inferencepool/).
6363

6464
Install an InferencePool named `vllm-llama3-8b-instruct` that selects from endpoints with label `app: vllm-llama3-8b-instruct` and listening on port 8000. The Helm install command automatically installs the Endpoint Picker Extension and InferencePool.
6565

@@ -75,6 +75,12 @@ helm install vllm-llama3-8b-instruct \
7575
oci://registry.k8s.io/gateway-api-inference-extension/charts/inferencepool
7676
```
7777

78+
Confirm that the Endpoint Picker was deployed and is running:
79+
80+
```shell
81+
kubectl describe deployment vllm-llama3-8b-instruct-epp
82+
```
83+
7884
## Deploy an Inference Gateway
7985

8086
1. Deploy Inference Gateway:

0 commit comments

Comments
 (0)