Adding Kubvernor to the list of implementors #1313
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,10 @@ | ||
apiVersion: gateway.networking.k8s.io/v1 | ||
kind: Gateway | ||
metadata: | ||
name: kubvernor-inference-gateway | ||
spec: | ||
gatewayClassName: kubvernor-inference-gateway | ||
listeners: | ||
- name: http | ||
port: 80 | ||
protocol: HTTP |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
apiVersion: gateway.networking.k8s.io/v1 | ||
kind: HTTPRoute | ||
metadata: | ||
name: llm-route | ||
spec: | ||
parentRefs: | ||
- group: gateway.networking.k8s.io | ||
kind: Gateway | ||
name: kubvernor-inference-gateway | ||
rules: | ||
- backendRefs: | ||
- group: inference.networking.x-k8s.io | ||
kind: InferencePool | ||
name: vllm-llama3-8b-instruct | ||
matches: | ||
- path: | ||
type: PathPrefix | ||
value: / | ||
timeouts: | ||
request: 300s |
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -291,6 +291,42 @@ This quickstart guide is intended for engineers familiar with k8s and model serv | |
kubectl get httproute llm-route -o yaml | ||
``` | ||
|
||
=== "Kubvernor Rust API Gateway" | ||
|
||
|
||
[Kubvernor Rust API Gateway](https://github.com/kubvernor/kubvernor) is a highly experimental project and is not ready for production, but it supports version v0.5.1 of the Inference Extension spec. | ||
|
||
1. Requirements | ||
- Rust and Cargo installed | ||
|
||
|
||
2. Run the Kubvernor Rust API Gateway as documented in its [README](https://github.com/kubvernor/kubvernor/blob/main/README.md) | ||
|
||
|
||
|
||
3. Deploy the Gateway | ||
|
||
```bash | ||
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/kubvernor/gateway.yaml | ||
``` | ||
|
||
Confirm that the Gateway was assigned an IP address and reports a `Programmed=True` status: | ||
```bash | ||
$ kubectl get gateway kubvernor-inference-gateway | ||
|
||
NAME CLASS ADDRESS PROGRAMMED AGE | ||
kubvernor-inference-gateway kubvernor-inference-gateway <MY_ADDRESS> True 22s | ||
|
||
``` | ||
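Instead of polling `kubectl get gateway` by hand, the `Programmed=True` check can be scripted with `kubectl wait`. This is a minimal sketch, not part of the PR: `wait_for_gateway` is a hypothetical helper name, and the 120s default timeout is an arbitrary assumption.

```shell
# Sketch: block until the Gateway reports Programmed=True, or fail after a timeout.
# wait_for_gateway is a hypothetical helper; the default timeout is an assumption.
wait_for_gateway() {
  local name="${1:-kubvernor-inference-gateway}"
  local timeout="${2:-120s}"
  # kubectl wait watches the resource's status.conditions for the named condition.
  kubectl wait --for=condition=Programmed "gateway/${name}" --timeout="${timeout}"
}
```

Calling `wait_for_gateway kubvernor-inference-gateway 60s` returns non-zero if the condition is not met within the timeout, which makes it usable as a gate before the HTTPRoute step.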
|
||
4. Deploy the HTTPRoute | ||
|
||
```bash | ||
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/gateway/kubvernor/httproute.yaml | ||
``` | ||
|
||
5. Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True`: | ||
|
||
```bash | ||
kubectl get httproute llm-route -o yaml | ||
``` | ||
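Reading the full YAML works, but the two conditions can also be pulled out directly. The sketch below assumes the conditions are reported under `.status.parents[*].conditions`, as the Gateway API defines for HTTPRoute status; `route_conditions` and `route_is_ready` are hypothetical helper names, not commands from the PR. With more than one parent, the check passes if any parent reports each condition as `True`.

```shell
# Hypothetical helper: print every condition reported for the route as TYPE=STATUS.
route_conditions() {
  local name="${1:-llm-route}"
  kubectl get httproute "${name}" \
    -o jsonpath='{range .status.parents[*].conditions[*]}{.type}={.status}{"\n"}{end}'
}

# Hypothetical helper: succeed only if both Accepted=True and ResolvedRefs=True appear.
route_is_ready() {
  local conditions
  conditions="$(route_conditions "${1:-llm-route}")" || return 1
  printf '%s\n' "${conditions}" | grep -qx 'Accepted=True' &&
    printf '%s\n' "${conditions}" | grep -qx 'ResolvedRefs=True'
}
```

For example, `route_is_ready llm-route && echo "route ready"` prints the message only when both conditions are present.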
|
||
|
||
### Try it out | ||
|
||
Wait until the gateway is ready. | ||
|
@@ -408,3 +444,5 @@ This quickstart guide is intended for engineers familiar with k8s and model serv | |
```bash | ||
kubectl delete ns kgateway-system | ||
``` | ||
=== "Kubvernor" | ||
|