Skip to content

Commit 13b9012

Browse files
committed
Fix some markdown formatting errors
1 parent 173ad58 commit 13b9012

File tree

3 files changed

+4
-4
lines changed

3 files changed

+4
-4
lines changed

mkdocs.yml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ theme:
1313
favicon: images/favicon-64.png
1414
features:
1515
- content.code.annotate
16+
- content.code.copy
1617
- search.highlight
1718
- navigation.tabs
1819
- navigation.top

site-src/guides/index.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -321,11 +321,11 @@ Tooling:
321321

322322
### Deploy the Body Based Router Extension (Optional)
323323

324-
This guide shows how to get started with serving only 1 base model type per L7 URL path. If in addition, you wish to exercise model-aware routing such that more than 1 base model is served at the same L7 url path, that requires use of the (optional) Body Based Routing (BBR) extension which is described in a following section of the guide, namely the [`Serving Multiple GenAI Models`](serve-multiple-genai-models.md) section.
324+
This guide shows how to get started with serving only 1 base model type per L7 URL path. If in addition, you wish to exercise model-aware routing such that more than 1 base model is served at the same L7 url path, that requires use of the (optional) Body Based Routing (BBR) extension which is described in a following section of the guide, namely the [`Serving Multiple GenAI Models`](serve-multiple-genai-models.md) section.
325325

326326
### Deploy InferenceObjective (Optional)
327327

328-
Deploy the sample InferenceObjective which allows you to specify priority of requests.
328+
Deploy the sample InferenceObjective which allows you to specify priority of requests.
329329

330330
```bash
331331
kubectl apply -f https://github.com/kubernetes-sigs/gateway-api-inference-extension/raw/main/config/manifests/inferenceobjective.yaml

site-src/guides/serve-multiple-genai-models.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -152,6 +152,7 @@ spec:
152152
request: 300s
153153
---
154154
```
155+
1. [BBR](https://github.com/kubernetes-sigs/gateway-api-inference-extension/blob/main/pkg/bbr/README.md) is being used to copy the model name from the request body to the header with key `X-Gateway-Model-Name`. The header can then be used in the `HTTPRoute` to route requests to different `InferencePool` instances.
155156

156157
Confirm that the HTTPRoute status conditions include `Accepted=True` and `ResolvedRefs=True` for both routes:
157158

@@ -163,8 +164,6 @@ kubectl get httproute llm-llama-route -o yaml
163164
kubectl get httproute llm-phi4-route -o yaml
164165
```
165166

166-
[BBR](https://github.com/kubernetes-sigs/gateway-api-inference-extension/blob/main/pkg/bbr/README.md) is being used to copy the model name from the request body to the header with key `X-Gateway-Model-Name`. The header can then be used in the `HTTPRoute` to route requests to different `InferencePool` instances.
167-
168167
## Try it out
169168

170169
1. Get the gateway IP:

0 commit comments

Comments
 (0)