Skip to content

Commit 1fd694b

Browse files
authored
Scrubbing out more InferenceModel references (#1592)
1 parent 0ed667e commit 1fd694b

File tree

4 files changed

+1
-2
lines changed

4 files changed

+1
-2
lines changed

site-src/guides/serve-multiple-genai-models.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,6 @@ The following diagram illustrates how an Inference Gateway routes requests to di
1212
The model name is extracted by [Body-Based routing](https://github.com/kubernetes-sigs/gateway-api-inference-extension/blob/main/pkg/bbr/README.md) (BBR)
1313
from the request body to the header. The header is then matched to dispatch
1414
requests to different `InferencePool` (and their EPPs) instances.
15-
![Serving multiple generative AI models](../images/serve-mul-gen-AI-models.png)
1615

1716
### Deploy Body-Based Routing
1817

site-src/images/inference-overview.svg

Lines changed: 1 addition & 1 deletion
Loading
-371 KB
Binary file not shown.
-403 KB
Binary file not shown.

0 commit comments

Comments
 (0)