Commit 51b7b25

fixes (#3571)
1 parent 28ef155 commit 51b7b25

2 files changed: +6 -6 lines changed

app/_kong_plugins/ai-proxy/examples/azure-chat-route.yaml

Lines changed: 5 additions & 5 deletions
@@ -24,16 +24,16 @@ weight: 900

 config:
 model_name_header: false
-route_type: llm/v1/batches
+route_type: llm/v1/chat
 auth:
 header_name: Authorization
 header_value: Bearer ${azure_key}
 model:
 provider: azure
-options:
-azure_api_version: "2025-01-01-preview"
-azure_instance: ${azure_instance}
-azure_deployment_id: ${azure_deployment}
+options:
+azure_api_version: "2025-01-01-preview"
+azure_instance: ${azure_instance}
+azure_deployment_id: ${azure_deployment}
 variables:
 azure_key:
 value: "$AZURE_OPENAI_API_KEY"
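
The removed and added `options` lines above are textually identical, which suggests the change to them was whitespace only. The page extraction dropped leading whitespace, so the nesting below is an assumption based on the ai-proxy plugin schema, where `options` is a child of `model`. It is a sketch of how the fixed fragment plausibly reads, not a copy of the committed file.

```yaml
# Assumed nesting: indentation was lost in extraction and is reconstructed here.
config:
  model_name_header: false
  route_type: llm/v1/chat          # changed from llm/v1/batches in this commit
  auth:
    header_name: Authorization
    header_value: Bearer ${azure_key}
  model:
    provider: azure
    options:                       # assumed to sit under `model`
      azure_api_version: "2025-01-01-preview"
      azure_instance: ${azure_instance}
      azure_deployment_id: ${azure_deployment}
variables:
  azure_key:
    value: "$AZURE_OPENAI_API_KEY"
```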

app/_landing_pages/ai-gateway.yaml

Lines changed: 1 addition & 1 deletion
@@ -581,7 +581,7 @@ rows:
 - type: card
 config:
 title: Save LLM usage costs with semantic load balancing
-description: Use semantic load balancing to optimize LLM usage and reduce costs by intelligently routing chat requests across multiple OpenAI models based on [semantic similarity](/ai-gateway/semantic-similarity/).
+description: Use semantic load balancing to optimize LLM usage and reduce costs by intelligently routing chat requests across multiple OpenAI models based on semantic similarity.
 icon: /assets/icons/money.svg
 cta:
 url: /how-to/use-semantic-load-balancing
