Skip to content

Commit 378f135

Browse files
committed
new doc draft
1 parent 3c33c1e commit 378f135

File tree

1 file changed

+48
-0
lines changed

1 file changed

+48
-0
lines changed
Lines changed: 48 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,48 @@
1+
---
2+
title: Azure OpenAI model router concepts
3+
titleSuffix: Azure OpenAI
4+
description: Learn about the model router feature in Azure OpenAI Service.
5+
author: PatrickFarley
6+
ms.author: pafarley
7+
ms.service: azure-ai-openai
8+
ms.topic: conceptual
9+
ms.date: 04/17/2025
10+
ms.custom:
11+
manager: nitinme
12+
---
13+
14+
# Azure OpenAI model router
15+
16+
Azure OpenAI model router is a deployable AI chat model that automatically selects the best underlying chat model to respond to a given prompt. It uses a combination of preexisting models to provide high performance while saving on compute costs where possible.
17+
18+
## Why use model router?
19+
20+
Model router intelligently selects the best underlying model for a given prompt. This way, smaller and cheaper models are used when they're sufficient for the task, but larger and more expensive models are available for more complex tasks. Also, reasoning models are available for tasks that require complex reasoning, but non-reasoning models are used otherwise. Model router provides a single chat experience that combines the best features from all the underlying chat models.
21+
22+
Model router is useful for a variety of applications. TBD
23+
24+
25+
## Versioning
26+
27+
Each version of model router is associated with a specific set of underlying models and their versions. This set won't change—only newer versions of model router can expose new underlying models.
28+
29+
If you select **Auto-update** at the deployment step (see [Manage models](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#model-updates)), then your model router model will automatically update when new versions become available. When that happens, the set of underlying models will also change, and this could affect the overall performance of the model as well as costs.
30+
31+
## Underlying models
32+
33+
|Model router version|Underlying models (version)|
34+
|---|---|
35+
|`2025-04-15`|GPT-4.1 (`2025-04-14`)</br>GPT-4.1-mini (`2025-04-14`)</br>GPT-4.1-nano (`2025-04-14`)</br>o4-mini (`2025-04-14`) |
36+
37+
38+
## Limitations
39+
40+
Model router doesn't process input images or audio.
41+
42+
Global Standard region support.
43+
44+
## Billing information
45+
46+
When you use Azure OpenAI model router, you are only billed for the use of the underlying models as they're recruited to respond to prompts. The model router itself doesn't incur any extra charges.
47+
48+
You can monitor the overall costs of your model router deployment in the Azure portal. TBD

0 commit comments

Comments
 (0)