Commit 23179c2

Merge pull request #4853 from aahill/pre-build-agents
updating models
2 parents ab87702 + 27e8899 commit 23179c2

1 file changed

+49
-22
lines changed


articles/ai-services/agents/concepts/model-region-support.md

Lines changed: 49 additions & 22 deletions
@@ -22,46 +22,73 @@ Azure OpenAI provides customers with choices on the hosting structure that fits
2222
- **Standard** is offered with a global deployment option, routing traffic globally to provide higher throughput.
2323
- **Provisioned** is also offered with a global deployment option, allowing customers to purchase and deploy provisioned throughput units across Azure global infrastructure.
2424

25-
All deployments can perform the exact same inference operations, however the billing, scale, and performance are substantially different. To learn more about Azure OpenAI deployment types see our [deployment types guide](../../openai/how-to/deployment-types.md).
25+
All deployments can perform the exact same inference operations; however, the billing, scale, and performance are substantially different. To learn more about Azure OpenAI deployment types, see the [deployment types guide](../../openai/how-to/deployment-types.md).
2626

2727
Azure AI Agent Service supports the following Azure OpenAI models in the listed regions.
2828

2929
> [!NOTE]
3030
> The following table is for pay-as-you-go. For information on Provisioned Throughput Unit (PTU) availability, see [provisioned throughput](../../openai/concepts/provisioned-throughput.md) in the Azure OpenAI documentation. `GlobalStandard` customers also have access to [global standard models](../../openai/concepts/models.md#global-standard-model-availability).
3131
32-
| **Region** | **gpt-4o**, **2024-11-20** | **gpt-4o**, **2024-05-13** | **gpt-4o**, **2024-08-06** | **gpt-4o-mini**, **2024-07-18** | **gpt-4**, **0613** | **gpt-4**, **1106-Preview** | **gpt-4**, **0125-Preview** | **gpt-4**, **turbo-2024-04-09** | **gpt-4-32k**, **0613** | **gpt-35-turbo**, **0613** | **gpt-35-turbo**, **1106** | **gpt-35-turbo**, **0125** | **gpt-35-turbo-16k**, **0613** |
33-
|:--------------|:--------------------------:|:--------------------------:|:--------------------------:|:-------------------------------:|:-------------------:|:---------------------------:|:---------------------------:|:-------------------------------:|:--------------------------:|:--------------------------:|:--------------------------:|:------------------------------:|
34-
| australiaeast | | - | - | - ||| -| - ||||||
35-
| canadaeast || - | - | - ||| -| - || - ||||
36-
| eastus | |||| - | - ||| - || - |||
37-
| eastus2 | |||| - || - || - || - |||
38-
| francecentral | | - | - | - ||| - | - |||| - ||
39-
| japaneast || - | - | - | - | - | - | - | - || - |||
40-
| koreacentral | - |- | - | - | - | - | - |- | - | - | - | - | - |
41-
| norwayeast | |- | - | - | - || - |- | - | - | - | - | - |
42-
| polandcentral | - | - | - | - | - | - | - | - | - | - | - | - | - |
43-
| southindia | | - | - | - | - || - | - | - | - ||| - |
44-
| swedencentral ||||||| - ||||| - ||
45-
| switzerlandnorth | - | - | - | - || - | - | - || - | - |||
46-
| uaenorth | | - | - | - | - | - | - | - | - | - | - | - | - |
47-
| uksouth | | - | - | - | - ||| - | - |||||
48-
| westus | |||| - || - || - | - ||| - |
49-
| westus3 | |||| - || - || - | - | - || - |
50-
32+
| REGION | o1 | o3-mini | gpt-4.1, 2025-04-14 | gpt-4.1-mini, 2025-04-14 | gpt-4.1-nano, 2025-04-14 | gpt-4o, 2024-05-13 | gpt-4o, 2024-08-06 | gpt-4o, 2024-11-20 | gpt-4o-mini, 2024-07-18 | gpt-4, 0613 | gpt-4, turbo-2024-04-09 | gpt-4-32k, 0613 | gpt-35-turbo, 1106 | gpt-35-turbo, 0125 |
33+
|------------------|----|---------|---------------------|--------------------------|--------------------------|--------------------|--------------------|--------------------|-------------------------|-------------|-------------------------|-----------------|--------------------|--------------------|
34+
| australiaeast | | | | | | | | X | | X | | X | X | X |
35+
| canadaeast | | | | | | | | X | | X | | X | X | X |
36+
| eastus | | | | | | X | X | X | X | X | X | | | X |
37+
| eastus2 | | | | | | X | X | X | X | X | X | | | X |
38+
| francecentral | | | | | | | | X | | X | | X | X | X |
39+
| japaneast | | | | | | | | X | | | | | | X |
40+
| koreacentral | | | | | | | | | | | | | | |
41+
| norwayeast | | | | | | | | X | | | | | | |
42+
| polandcentral | | | | | | | | | | | | | | |
43+
| southindia | | | | | | | | X | | | | | X | |
44+
| swedencentral | | | | | | X | X | X | X | X | X | X | X | X |
45+
| switzerlandnorth | | | | | | | | X | | X | | X | | X |
46+
| uaenorth | | | | | | | | | | | | | | |
47+
| uksouth | | | | | | | | X | | | | | X | X |
48+
| westus | | | | | | X | X | X | X | | X | | X | |
49+
| westus3 | | | | | | X | X | X | X | | X | | | |
5150

5251
## Non-Microsoft models
5352

5453
The Azure AI Agent Service also supports the following models from the Azure AI Foundry model catalog.
5554

5655
* Meta-Llama-405B-Instruct
57-
* Cohere-command-r-plus
58-
* Cohere-command-r
5956

6057
To use these models, you can use [Azure AI Foundry portal](https://ai.azure.com/) to make a deployment, and then reference the deployment name in your agent. For example:
6158

6259
```python
agent = project_client.agents.create_agent(model="llama-3", name="my-agent", instructions="You are a helpful agent")
```
62+
## Azure AI Foundry models
63+
64+
### Models with tool-calling
65+
66+
To best support agentic scenarios, we recommend using models that support tool-calling. The Azure AI Agent Service currently supports all agent-compatible models from the Azure AI Foundry model catalog.
67+
68+
To use these models, use the [Azure AI Foundry portal](https://ai.azure.com/) to make a model deployment, then reference the deployment name in your agent. For example:
69+
70+
`agent = project_client.agents.create_agent( model="llama-3", name="my-agent", instructions="You are a helpful agent")`
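
As a minimal sketch of the call above, the helper below wraps `create_agent` so the deployment name is passed explicitly as the `model` argument. The helper name `create_catalog_agent` and the stand-in client are hypothetical and exist only so the snippet runs without Azure credentials; in real code, `project_client` would be the project client object you already use with the service.

```python
from types import SimpleNamespace


def create_catalog_agent(project_client, deployment_name,
                         name="my-agent",
                         instructions="You are a helpful agent"):
    """Create an agent whose model is a catalog deployment.

    `deployment_name` is the name given to the model deployment in the
    Azure AI Foundry portal (for example "llama-3"), not the catalog model ID.
    """
    return project_client.agents.create_agent(
        model=deployment_name,
        name=name,
        instructions=instructions,
    )


# Hypothetical stand-in for a real project client, used only for illustration.
def _fake_create_agent(model, name, instructions):
    return SimpleNamespace(model=model, name=name, instructions=instructions)


fake_client = SimpleNamespace(
    agents=SimpleNamespace(create_agent=_fake_create_agent)
)

agent = create_catalog_agent(fake_client, "llama-3")
print(agent.model)
```

Passing the client in as a parameter keeps the helper independent of how the client is constructed, which may differ between SDK versions.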
71+
72+
> [!NOTE]
73+
> This option should only be used for open-source models (for example, Cepstral, Mistral, Llama) and not for OpenAI models, which are natively supported in the service. This option should also only be used for models that support tool-calling.
74+
75+
### Models without tool-calling
76+
77+
Though tool-calling support is a core capability for agentic scenarios, we now provide the ability to use models that don’t support tool-calling in our API and SDK. This option can be helpful when you have specific use cases that don’t require tool-calling.
78+
79+
The following steps let you use any chat-completion model that is available through a [serverless API](/azure/ai-foundry/how-to/model-catalog-overview):
80+
81+
82+
83+
1. Deploy your desired model through a serverless API. The model appears on your **Models + Endpoints** page.
84+
85+
1. Select the model name to open the model details, where you'll find the model's target URI and key.
86+
87+
1. Create a new serverless connection on the **Connected Resources** page, using the target URI and key.
88+
89+
The model can now be referenced in your code (`Target URI` + `@` + `Model Name`), for example:
90+
91+
`Model=https://Phi-4-mejco.eastus.models.ai.azure.com/@Phi-4-mejco`
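
The `Target URI` + `@` + `Model Name` format can be sketched as a small string helper. The function name `serverless_model_reference` is hypothetical, and the sketch assumes the target URI is used exactly as shown on the model details page, with no normalization of the trailing slash.

```python
def serverless_model_reference(target_uri: str, model_name: str) -> str:
    """Build a model reference of the form `Target URI` + `@` + `Model Name`."""
    if not target_uri or not model_name:
        raise ValueError("Both the target URI and the model name are required.")
    # Assumption: the URI is concatenated as-is, matching the documented example.
    return f"{target_uri}@{model_name}"


reference = serverless_model_reference(
    "https://Phi-4-mejco.eastus.models.ai.azure.com/",
    "Phi-4-mejco",
)
print(reference)  # https://Phi-4-mejco.eastus.models.ai.azure.com/@Phi-4-mejco
```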
6592

6693
## Next steps
6794
