Skip to content

Commit 07917fd

Browse files
Merge pull request #4865 from MicrosoftDocs/main
Merged by Learn.Build PR Management system
2 parents 6d3cef9 + 23179c2 commit 07917fd

File tree

4 files changed

+600
-426
lines changed

4 files changed

+600
-426
lines changed

articles/ai-services/agents/concepts/model-region-support.md

Lines changed: 49 additions & 22 deletions
Original file line numberDiff line numberDiff line change
@@ -22,46 +22,73 @@ Azure OpenAI provides customers with choices on the hosting structure that fits
2222
- **Standard** is offered with a global deployment option, routing traffic globally to provide higher throughput.
2323
- **Provisioned** is also offered with a global deployment option, allowing customers to purchase and deploy provisioned throughput units across Azure global infrastructure.
2424

25-
All deployments can perform the exact same inference operations, however the billing, scale, and performance are substantially different. To learn more about Azure OpenAI deployment types see our [deployment types guide](../../openai/how-to/deployment-types.md).
25+
All deployments can perform the exact same inference operations, however the billing, scale, and performance are substantially different. To learn more about Azure OpenAI deployment types see [deployment types guide](../../openai/how-to/deployment-types.md).
2626

2727
Azure AI Agent Service supports the following Azure OpenAI models in the listed regions.
2828

2929
> [!NOTE]
3030
> The following table is for pay-as-you-go. For information on Provisioned Throughput Unit (PTU) availability, see [provisioned throughput](../../openai/concepts/provisioned-throughput.md) in the Azure OpenAI documentation. `GlobalStandard` customers also have access to [global standard models](../../openai/concepts/models.md#global-standard-model-availability).
3131
32-
| **Region** | **gpt-4o**, **2024-11-20** | **gpt-4o**, **2024-05-13** | **gpt-4o**, **2024-08-06** | **gpt-4o-mini**, **2024-07-18** | **gpt-4**, **0613** | **gpt-4**, **1106-Preview** | **gpt-4**, **0125-Preview** | **gpt-4**, **turbo-2024-04-09** | **gpt-4-32k**, **0613** | **gpt-35-turbo**, **0613** | **gpt-35-turbo**, **1106** | **gpt-35-turbo**, **0125** | **gpt-35-turbo-16k**, **0613** |
33-
|:--------------|:--------------------------:|:--------------------------:|:--------------------------:|:-------------------------------:|:-------------------:|:---------------------------:|:---------------------------:|:-------------------------------:|:--------------------------:|:--------------------------:|:--------------------------:|:------------------------------:|
34-
| australiaeast | | - | - | - ||| -| - ||||||
35-
| canadaeast || - | - | - ||| -| - || - ||||
36-
| eastus | |||| - | - ||| - || - |||
37-
| eastus2 | |||| - || - || - || - |||
38-
| francecentral | | - | - | - ||| - | - |||| - ||
39-
| japaneast || - | - | - | - | - | - | - | - || - |||
40-
| koreacentral | - |- | - | - | - | - | - |- | - | - | - | - | - |
41-
| norwayeast | |- | - | - | - || - |- | - | - | - | - | - |
42-
| polandcentral | - | - | - | - | - | - | - | - | - | - | - | - | - |
43-
| southindia | | - | - | - | - || - | - | - | - ||| - |
44-
| swedencentral ||||||| - ||||| - ||
45-
| switzerlandnorth | - | - | - | - || - | - | - || - | - |||
46-
| uaenorth | | - | - | - | - | - | - | - | - | - | - | - | - |
47-
| uksouth | | - | - | - | - ||| - | - |||||
48-
| westus | |||| - || - || - | - ||| - |
49-
| westus3 | |||| - || - || - | - | - || - |
50-
32+
| REGION | o1 | o3-mini | gpt-4.1, 2025-04-14 | gpt-4.1-mini, 2025-04-14 | gpt-4.1-nano, 2025-04-14 | gpt-4o, 2024-05-13 | gpt-4o, 2024-08-06 | gpt-4o, 2024-11-20 | gpt-4o-mini, 2024-07-18 | gpt-4, 0613 | gpt-4, turbo-2024-04-09 | gpt-4-32k, 0613 | gpt-35-turbo, 1106 | gpt-35-turbo, 0125 |
33+
|------------------|----|---------|---------------------|--------------------------|--------------------------|--------------------|--------------------|--------------------|-------------------------|-------------|-------------------------|-----------------|--------------------|--------------------|
34+
| australiaeast | | | | | | | | X | | X | | X | X | X |
35+
| canadaeast | | | | | | | | X | | X | | X | X | X |
36+
| eastus | | | | | | X | X | X | X | X | X | | | X |
37+
| eastus2 | | | | | | X | X | X | X | X | X | | | X |
38+
| francecentral | | | | | | | | X | | X | | X | X | X |
39+
| japaneast | | | | | | | | X | | | | | | X |
40+
| koreacentral | | | | | | | | | | | | | | |
41+
| norwayeast | | | | | | | | X | | | | | | |
42+
| polandcentral | | | | | | | | | | | | | | |
43+
| southindia | | | | | | | | X | | | | | X | |
44+
| swedencentral | | | | | | X | X | X | X | X | X | X | X | X |
45+
| switzerlandnorth | | | | | | | | X | | X | | X | | X |
46+
| uaenorth | | | | | | | | | | | | | | |
47+
| uksouth | | | | | | | | X | | | | | X | X |
48+
| westus | | | | | | X | X | X | X | | X | | X | |
49+
| westus3 | | | | | | X | X | X | X | | X | | | |
5150

5251
## Non-Microsoft models
5352

5453
The Azure AI Agent Service also supports the following models from the Azure AI Foundry model catalog.
5554

5655
* Meta-Llama-405B-Instruct
57-
* Cohere-command-r-plus
58-
* Cohere-command-r
5956

6057
To use these models, you can use [Azure AI Foundry portal](https://ai.azure.com/) to make a deployment, and then reference the deployment name in your agent. For example:
6158

6259
```python
6360
agent = project_client.agents.create_agent( model="llama-3", name="my-agent", instructions="You are a helpful agent" )
6461
```
62+
## Azure AI Foundry models
63+
64+
### Models with tool-calling
65+
66+
To best support agentic scenarios, we recommend using models that support tool-calling. The Azure AI Agent Service currently supports all agent-compatible models from the Azure AI Foundry model catalog.
67+
68+
To use these models, use the [Azure AI Foundry portal](https://ai.azure.com/) to make a model deployment, then reference the deployment name in your agent. For example:
69+
70+
`agent = project_client.agents.create_agent( model="llama-3", name="my-agent", instructions="You are a helpful agent")`
71+
72+
> [!NOTE]
73+
> This option should only be used for open-source models (for example, Cepstral, Mistral, Llama) and not for OpenAI models, which are natively supported in the service. This option should also only be used for models that support tool-calling.
74+
75+
### Models without tool-calling
76+
77+
Though tool-calling support is a core capability for agentic scenarios, we now provide the ability to use models that don’t support tool-calling in our API and SDK. This option can be helpful when you have specific use-cases that don’t require tool-calling.
78+
79+
The following steps will allow you to utilize any chat-completion model that is available through a [serverless API](/azure/ai-foundry/how-to/model-catalog-overview):
80+
81+
82+
83+
1. Deploy your desired model through serverless API. Model will show up on your **Models + Endpoints** page.
84+
85+
1. Click on model name to see model details, where you'll find your model's target URI and key.
86+
87+
1. Create a new Serverless connection on **Connected Resources** page, using the target URI and key.
88+
89+
The model can now be referenced in your code (`Target URI` + `@` + `Model Name`), for example:
90+
91+
`Model=https://Phi-4-mejco.eastus.models.ai.azure.com/@Phi-4-mejco`
6592

6693
## Next steps
6794

articles/ai-services/openai/api-version-deprecation.md

Lines changed: 12 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -18,17 +18,20 @@ This article is to help you understand the support lifecycle for the Azure OpenA
1818

1919

2020
> [!NOTE]
21-
> New API response objects may be added to the API response without version changes. We recommend you only parse the response objects you require.
21+
> New API response objects may be added to the API response without version changes. We recommend you only parse the response objects you require.
22+
>
23+
> The latest Azure OpenAI spec uses OpenAPI 3.1. It is a known issue that this is currently not fully supported by [Azure API Management](/azure/api-management/api-management-key-concepts)
2224
2325
## Latest preview API releases
2426

2527
Azure OpenAI API latest release:
2628

27-
- Inference: [2025-03-01-preview](reference-preview.md)
28-
- Authoring: [2025-03-01-preview](https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/data-plane/AzureOpenAI/authoring/preview/)
29+
- Inference: [2025-04-01-preview](reference-preview.md)
30+
- Authoring: [2025-04-01-preview](authoring-reference-preview.md)
2931

3032
This version contains support for the latest Azure OpenAI features including:
3133

34+
- `GPT-image-1`, the evaluations API, reasoning summary with `o3` and `o4-mini` . [**Added in 2025-04-01-preview**]
3235
- [Responses API & support for `computer-use-preview` model](./how-to/responses.md) [**Added in 2025-03-01-preview**]
3336
- [Stored Completions (distillation) API](./how-to/stored-completions.md#stored-completions-api) [**Added in 2025-02-01-preview**]
3437
- [Predicted Outputs](./how-to/predicted-outputs.md) [**Added in 2025-01-01-preview**]
@@ -44,6 +47,12 @@ This version contains support for the latest Azure OpenAI features including:
4447
- [Function calling](./how-to/function-calling.md) [**Added in 2023-07-01-preview**]
4548
- [Retrieval augmented generation with your data feature](./use-your-data-quickstart.md). [**Added in 2023-06-01-preview**]
4649

50+
## Changes between 2025-04-01-preview and 2025-03-01-preview
51+
52+
- [`GPT-image-1` support](/azure/ai-services/openai/how-to/dall-e)
53+
- [Reasoning summary for `o3` and `o4-mini`](/azure/ai-services/openai/how-to/reasoning)
54+
- [Evaluation API](/azure/ai-services/openai/authoring-reference-preview#evaluation---create)
55+
4756
## Changes between 2025-03-01-preview and 2025-02-01-preview
4857

4958
- [Responses API](./how-to/responses.md)

articles/ai-services/openai/includes/api-surface.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@ Each API surface/specification encapsulates a different set of Azure OpenAI capa
2222
| API | Latest preview release | Latest GA release | Specifications | Description |
2323
|:---|:----|:----|:----|:---|
2424
| **Control plane** | [`2024-06-01-preview`](/rest/api/aiservices/accountmanagement/operation-groups?view=rest-aiservices-accountmanagement-2024-06-01-preview&preserve-view=true) | [`2024-10-01`](/rest/api/aiservices/accountmanagement/deployments/create-or-update?view=rest-aiservices-accountmanagement-2024-10-01&tabs=HTTP&preserve-view=true) | [Spec files](https://github.com/Azure/azure-rest-api-specs/tree/main/specification/cognitiveservices/resource-manager/Microsoft.CognitiveServices) | Azure OpenAI shares a common control plane with all other Azure AI Services. The control plane API is used for things like [creating Azure OpenAI resources](/rest/api/aiservices/accountmanagement/accounts/create?view=rest-aiservices-accountmanagement-2023-05-01&tabs=HTTP&preserve-view=true), [model deployment](/rest/api/aiservices/accountmanagement/deployments/create-or-update?view=rest-aiservices-accountmanagement-2023-05-01&tabs=HTTP&preserve-view=true), and other higher level resource management tasks. The control plane also governs what is possible to do with capabilities like Azure Resource Manager, Bicep, Terraform, and Azure CLI.|
25-
| **Data plane - authoring** | `2025-04-01-preview` | `2024-10-21` | [Spec files](https://github.com/Azure/azure-rest-api-specs/tree/main/specification/cognitiveservices/data-plane/AzureOpenAI/authoring) | The data plane authoring API controls [fine-tuning](/rest/api/azureopenai/fine-tuning?view=rest-azureopenai-2024-08-01-preview&preserve-view=true), [file-upload](/rest/api/azureopenai/files/upload?view=rest-azureopenai-2024-08-01-preview&tabs=HTTP&preserve-view=true), [ingestion jobs](/rest/api/azureopenai/ingestion-jobs/create?view=rest-azureopenai-2024-08-01-preview&tabs=HTTP&preserve-view=true), [batch](/rest/api/azureopenai/batch?view=rest-azureopenai-2024-08-01-preview&tabs=HTTP&preserve-view=true) and certain [model level queries](/rest/api/azureopenai/models/get?view=rest-azureopenai-2024-08-01-preview&tabs=HTTP&preserve-view=true)
25+
| **Data plane - authoring** | [`2025-04-01-preview`](/azure/ai-services/openai/authoring-reference-preview) | `2024-10-21` | [Spec files](https://github.com/Azure/azure-rest-api-specs/tree/main/specification/cognitiveservices/data-plane/AzureOpenAI/authoring) | The data plane authoring API controls [fine-tuning](/rest/api/azureopenai/fine-tuning?view=rest-azureopenai-2024-08-01-preview&preserve-view=true), [file-upload](/rest/api/azureopenai/files/upload?view=rest-azureopenai-2024-08-01-preview&tabs=HTTP&preserve-view=true), [ingestion jobs](/rest/api/azureopenai/ingestion-jobs/create?view=rest-azureopenai-2024-08-01-preview&tabs=HTTP&preserve-view=true), [batch](/rest/api/azureopenai/batch?view=rest-azureopenai-2024-08-01-preview&tabs=HTTP&preserve-view=true) and certain [model level queries](/rest/api/azureopenai/models/get?view=rest-azureopenai-2024-08-01-preview&tabs=HTTP&preserve-view=true)
2626
| **Data plane - inference** | [`2025-04-01-preview`](/azure/ai-services/openai/reference-preview#data-plane-inference) | [`2024-10-21`](/azure/ai-services/openai/reference#data-plane-inference) | [Spec files](https://github.com/Azure/azure-rest-api-specs/tree/main/specification/cognitiveservices/data-plane/AzureOpenAI/inference) | The data plane inference API provides the inference capabilities/endpoints for features like completions, chat completions, embeddings, audio, on your data, Dall-e, assistants, etc. |
2727

2828
## Authentication

0 commit comments

Comments
 (0)