You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-foundry/model-inference/overview.md
+15-18Lines changed: 15 additions & 18 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -18,30 +18,27 @@ recommendations: false
18
18
19
19
Azure AI model inference provides access to the most powerful models available in the Azure AI model catalog. The models come from key model providers in the industry, including OpenAI, Microsoft, Meta, Mistral, Cohere, G42, and AI21 Labs. These models can be integrated with software solutions to deliver a wide range of tasks that include content generation, summarization, image understanding, semantic search, and code generation.
20
20
21
-
> [!TIP]
22
-
> To deploy DeepSeek-R1 or OpenAI o3-mini in Azure AI model inference, follow the steps at [Add and configure models](how-to/create-model-deployments.md).
23
-
24
21
Azure AI model inference provides a way to **consume models as APIs without hosting them on your infrastructure**. Models are hosted in a Microsoft-managed infrastructure, which enables API-based access to the model provider's model. API-based access can dramatically reduce the cost of accessing a model and simplify the provisioning experience.
25
22
26
23
Azure AI model inference is part of Azure AI Services, and users can access the service through [REST APIs](./reference/reference-model-inference-api.md), [SDKs in several languages](supported-languages.md) such as Python, C#, JavaScript, and Java. You can also use the Azure AI model inference from [Azure AI Foundry by configuring a connection](how-to/configure-project-connection.md).
27
24
28
25
## Models
29
26
30
-
You can get access to the key model providers in the industry including OpenAI, Microsoft, Meta, Mistral, Cohere, G42, and AI21 Labs. Model providers define the license terms and set the price for use of their models. The following list shows all the models available:
31
-
32
-
To see details for each model including, language, types, and capabilities, see [Models](concepts/models.md) article.
You can get access to the key model providers in the industry including OpenAI, Microsoft, Meta, Mistral, Cohere, G42, and AI21 Labs. Model providers define the license terms and set the price for use of their models.
28
+
29
+
Explore the following model families available:
30
+
31
+
-[AI21 Labs](concepts/models.md#ai21-labs)
32
+
-[Azure OpenAI](concepts/models.md#azure-openai)
33
+
-[Cohere](concepts/models.md#cohere)
34
+
-[Core42](concepts/models.md#core42)
35
+
-[DeepSeek](concepts/models.md#deepseek)
36
+
-[Meta](concepts/models.md#meta)
37
+
-[Microsoft](concepts/models.md#microsoft)
38
+
-[Mistral AI](concepts/models.md#mistral-ai)
39
+
-[NTT Data](concepts/models.md#ntt-data)
40
+
41
+
To see details for each model including language, types, and capabilities, see [Models](concepts/models.md) article.
Copy file name to clipboardExpand all lines: articles/ai-services/openai/concepts/model-retirements.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -103,7 +103,7 @@ These models are currently available for use in Azure OpenAI Service.
103
103
|`gpt-4`| 1106-preview | To be upgraded to **`gpt-4o` version: `2024-11-20`**, starting no sooner than April 17, 2025 **<sup>1</sup>** <br>Retirement date: May 1, 2025 |`gpt-4o`|
104
104
|`gpt-4`| 0125-preview |To be upgraded to **`gpt-4o` version: `2024-11-20`**, starting no sooner than April 17, 2025 **<sup>1</sup>** <br>Retirement date: May 1, 2025 |`gpt-4o`|
105
105
|`gpt-4`| vision-preview | To be upgraded to **`gpt-4o` version: `2024-11-20`**, starting no sooner than April 17, 2025 **<sup>1</sup>** <br>Retirement date: May 1, 2025 |`gpt-4o`|
106
-
|`gpt-4.5-preview`| 2025-02-27 |No earlier than July 02, 2025 |`gpt-4.1`|
106
+
|`gpt-4.5-preview`| 2025-02-27 | July 14, 2025 |`gpt-4.1`|
107
107
|`gpt-4.1`| 2025-04-14 | No earlier than April 11, 2026 ||
108
108
|`gpt-4.1-mini`| 2025-04-14 | No earlier than April 11, 2026 |
109
109
|`gpt-4.1-nano`| 2025-04-14 | No earlier than April 11, 2026 |
Copy file name to clipboardExpand all lines: articles/ai-services/openai/how-to/function-calling.md
+4-1Lines changed: 4 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -41,7 +41,6 @@ At a high level you can break down working with functions into three steps:
41
41
*`gpt-4o-mini` (`2024-07-18`)
42
42
*`gpt-4.5-preview` (`2025-02-27`)
43
43
*`gpt-4.1` (`2025-04-14`)
44
-
*`gpt-4.1-nano` (`2025-04-14`)
45
44
*`gpt-4.1-mini` (`2025-04-14`)
46
45
47
46
Support for parallel function was first added in API version [`2023-12-01-preview`](https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/data-plane/AzureOpenAI/inference/preview/2023-12-01-preview/inference.json)
@@ -51,6 +50,7 @@ Support for parallel function was first added in API version [`2023-12-01-previe
51
50
* All the models that support parallel function calling
52
51
*`o4-mini` (`2025-04-16`)
53
52
*`o3` (`2025-04-16`)
53
+
*`gpt-4.1-nano` (`2025-04-14`)
54
54
*`o3-mini` (`2025-01-31`)
55
55
*`o1` (`2024-12-17`)
56
56
*`gpt-4` (`0613`)
@@ -61,6 +61,9 @@ Support for parallel function was first added in API version [`2023-12-01-previe
61
61
> [!NOTE]
62
62
> The `tool_choice` parameter is now supported with `o3-mini` and `o1`. For more information on what parameters are supported with the o-series models see, the [reasoning models guide](./reasoning.md).
63
63
64
+
> [!IMPORTANT]
65
+
> Tool/function descriptions are currently limited to 1024 characters with Azure OpenAI. We will update this article if this limit is changed.
66
+
64
67
## Single tool/function calling example
65
68
66
69
First we will demonstrate a simple toy function call that can check the time in three hardcoded locations with a single tool/function defined. We have added print statements to help make the code execution easier to follow:
0 commit comments