You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-foundry/model-inference/includes/use-chat-reasoning/about-reasoning.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,7 +9,7 @@ author: santiagxf
9
9
10
10
## Reasoning models
11
11
12
-
Reasoning models can reach higher levels of performance in domains like math, coding, science, strategy, and logistics. The way these models produces outputs is by explicitly using chain of thought to explore all possible paths before generating an answer. They verify their answers as they produce them which helps them to arrive to better more accurate conclusions. This means that reasoning models may require less context in prompting in order to produce effective results.
12
+
Reasoning models can reach higher levels of performance in domains like math, coding, science, strategy, and logistics. The way these models produce outputs is by explicitly using chain of thought to explore all possible paths before generating an answer. They verify their answers as they produce them which helps them to arrive to better more accurate conclusions. This means that reasoning models may require less context in prompting in order to produce effective results.
13
13
14
14
Such way of scaling model's performance is referred as *inference compute time* as it trades performance against higher latency and cost. It contrasts to other approaches that scale through *training compute time*.
15
15
@@ -19,4 +19,4 @@ Reasoning models then produce two types of outputs:
19
19
> * Reasoning completions
20
20
> * Output completions
21
21
22
-
Both of these completions count towards content generated from the model and hence, towards the token limits and costs associated with the model. Some models may output the reasoning content, like `DeepSeek-R1`. Some others, like `o1`, only outputs the output piece of the completions.
22
+
Both of these completions count towards content generated from the model and hence, towards the token limits and costs associated with the model. Some models may output the reasoning content, like `DeepSeek-R1`. Some others, like `o1`, only outputs the output piece of the completions.
Copy file name to clipboardExpand all lines: articles/ai-services/openai/how-to/fine-tuning-direct-preference-optimization.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -51,7 +51,7 @@ Training datasets must be in `jsonl` format:
51
51
52
52
## Direct preference optimization model support
53
53
54
-
-`gpt-4o-2024-08-06` supports direct preference optimization in its respective fine-tuning regions. Latest region availability is updated in the [models page](../concepts/models.md#fine-tuning-models)
54
+
-`gpt-4o-2024-08-06`,`gpt-4.1-2025-04-14`,`gpt-4.1-mini-2025-04-14` supports direct preference optimization in its respective fine-tuning regions. Latest region availability is updated in the [models page](../concepts/models.md#fine-tuning-models)
55
55
56
56
Users can use preference fine tuning with base models as well as models that have already been fine-tuned using supervised fine-tuning as long as they are of a supported model/version.
57
57
@@ -70,4 +70,4 @@ Users can use preference fine tuning with base models as well as models that hav
70
70
71
71
- Explore the fine-tuning capabilities in the [Azure OpenAI fine-tuning tutorial](../tutorials/fine-tune.md).
Copy file name to clipboardExpand all lines: articles/ai-services/openai/how-to/reasoning.md
+25-22Lines changed: 25 additions & 22 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,7 +5,7 @@ description: Learn how to use Azure OpenAI's advanced o3-mini, o1, & o1-mini rea
5
5
manager: nitinme
6
6
ms.service: azure-ai-openai
7
7
ms.topic: include
8
-
ms.date: 04/18/2025
8
+
ms.date: 06/17/2025
9
9
author: mrbullwinkle
10
10
ms.author: mbullwin
11
11
---
@@ -28,40 +28,43 @@ Azure OpenAI `o-series` models are designed to tackle reasoning and problem-solv
28
28
29
29
| Model | Region | Limited access |
30
30
|---|---|---|
31
-
|`o4-mini`| East US2 (Global Standard) <br><br> Sweden Central (Global Standard) | No access request needed to use the core capabilities of this model.<br><br> Request access: [o4-mini reasoning summary feature](https://aka.ms/oai/o3access)|
32
-
|`o3`| East US2 (Global Standard) <br><br> Sweden Central (Global Standard) | Request access: [o3 limited access model application](https://aka.ms/oai/o3access)|
31
+
|`o3-pro`| East US2 & Sweden Central (Global Standard) | Request access: [o3 limited access model application](https://aka.ms/oai/o3access). If you already have `o3 access` no request is required for `o3-pro`. |
32
+
|`codex-mini`| East US2 & Sweden Central (Global Standard) | No access request needed. |
33
+
|`o4-mini`|[Model availability](../concepts/models.md#global-standard-model-availability)| No access request needed to use the core capabilities of this model.<br><br> Request access: [o4-mini reasoning summary feature](https://aka.ms/oai/o3access)|
34
+
|`o3`|[Model availability](../concepts/models.md#global-standard-model-availability)| Request access: [o3 limited access model application](https://aka.ms/oai/o3access)|
33
35
|`o3-mini`|[Model availability](../concepts/models.md#global-standard-model-availability). | Access is no longer restricted for this model. |
34
36
|`o1`|[Model availability](../concepts/models.md#global-standard-model-availability). | Access is no longer restricted for this model. |
35
37
|`o1-preview`|[Model availability](../concepts/models.md#global-standard-model-availability). |This model is only available for customers who were granted access as part of the original limited access release. We're currently not expanding access to `o1-preview`. |
36
38
|`o1-mini`|[Model availability](../concepts/models.md#global-standard-model-availability). | No access request needed for Global Standard deployments.<br><br>Standard (regional) deployments are currently only available to select customers who were previously granted access as part of the `o1-preview` release.|
|**API Version**|`2025-04-01-preview`|`2025-04-01-preview`|`2024-12-01-preview` or later <br> `2025-03-01-preview` (Recommended) |`2024-12-01-preview` or later <br> `2025-03-01-preview` (Recommended) |`2024-09-01-preview` or later <br> `2025-03-01-preview` (Recommended) |`2024-09-01-preview` or later <br> `2025-03-01-preview` (Recommended) |
|**API Version**|`2025-04-01-preview` & [v1 preview](../api-version-lifecycle.md#api-evolution)|`2025-04-01-preview` & [v1 preview](../api-version-lifecycle.md#api-evolution)|`2025-04-01-preview`|`2025-04-01-preview`|`2024-12-01-preview` or later <br> `2025-03-01-preview` (Recommended) |`2024-12-01-preview` or later <br> `2025-03-01-preview` (Recommended) |`2024-09-01-preview` or later <br> `2025-03-01-preview` (Recommended) |`2024-09-01-preview` or later <br> `2025-03-01-preview` (Recommended) |
<sup>1</sup> Reasoning models will only work with the `max_completion_tokens` parameter. <br><br>
58
-
59
60
<sup>2</sup> The latest o<sup>*</sup> series model support system messages to make migration easier. When you use a system message with `o4-mini`, `o3`, `o3-mini`, and `o1` it will be treated as a developer message. You should not use both a developer message and a system message in the same API request.
60
-
61
61
<sup>3</sup> Access to the chain-of-thought reasoning summary is limited access only for `o3` & `o4-mini`.
62
-
63
62
<sup>4</sup> Streaming for `o3` is limited access only.
64
63
64
+
> [!NOTE]
65
+
> - To avoid timeouts [background mode](./responses.md#background-tasks) is recommended for `o3-pro`.
66
+
> -`o3-pro` does not currently support image generation.
67
+
65
68
### Not Supported
66
69
67
70
The following are currently unsupported with reasoning models:
0 commit comments