Skip to content

Commit 4c5909e

Browse files
Merge pull request #2364 from mrbullwinkle/mrb_01_16_2025_reasoning_updates
[Azure OpenAI] Reasoning model updates
2 parents 15d2d31 + 151ad7d commit 4c5909e

File tree

2 files changed

+28
-14
lines changed

2 files changed

+28
-14
lines changed

articles/ai-services/openai/how-to/reasoning.md

Lines changed: 27 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: Learn how to use Azure OpenAI's advanced o1 series reasoning models
55
manager: nitinme
66
ms.service: azure-ai-openai
77
ms.topic: include
8-
ms.date: 12/17/2024
8+
ms.date: 01/16/2025
99
author: mrbullwinkle
1010
ms.author: mbullwin
1111
---
@@ -38,31 +38,45 @@ Once access has been granted, you'll need to create a deployment for each model.
3838
| `o1-preview` | See [models page](../concepts/models.md#global-standard-model-availability). |
3939
| `o1-mini` | See [models page](../concepts/models.md#global-standard-model-availability). |
4040

41-
## API support
41+
## API & feature support
4242

43-
Initial support for the **o1-preview** and **o1-mini** preview models was added in API version `2024-09-01-preview`.
43+
| **Feature** | **o1**, **2024-12-17** | **o1-preview**, **2024-09-12** | **o1-mini**, **2024-09-12** |
44+
|:-------------------|:--------------------------:|:--------------------------:|:-------------------------------:|
45+
| **API Version** | `2024-12-01-preview` | `2024-09-01-preview` <br> `2024-10-01-preview` <br> `2024-12-01-preview` | `2024-09-01-preview` <br> `2024-10-01-preview` <br> `2024-12-01-preview` |
46+
| **[Developer Messages](#developer-messages)** || - | - |
47+
| **[Structured Outputs](./structured-outputs.md)** || - | - |
48+
| **[Context Window](../concepts/models.md#o1-and-o1-mini-models-limited-access)** | Input: 200,000 <br> Output: 100,000 | Input: 128,000 <br> Output: 32,768 | Input: 128,000 <br> Output: 65,536 |
49+
| **[Reasoning effort](#reasoning-effort)** || - | - |
50+
| System Messages | - | - | - |
51+
| `max_completion_tokens` ||||
4452

45-
As part of this release, the `max_tokens` parameter was deprecated and replaced with the new `max_completion_tokens` parameter. **o1 series** models will only work with the `max_completion_tokens` parameter.
53+
**o1 series** models will only work with the `max_completion_tokens` parameter.
4654

47-
The latest most capable **o1 series** model is `o1` **Version: 2024-12-17**. This general availability (GA) model should be used with API version `2024-12-01-preview`.
55+
> [!IMPORTANT]
56+
> There is a known issue with the `o1` model and the `tool_choice` parameter. Currently function calls that include the optional `tool_choice` parameter will fail. this page will be updated once the issue is resolved.
4857
49-
### 2024-12-01-preview
58+
### Not Supported
5059

51-
`2024-12-01-preview` adds support for the new `reasoning_effort` parameter, [structured outputs](./structured-outputs.md), and developer messages. The older preview reasoning models do not currently support these features. For reasoning models, these features are currently only available with `o1` **Version: 2024-12-17**.
60+
The following are currently unsupported with o1-series models:
61+
62+
- System Messages
63+
- Streaming
64+
- Parallel tool calling
65+
- `temperature`, `top_p`, `presence_penalty`, `frequency_penalty`, `logprobs`, `top_logprobs`, `logit_bias`, `max_tokens`
5266

5367
## Usage
5468

55-
These models do not currently support the same set of parameters as other models that use the chat completions API. Only a limited subset is currently supported. Using standard parameters like `temperature` and `top_p` will result in errors.
69+
These models [don't currently support the same set of parameters](#api--feature-support) as other models that use the chat completions API.
5670

5771
# [Python (Microsoft Entra ID)](#tab/python-secure)
5872

59-
You will need to upgrade your OpenAI client library for access to the latest parameters.
73+
You'll need to upgrade your OpenAI client library for access to the latest parameters.
6074

6175
```cmd
6276
pip install openai --upgrade
6377
```
6478

65-
If you are new to using Microsoft Entra ID for authentication see [How to configure Azure OpenAI Service with Microsoft Entra ID authentication](../how-to/managed-identity.md).
79+
If you're new to using Microsoft Entra ID for authentication see [How to configure Azure OpenAI Service with Microsoft Entra ID authentication](../how-to/managed-identity.md).
6680

6781
```python
6882
from openai import AzureOpenAI
@@ -218,10 +232,10 @@ print(response.model_dump_json(indent=2))
218232
}
219233
```
220234

221-
235+
## Reasoning effort
222236

223237
> [!NOTE]
224-
> Reasoning models have `reasoning_tokens` as part of `completion_tokens_details` in the model response. These are hidden tokens that are not returned as part of the message response content but are used by the model to help generate a final answer to your request. `2024-12-01-preview` adds an additional new parameter `reasoning_effort` which can be set to `low`, `medium`, or `high` with the latest `o1` model. The higher the effort setting, the longer the model will spend processing the request, which will generally result in a larger number of `reasoning_tokens`.
238+
> Reasoning models have `reasoning_tokens` as part of `completion_tokens_details` in the model response. These are hidden tokens that aren't returned as part of the message response content but are used by the model to help generate a final answer to your request. `2024-12-01-preview` adds an additional new parameter `reasoning_effort` which can be set to `low`, `medium`, or `high` with the latest `o1` model. The higher the effort setting, the longer the model will spend processing the request, which will generally result in a larger number of `reasoning_tokens`.
225239
226240
## Developer messages
227241

@@ -234,7 +248,7 @@ Adding a developer message to the previous code example would look as follows:
234248

235249
# [Python (Microsoft Entra ID)](#tab/python-secure)
236250

237-
You will need to upgrade your OpenAI client library for access to the latest parameters.
251+
You'll need to upgrade your OpenAI client library for access to the latest parameters.
238252

239253
```cmd
240254
pip install openai --upgrade

articles/ai-services/openai/overview.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@ Azure OpenAI Service provides REST API access to OpenAI's powerful language mode
2020

2121
| Feature | Azure OpenAI |
2222
| --- | --- |
23-
| Models available | **o1** & **o1-mini** - (Limited Access - [Request Access](https://aka.ms/OAI/o1access))<br>**GPT-4o & GPT-4o mini**<br> **GPT-4 series (including GPT-4 Turbo with Vision)** <br>**GPT-3.5-Turbo series**<br> Embeddings series <br> Learn more in our [Models](./concepts/models.md) page.|
23+
| Models available | [**o1** & **o1-mini**](./how-to/reasoning.md) - (Limited Access - [Request Access](https://aka.ms/OAI/o1access))<br>**GPT-4o & GPT-4o mini**<br> **GPT-4 series (including GPT-4 Turbo with Vision)** <br>**GPT-3.5-Turbo series**<br> Embeddings series <br> Learn more in our [Models](./concepts/models.md) page.|
2424
| Fine-tuning | `GPT-4o-mini` (preview) <br> `GPT-4` (preview) <br>`GPT-3.5-Turbo` (0613) <br> `babbage-002` <br> `davinci-002`.|
2525
| Price | [Available here](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) <br> For details on vision-enabled chat models, see the [special pricing information](../openai/concepts/gpt-with-vision.md#special-pricing-information).|
2626
| Virtual network support & private link support | Yes. |

0 commit comments

Comments
 (0)