Skip to content

Commit 4d1c1ef

Browse files
Merge pull request #2670 from mrbullwinkle/mrb_01_30_2025_o3_mini
[Azure OpenAI] [Release branch] vNext
2 parents 2435186 + c037462 commit 4d1c1ef

File tree

7 files changed

+50
-41
lines changed

7 files changed

+50
-41
lines changed

articles/ai-services/openai/concepts/models.md

Lines changed: 8 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
1818

1919
| Models | Description |
2020
|--|--|
21-
| [o1 & o1-mini](#o1-and-o1-mini-models) |[Reasoning models](../how-to/reasoning.md) with advanced problem-solving and increased focus and capability. |
21+
| [o-series models](#o-series-models) |[Reasoning models](../how-to/reasoning.md) with advanced problem-solving and increased focus and capability. |
2222
| [GPT-4o & GPT-4o mini & GPT-4 Turbo](#gpt-4o-and-gpt-4-turbo) | The latest most capable Azure OpenAI models with multimodal versions, which can accept both text and images as input. |
2323
| [GPT-4o audio](#gpt-4o-audio) | GPT-4o audio models that support either low-latency, "speech in, speech out" conversational interactions or audio generation. |
2424
| [GPT-4](#gpt-4) | A set of models that improve on GPT-3.5 and can understand and generate natural language and code. |
@@ -28,30 +28,32 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
2828
| [Whisper](#whisper-models) | A series of models in preview that can transcribe and translate speech to text. |
2929
| [Text to speech](#text-to-speech-models-preview) (Preview) | A series of models in preview that can synthesize text to speech. |
3030

31-
## o1 and o1-mini models
31+
## o-series models
3232

33-
The Azure OpenAI `o1` and `o1-mini` models are specifically designed to tackle reasoning and problem-solving tasks with increased focus and capability. These models spend more time processing and understanding the user's request, making them exceptionally strong in areas like science, coding, and math compared to previous iterations.
33+
The Azure OpenAI o<sup>&#42;</sup> series models are specifically designed to tackle reasoning and problem-solving tasks with increased focus and capability. These models spend more time processing and understanding the user's request, making them exceptionally strong in areas like science, coding, and math compared to previous iterations.
3434

3535
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
3636
| --- | :--- |:--- |:---: |
37+
| `o3-mini` (2025-01-31) | The latest reasoning model, offering [enhanced reasoning abilities](../how-to/reasoning.md). <br> - Structured outputs<br> - Text-only processing <br> - Functions/Tools <br> <br> **Request access: [limited access model application](https://aka.ms/OAI/o1access)** | Input: 200,000 <br> Output: 100,000 | Oct 2023 |
3738
| `o1` (2024-12-17) | The most capable model in the o1 series, offering [enhanced reasoning abilities](../how-to/reasoning.md). <br> - Structured outputs<br> - Text, image processing <br> - Functions/Tools <br> <br> **Request access: [limited access model application](https://aka.ms/OAI/o1access)** | Input: 200,000 <br> Output: 100,000 | Oct 2023 |
3839
|`o1-preview` (2024-09-12) | Older preview version | Input: 128,000 <br> Output: 32,768 | Oct 2023 |
3940
| `o1-mini` (2024-09-12) | A faster and more cost-efficient option in the o1 series, ideal for coding tasks requiring speed and lower resource consumption.| Input: 128,000 <br> Output: 65,536 | Oct 2023 |
4041

4142
### Availability
4243

43-
The `o1` and `o1-mini` models are now available for API access and model deployment. **For access to `o1` registration is required, and access will be granted based on Microsoft's eligibility criteria**. Customers who previously applied and received access to `o1-preview`, don't need to reapply as they are automatically on the wait-list for the latest model.
44+
**For access to `o3-mini` and `o1` registration is required, and access will be granted based on Microsoft's eligibility criteria**. Customers who previously applied and received access to `o1-preview`, or `o1` don't need to reapply as they are automatically on the wait-list for the latest models in the o-series.
4445

4546
Request access: [limited access model application](https://aka.ms/OAI/o1access)
4647

47-
Once access has been granted, you will need to create a deployment for each model. If you have an existing `o1-preview` deployment in place upgrade is currently not supported, you will need to create a new deployment.
48+
Once access has been granted, you will need to create a deployment for each model.
4849

49-
To learn more about the advanced `o1` series models see, [getting started with o1 series reasoning models](../how-to/reasoning.md).
50+
To learn more about the advanced `o-series` models see, [getting started with reasoning models](../how-to/reasoning.md).
5051

5152
### Region availability
5253

5354
| Model | Region |
5455
|---|---|
56+
|`o3-mini` | East US2 (Global Standard) <br> Sweden Central (Global Standard) |
5557
|`o1` | East US2 (Global Standard) <br> Sweden Central (Global Standard) |
5658
| `o1-preview` | See the [models table](#global-standard-model-availability). |
5759
| `o1-mini` | See the [models table](#global-provisioned-managed-model-availability). |

articles/ai-services/openai/how-to/function-calling.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.author: mbullwin #delegenz
77
ms.service: azure-ai-openai
88
ms.custom: devx-track-python
99
ms.topic: how-to
10-
ms.date: 01/17/2025
10+
ms.date: 01/30/2025
1111
manager: nitinme
1212
---
1313

@@ -45,14 +45,15 @@ Support for parallel function was first added in API version [`2023-12-01-previe
4545
### Basic function calling with tools
4646

4747
* All the models that support parallel function calling
48+
* `o3-mini` (`2025-01-31`)
4849
* `o1` (`2024-12-17`)
4950
* `gpt-4` (`0613`)
5051
* `gpt-4-32k` (`0613`)
5152
* `gpt-35-turbo-16k` (`0613`)
5253
* `gpt-35-turbo` (`0613`)
5354

54-
> [!IMPORTANT]
55-
> There is a known issue with the `o1` model and the `tool_choice` parameter. Currently function calls that include the optional `tool_choice` parameter will fail. This page will be updated once the issue is resolved. For more information on what parameters are supported with the o1-series models see, the [reasoning models guide](./reasoning.md).
55+
> [!NOTE]
56+
> The `tool_choice` parameter is now supported with `o3-mini` and `o1`. For more information on what parameters are supported with the o-series models see, the [reasoning models guide](./reasoning.md).
5657
5758
## Single tool/function calling example
5859

articles/ai-services/openai/how-to/reasoning.md

Lines changed: 23 additions & 25 deletions
Original file line numberDiff line numberDiff line change
@@ -5,17 +5,17 @@ description: Learn how to use Azure OpenAI's advanced o1 series reasoning models
55
manager: nitinme
66
ms.service: azure-ai-openai
77
ms.topic: include
8-
ms.date: 01/16/2025
8+
ms.date: 01/30/2025
99
author: mrbullwinkle
1010
ms.author: mbullwin
1111
---
1212

1313

1414
# Azure OpenAI reasoning models
1515

16-
Azure OpenAI `o1` and `o1-mini` models are designed to tackle reasoning and problem-solving tasks with increased focus and capability. These models spend more time processing and understanding the user's request, making them exceptionally strong in areas like science, coding, and math compared to previous iterations.
16+
Azure OpenAI `o-series` models are designed to tackle reasoning and problem-solving tasks with increased focus and capability. These models spend more time processing and understanding the user's request, making them exceptionally strong in areas like science, coding, and math compared to previous iterations.
1717

18-
**Key capabilities of the o1 series:**
18+
**Key capabilities of the o-series models:**
1919

2020
- Complex Code Generation: Capable of generating algorithms and handling advanced coding tasks to support developers.
2121
- Advanced Problem Solving: Ideal for comprehensive brainstorming sessions and addressing multifaceted challenges.
@@ -24,45 +24,43 @@ Azure OpenAI `o1` and `o1-mini` models are designed to tackle reasoning and prob
2424

2525
## Availability
2626

27-
The **o1 series** models are now available for API access and model deployment. **For access to o1, and o1-preview registration is required, and access will be granted based on Microsoft's eligibility criteria**. Customers who previously applied and received access to `o1-preview`, don't need to reapply as they are automatically on the wait-list for the latest model.
27+
**For access to `o3-mini`, `o1`, and `o1-preview`, registration is required, and access will be granted based on Microsoft's eligibility criteria**.
2828

29-
Request access: [limited access model application](https://aka.ms/OAI/o1access)
29+
Customers who previously applied and received access to `o1` or `o1-preview`, don't need to reapply as they are automatically on the wait-list for the latest model.
3030

31-
Once access has been granted, you'll need to create a deployment for each model. If you have an existing `o1-preview` deployment, in-place upgrade is currently not supported, you'll need to create a new deployment.
31+
Request access: [limited access model application](https://aka.ms/OAI/o1access)
3232

3333
### Region availability
3434

3535
| Model | Region | Limited access |
3636
|---|---|---|
37+
| `o3-mini` | East US2 (Global Standard) <br> Sweden Central (Global Standard) | [Limited access model application](https://aka.ms/OAI/o1access) |
3738
|`o1` | East US2 (Global Standard) <br> Sweden Central (Global Standard) | [Limited access model application](https://aka.ms/OAI/o1access) |
3839
| `o1-preview` | See [models page](../concepts/models.md#global-standard-model-availability). | [Limited access model application](https://aka.ms/OAI/o1access) |
3940
| `o1-mini` | See [models page](../concepts/models.md#global-standard-model-availability). | No access request needed |
4041

4142
## API & feature support
4243

43-
| **Feature** | **o1**, **2024-12-17** | **o1-preview**, **2024-09-12** | **o1-mini**, **2024-09-12** |
44-
|:-------------------|:--------------------------:|:--------------------------:|:-------------------------------:|
45-
| **API Version** | `2024-12-01-preview` | `2024-09-01-preview` <br> `2024-10-01-preview` <br> `2024-12-01-preview` | `2024-09-01-preview` <br> `2024-10-01-preview` <br> `2024-12-01-preview` |
46-
| **[Developer Messages](#developer-messages)** || - | - |
47-
| **[Structured Outputs](./structured-outputs.md)** || - | - |
48-
| **[Context Window](../concepts/models.md#o1-and-o1-mini-models)** | Input: 200,000 <br> Output: 100,000 | Input: 128,000 <br> Output: 32,768 | Input: 128,000 <br> Output: 65,536 |
49-
| **[Reasoning effort](#reasoning-effort)** || - | - |
50-
| **[Vision Support](./gpt-with-vision.md)** || - | - |
51-
| Functions/Tools || - | - |
52-
| `max_completion_tokens` ||||
53-
| System Messages | - | - | - |
54-
55-
**o1 series** models will only work with the `max_completion_tokens` parameter.
56-
57-
> [!IMPORTANT]
58-
> There is a known issue with the `o1` model and the `tool_choice` parameter. Currently function calls that include the optional `tool_choice` parameter will fail. This page will be updated once the issue is resolved.
44+
| **Feature** | **o3-mini**, **2025-01-31** |**o1**, **2024-12-17** | **o1-preview**, **2024-09-12** | **o1-mini**, **2024-09-12** |
45+
|:-------------------|:--------------------------:|:--------------------------:|:-------------------------------:|:---:|
46+
| **API Version** | `2024-12-01-preview` <br> `2025-01-01-preview` | `2024-12-01-preview` <br> `2025-01-01-preview` | `2024-09-01-preview` <br> `2024-10-01-preview` <br> `2024-12-01-preview` | `2024-09-01-preview` <br> `2024-10-01-preview` <br> `2024-12-01-preview` |
47+
| **[Developer Messages](#developer-messages)** ||| - | - |
48+
| **[Structured Outputs](./structured-outputs.md)** ||| - | - |
49+
| **[Context Window](../concepts/models.md#o-series-models)** | Input: 200,000 <br> Output: 100,000 | Input: 200,000 <br> Output: 100,000 | Input: 128,000 <br> Output: 32,768 | Input: 128,000 <br> Output: 65,536 |
50+
| **[Reasoning effort](#reasoning-effort)** ||| - | - |
51+
| **[Vision Support](./gpt-with-vision.md)** | - || - | - |
52+
| Functions/Tools ||| - | - |
53+
| `max_completion_tokens`<sup>*</sup> |||||
54+
| System Messages<sup>**</sup> ||| - | - |
55+
| Streaming || - | - | - |
56+
57+
<sup>*</sup> Reasoning models will only work with the `max_completion_tokens` parameter. <br><br>
58+
<sup>**</sup>The latest o<sup>&#42;</sup> series model support system messages to make migration easier. When you use a system message with `o3-mini` and `o1` it will be treated as a developer message. You should not use both a developer message and a system message in the same API request.
5959

6060
### Not Supported
6161

62-
The following are currently unsupported with o1-series models:
62+
The following are currently unsupported with reasoning models:
6363

64-
- System Messages
65-
- Streaming
6664
- Parallel tool calling
6765
- `temperature`, `top_p`, `presence_penalty`, `frequency_penalty`, `logprobs`, `top_logprobs`, `logit_bias`, `max_tokens`
6866

articles/ai-services/openai/how-to/structured-outputs.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -24,6 +24,7 @@ Structured outputs make a model follow a [JSON Schema](https://json-schema.org/o
2424
2525
## Supported models
2626

27+
- `o3-mini` version `2025-01-31`
2728
- `o1` version: `2024-12-17`
2829
- `gpt-4o-mini` version: `2024-07-18`
2930
- `gpt-4o` version: `2024-08-06`

articles/ai-services/openai/overview.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,20 +7,20 @@ author: mrbullwinkle
77
ms.author: mbullwin
88
ms.service: azure-ai-openai
99
ms.topic: overview
10-
ms.date: 01/23/2025
10+
ms.date: 01/30/2025
1111
ms.custom: build-2023, build-2023-dataai
1212
recommendations: false
1313
---
1414

1515
# What is Azure OpenAI Service?
1616

17-
Azure OpenAI Service provides REST API access to OpenAI's powerful language models including o1, o1-mini, GPT-4o, GPT-4o mini, GPT-4 Turbo with Vision, GPT-4, GPT-3.5-Turbo, and Embeddings model series. These models can be easily adapted to your specific task including but not limited to content generation, summarization, image understanding, semantic search, and natural language to code translation. Users can access the service through REST APIs, Python SDK, or in the [Azure AI Foundry](https://ai.azure.com).
17+
Azure OpenAI Service provides REST API access to OpenAI's powerful language models including o3-mini, o1, o1-mini, GPT-4o, GPT-4o mini, GPT-4 Turbo with Vision, GPT-4, GPT-3.5-Turbo, and Embeddings model series. These models can be easily adapted to your specific task including but not limited to content generation, summarization, image understanding, semantic search, and natural language to code translation. Users can access the service through REST APIs, Python SDK, or in the [Azure AI Foundry](https://ai.azure.com).
1818

1919
### Features overview
2020

2121
| Feature | Azure OpenAI |
2222
| --- | --- |
23-
| Models available | [**o1**](./how-to/reasoning.md) - (Limited Access - [Request Access](https://aka.ms/OAI/o1access))<br>[**o1-mini**](./how-to/reasoning.md)<br>**GPT-4o & GPT-4o mini**<br> **GPT-4 series (including GPT-4 Turbo with Vision)** <br>**GPT-3.5-Turbo series**<br> Embeddings series <br> Learn more in our [Models](./concepts/models.md) page.|
23+
| Models available | [**o3-mini & o1**](./how-to/reasoning.md) - (Limited Access - [**Request Access**](https://aka.ms/OAI/o1access))<br>[**o1-mini**](./how-to/reasoning.md)<br>**GPT-4o & GPT-4o mini**<br> **GPT-4 series (including GPT-4 Turbo with Vision)** <br>**GPT-3.5-Turbo series**<br> Embeddings series <br> Learn more in our [Models](./concepts/models.md) page.|
2424
| Fine-tuning | `GPT-4o-mini` (preview) <br> `GPT-4` (preview) <br>`GPT-3.5-Turbo` (0613) <br> `babbage-002` <br> `davinci-002`.|
2525
| Price | [Available here](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) <br> For details on vision-enabled chat models, see the [special pricing information](../openai/concepts/gpt-with-vision.md#special-pricing-information).|
2626
| Virtual network support & private link support | Yes. |

articles/ai-services/openai/quotas-limits.md

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.custom:
1010
- ignite-2023
1111
- references_regions
1212
ms.topic: conceptual
13-
ms.date: 1/17/2025
13+
ms.date: 1/30/2025
1414
ms.author: mbullwin
1515
---
1616

@@ -62,25 +62,28 @@ The following sections provide you with a quick guide to the default quotas and
6262

6363
[!INCLUDE [Quota](./includes/global-batch-limits.md)]
6464

65-
## o1 & o1-mini rate limits
65+
## `o-series` rate limits
6666

6767
> [!IMPORTANT]
6868
> The ratio of RPM/TPM for quota with o1-series models works differently than older chat completions models:
6969
>
7070
> - **Older chat models:** 1 unit of capacity = 6 RPM and 1,000 TPM.
7171
> - **o1 & o1-preview:** 1 unit of capacity = 1 RPM and 6,000 TPM.
72+
> - **o3-mini:** 1 unit of capacity = 1 RPM per 10,000 TPM.
7273
> - **o1-mini:** 1 unit of capacity = 1 RPM per 10,000 TPM.
7374
>
7475
> This is particularly important for programmatic model deployment as this change in RPM/TPM ratio can result in accidental under allocation of quota if one is still assuming the 1:1000 ratio followed by older chat completion models.
7576
>
7677
> There is a known issue with the [quota/usages API](/rest/api/aiservices/accountmanagement/usages/list?view=rest-aiservices-accountmanagement-2024-06-01-preview&tabs=HTTP&preserve-view=true) where it assumes the old ratio applies to the new o1-series models. The API returns the correct base capacity number, but doesn't apply the correct ratio for the accurate calculation of TPM.
7778
78-
### o1 & o1-mini global standard
79+
### `o-series` global standard
7980

8081
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
8182
|---|---|:---:|:---:|
83+
| `o3-mini` | Enterprise agreement | 50 M | 5 K |
8284
| `o1` & `o1-preview` | Enterprise agreement | 30 M | 5 K |
8385
| `o1-mini`| Enterprise agreement | 50 M | 5 K |
86+
| `o3-mini` | Default | 5 M | 500 |
8487
| `o1` & `o1-preview` | Default | 3 M | 500 |
8588
| `o1-mini`| Default | 5 M | 500 |
8689

articles/ai-services/openai/whats-new.md

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ ms.custom:
1111
- references_regions
1212
- ignite-2024
1313
ms.topic: whats-new
14-
ms.date: 1/21/2025
14+
ms.date: 1/30/2025
1515
recommendations: false
1616
---
1717

@@ -21,6 +21,10 @@ This article provides a summary of the latest releases and major documentation u
2121

2222
## January 2025
2323

24+
### o3-mini released
25+
26+
`o3-mini` (2025-01-31) is the latest reasoning model, offering enhanced reasoning abilities. For more information, see our [reasoning model guide](./how-to/reasoning.md).
27+
2428
### GPT-4o audio completions
2529

2630
The `gpt-4o-audio-preview` model is now available for global deployments in [East US 2 and Sweden Central regions](./concepts/models.md#global-standard-model-availability). Use the `gpt-4o-audio-preview` model for audio generation.

0 commit comments

Comments
 (0)