Skip to content

Commit 2be1827

Browse files
authored
Merge pull request #591 from MicrosoftDocs/release-2024-openai-sep-o1
Release 2024 openai sep o1--10/01 2:00 PM PT
2 parents 1e38c25 + 1cb8d66 commit 2be1827

File tree

5 files changed

+75
-15
lines changed

5 files changed

+75
-15
lines changed

articles/ai-services/openai/concepts/content-filter.md

Lines changed: 20 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -84,6 +84,25 @@ Detecting indirect attacks requires using document delimiters when constructing
8484
[!INCLUDE [content-filter-configurability](../includes/content-filter-configurability.md)]
8585

8686

87+
<sup>1</sup> For Azure OpenAI models, only customers who have been approved for modified content filtering have full content filtering control and can turn off content filters. Apply for modified content filters via this form: [Azure OpenAI Limited Access Review: Modified Content Filters](https://ncv.microsoft.com/uEfCgnITdR) For Azure Government customers, please apply for modified content filters via this form: [Azure Government - Request Modified Content Filtering for Azure OpenAI Service](https://aka.ms/AOAIGovModifyContentFilter).
88+
89+
Configurable content filters for inputs (prompts) and outputs (completions) are available for the following Azure OpenAI models:
90+
91+
* GPT model series
92+
* GPT-4 Turbo Vision GA<sup>*</sup> (turbo-2024-04-09)
93+
* GPT-4o
94+
* GPT-4o mini
95+
* DALL-E 2 and 3
96+
97+
Configurable content filters are currently not available for
98+
99+
`o1-preview` and `o1-mini`
100+
101+
<sup>*</sup>Only available for GPT-4 Turbo Vision GA, does not apply to GPT-4 Turbo Vision preview
102+
103+
Content filtering configurations are created within a Resource in Azure AI Studio, and can be associated with Deployments. [Learn more about configurability here](../how-to/content-filters.md).
104+
105+
Customers are responsible for ensuring that applications integrating Azure OpenAI comply with the [Code of Conduct](/legal/cognitive-services/openai/code-of-conduct?context=%2Fazure%2Fai-services%2Fopenai%2Fcontext%2Fcontext).
87106

88107
## Scenario details
89108

@@ -1012,4 +1031,4 @@ As part of your application design, consider the following best practices to del
10121031
- Apply for modified content filters via [this form](https://ncv.microsoft.com/uEfCgnITdR).
10131032
- Azure OpenAI content filtering is powered by [Azure AI Content Safety](https://azure.microsoft.com/products/cognitive-services/ai-content-safety).
10141033
- Learn more about understanding and mitigating risks associated with your application: [Overview of Responsible AI practices for Azure OpenAI models](/legal/cognitive-services/openai/overview?context=/azure/ai-services/openai/context/context).
1015-
- Learn more about how data is processed in connection with content filtering and abuse monitoring: [Data, privacy, and security for Azure OpenAI Service](/legal/cognitive-services/openai/data-privacy?context=/azure/ai-services/openai/context/context#preventing-abuse-and-harmful-content-generation).
1034+
- Learn more about how data is processed in connection with content filtering and abuse monitoring: [Data, privacy, and security for Azure OpenAI Service](/legal/cognitive-services/openai/data-privacy?context=/azure/ai-services/openai/context/context#preventing-abuse-and-harmful-content-generation).

articles/ai-services/openai/concepts/models.md

Lines changed: 11 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the different model capabilities that are available with Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 09/30/2024
7+
ms.date: 10/01/2024
88
ms.custom: references_regions, build-2023, build-2023-dataai, refefences_regions
99
manager: nitinme
1010
author: mrbullwinkle #ChrisHMSFT
@@ -33,16 +33,17 @@ The Azure OpenAI `o1-preview` and `o1-mini` models are specifically designed to
3333

3434
### Availability
3535

36-
The `o1-preview` and `o1-mini` models are available in the East US2 region for limited access through the [AI Studio](https://ai.azure.com) early access playground. Data processing for the `o1` models may occur in a different region than where they are available for use.
37-
38-
To try the `o1-preview` and `o1-mini` models in the early access playground, **registration is required, and access will be granted based on Microsoft’s eligibility criteria**.
36+
The `o1-preview` and `o1-mini` models are now available for API access and model deployment. **Registration is required, and access will be granted based on Microsoft's eligibility criteria**.
3937

4038
Request access: [limited access model application](https://aka.ms/oai/modelaccess)
4139

42-
Once access has been granted, you will need to:
40+
Once access has been granted, you will need to create a deployment for each model.
41+
42+
### API support
43+
44+
Support for the **o1 series** models was added in API version `2024-09-01-preview`.
4345

44-
1. Navigate to https://ai.azure.com/resources and select a resource in the `eastus2` region. If you do not have an Azure OpenAI resource in this region you will need to [create one](https://portal.azure.com/#create/Microsoft.CognitiveServicesOpenAI).
45-
2. Once the `eastus2` Azure OpenAI resource is selected, in the upper left-hand panel under **Playgrounds** select **Early access playground (preview)**.
46+
The `max_tokens` parameter has been deprecated and replaced with the new `max_completion_tokens` parameter. **o1 series** models will only work with the `max_completions_tokens` parameter.
4647

4748
## GPT-4o audio
4849

@@ -98,6 +99,8 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
9899

99100
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
100101
| --- | :--- |:--- |:---: |
102+
|`o1-preview` (2024-09-12) | The most capable model in the o1 series, offering enhanced reasoning abilities.| Input: 128,000 <br> Output: 32,768 | Oct 2023 |
103+
| `o1-mini` (2024-09-12) | A faster and more cost-efficient option in the o1 series, ideal for coding tasks requiring speed and lower resource consumption.| Input: 128,000 <br> Output: 65,536 | Oct 2023 |
101104
|`gpt-4o` (2024-08-06) <br> **GPT-4o (Omni)** | **Latest large GA model** <br> - Structured outputs<br> - Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 16,384 | Oct 2023 |
102105
|`gpt-4o-mini` (2024-07-18) <br> **GPT-4o mini** | **Latest small GA model** <br> - Fast, inexpensive, capable model ideal for replacing GPT-3.5 Turbo series models. <br> - Text, image processing <br>- JSON Mode <br> - parallel function calling | Input: 128,000 <br> Output: 16,384 | Oct 2023 |
103106
|`gpt-4o` (2024-05-13) <br> **GPT-4o (Omni)** | Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000 <br> Output: 4,096| Oct 2023 |
@@ -421,4 +424,4 @@ For the latest information on model retirements, refer to the [model retirement
421424
- [Model retirement and deprecation](./model-retirements.md)
422425
- [Learn more about working with Azure OpenAI models](../how-to/working-with-models.md)
423426
- [Learn more about Azure OpenAI](../overview.md)
424-
- [Learn more about fine-tuning Azure OpenAI models](../how-to/fine-tuning.md)
427+
- [Learn more about fine-tuning Azure OpenAI models](../how-to/fine-tuning.md)

articles/ai-services/openai/overview.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -14,13 +14,13 @@ recommendations: false
1414

1515
# What is Azure OpenAI Service?
1616

17-
Azure OpenAI Service provides REST API access to OpenAI's powerful language models including GPT-4o, GPT-4o mini, GPT-4 Turbo with Vision, GPT-4, GPT-3.5-Turbo, and Embeddings model series. These models can be easily adapted to your specific task including but not limited to content generation, summarization, image understanding, semantic search, and natural language to code translation. Users can access the service through REST APIs, Python SDK, or our web-based interface in the Azure OpenAI Studio.
17+
Azure OpenAI Service provides REST API access to OpenAI's powerful language models including o1-preview, o1-mini, GPT-4o, GPT-4o mini, GPT-4 Turbo with Vision, GPT-4, GPT-3.5-Turbo, and Embeddings model series. These models can be easily adapted to your specific task including but not limited to content generation, summarization, image understanding, semantic search, and natural language to code translation. Users can access the service through REST APIs, Python SDK, or our web-based interface in the Azure OpenAI Studio.
1818

1919
### Features overview
2020

2121
| Feature | Azure OpenAI |
2222
| --- | --- |
23-
| Models available | **GPT-4o & GPT-4o mini**<br> **GPT-4 series (including GPT-4 Turbo with Vision)** <br>**GPT-3.5-Turbo series**<br> Embeddings series <br> Learn more in our [Models](./concepts/models.md) page.|
23+
| Models available | **o1-preview** & **o1-mini** - (Limited Access - [Request Access](https://aka.ms/oai/modelaccess))<br>**GPT-4o & GPT-4o mini**<br> **GPT-4 series (including GPT-4 Turbo with Vision)** <br>**GPT-3.5-Turbo series**<br> Embeddings series <br> Learn more in our [Models](./concepts/models.md) page.|
2424
| Fine-tuning | `GPT-4o-mini` (preview) <br> `GPT-4` (preview) <br>`GPT-3.5-Turbo` (0613) <br> `babbage-002` <br> `davinci-002`.|
2525
| Price | [Available here](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) <br> For details on GPT-4 Turbo with Vision, see the [special pricing information](../openai/concepts/gpt-with-vision.md#special-pricing-information).|
2626
| Virtual network support & private link support | Yes, unless using [Azure OpenAI on your data](./concepts/use-your-data.md). |
@@ -125,4 +125,4 @@ Learn more about each model on our [models concept page](./concepts/models.md).
125125

126126
## Next steps
127127

128-
Learn more about the [underlying models that power Azure OpenAI](./concepts/models.md).
128+
Learn more about the [underlying models that power Azure OpenAI](./concepts/models.md).

articles/ai-services/openai/quotas-limits.md

Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -60,6 +60,26 @@ The following sections provide you with a quick guide to the default quotas and
6060

6161
[!INCLUDE [Quota](./includes/global-batch-limits.md)]
6262

63+
## o1-preview & o1-mini rate limits
64+
65+
### o1-preview & o1-mini global standard
66+
67+
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
68+
|---|---|:---:|:---:|
69+
| `o1-preview` | Enterprise agreement | 15 M | 2.5 K |
70+
| `o1-mini`| Enterprise agreement | 50 M | 5 K |
71+
| `o1-preview` | Default | 1.5 M | 250 |
72+
| `o1-mini`| Default | 20 M | 2 K |
73+
74+
### o1-preview & o1-mini standard
75+
76+
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
77+
|---|---|:---:|:---:|
78+
| `o1-preview` | Enterprise agreement | 300 K | 50 |
79+
| `o1-mini`| Enterprise agreement | 1 M | 100 |
80+
| `o1-preview` | Default | 300 K | 50 |
81+
| `o1-mini`| Default | 1 M | 100 |
82+
6383
## gpt-4o & GPT-4 Turbo rate limits
6484

6585
`gpt-4o` and `gpt-4o-mini`, and `gpt-4` (`turbo-2024-04-09`) have rate limit tiers with higher limits for certain customer types.

articles/ai-services/openai/whats-new.md

Lines changed: 21 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.custom:
1010
- ignite-2023
1111
- references_regions
1212
ms.topic: whats-new
13-
ms.date: 9/19/2024
13+
ms.date: 10/01/2024
1414
recommendations: false
1515
---
1616

@@ -20,6 +20,24 @@ This article provides a summary of the latest releases and major documentation u
2020

2121
## October 2024
2222

23+
### o1-preview and o1-mini models limited access
24+
25+
The `o1-preview` and `o1-mini` models are now available for API access and model deployment. **Registration is required, and access will be granted based on Microsoft's eligibility criteria**.
26+
27+
Request access: [limited access model application](https://aka.ms/oai/modelaccess)
28+
29+
Customers who were already approved and have access to the model through the early access playground do not need to apply again, you will automatically be granted API access. Once access has been granted, you will need to create a deployment for each model.
30+
31+
**API support:**
32+
33+
Support for the **o1 series** models was added in API version `2024-09-01-preview`.
34+
35+
The `max_tokens` parameter has been deprecated and replaced with the new `max_completion_tokens` parameter. **o1 series** models will only work with the `max_completions_tokens` parameter.
36+
37+
**Region availability**:
38+
39+
Models are available for standard and global standard deployment in East US2 and Sweden Central for approved customers.
40+
2341
### New GPT-4o real-time audio public preview
2442

2543
Azure OpenAI GPT-4o audio is part of the GPT-4o model family that supports low-latency, "speech in, speech out" conversational interactions. The GPT-4o audio `realtime` API is designed to handle real-time, low-latency conversational interactions, making it a great fit for use cases involving live interactions between a user and a model, such as customer support agents, voice assistants, and real-time translators.
@@ -674,7 +692,7 @@ If you are currently using the `2023-03-15-preview` API, we recommend migrating
674692
- **GPT-35-Turbo preview**. To learn more checkout the [how-to article](./how-to/chatgpt.md).
675693

676694
- Increased training limits for fine-tuning: The max training job size (tokens in training file) x (# of epochs) is 2 Billion tokens for all models. We have also increased the max training job from 120 to 720 hours.
677-
- Adding additional use cases to your existing access.  Previously, the process for adding new use cases required customers to reapply to the service. Now, we're releasing a new process that allows you to quickly add new use cases to your use of the service. This process follows the established Limited Access process within Azure AI services. [Existing customers can attest to any and all new use cases here](https://customervoice.microsoft.com/Pages/ResponsePage.aspx?id=v4j5cvGGr0GRqy180BHbR7en2Ais5pxKtso_Pz4b1_xUM003VEJPRjRSOTZBRVZBV1E5N1lWMk1XUyQlQCN0PWcu). Please note that this is required anytime you would like to use the service for a new use case you did not originally apply for.
695+
- Adding additional use cases to your existing access. Previously, the process for adding new use cases required customers to reapply to the service. Now, we're releasing a new process that allows you to quickly add new use cases to your use of the service. This process follows the established Limited Access process within Azure AI services. [Existing customers can attest to any and all new use cases here](https://customervoice.microsoft.com/Pages/ResponsePage.aspx?id=v4j5cvGGr0GRqy180BHbR7en2Ais5pxKtso_Pz4b1_xUM003VEJPRjRSOTZBRVZBV1E5N1lWMk1XUyQlQCN0PWcu). Please note that this is required anytime you would like to use the service for a new use case you did not originally apply for.
678696

679697
## February 2023
680698

@@ -774,4 +792,4 @@ New training course:
774792

775793
## Next steps
776794

777-
Learn more about the [underlying models that power Azure OpenAI](./concepts/models.md).
795+
Learn more about the [underlying models that power Azure OpenAI](./concepts/models.md).

0 commit comments

Comments
 (0)