Skip to content

Commit 0ba20fa

Browse files
committed
update
1 parent a66d631 commit 0ba20fa

File tree

6 files changed

+9
-16
lines changed

6 files changed

+9
-16
lines changed

articles/ai-services/openai/concepts/models.md

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -277,10 +277,8 @@ These models can only be used with Embedding API requests.
277277
| `gpt-35-turbo` (1106) | East US2 <br> North Central US <br> Sweden Central <br> Switzerland West | Input: 16,385<br> Output: 4,096 | Sep 2021|
278278
| `gpt-35-turbo` (0125) | East US2 <br> North Central US <br> Sweden Central <br> Switzerland West | 16,385 | Sep 2021 |
279279
| `gpt-4` (0613) <sup>**1**</sup> | North Central US <br> Sweden Central | 8192 | Sep 2021 |
280-
| `gpt-4o-mini` <sup>**2**</sup> (2024-07-18) | Sweden Central | Input: 128,000 <br> Output: 16,384 <br> Training example context length: 64,536 | Oct 2023 |
281280

282281
**<sup>1</sup>** GPT-4 fine-tuning is currently in public preview. See our [GPT-4 fine-tuning safety evaluation guidance](/azure/ai-services/openai/how-to/fine-tuning?tabs=turbo%2Cpython-new&pivots=programming-language-python#safety-evaluation-gpt-4-fine-tuning---public-preview) for more information.
283-
**<sup>2</sup>** GPT-4o mini fine-tuning is currently in public preview.
284282

285283
### Whisper models
286284

articles/ai-services/openai/includes/fine-tuning-python.md

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -31,8 +31,7 @@ The following models support fine-tuning:
3131
- `gpt-35-turbo` (0613)
3232
- `gpt-35-turbo` (1106)
3333
- `gpt-35-turbo` (0125)
34-
- `gpt-4` (0613)**<sup>*</sup>**
35-
- `gpt-4o-mini` (2024-07-18)**<sup>*</sup>**
34+
- `gpt-4` (0613)**<sup>*</sup>**
3635

3736
**<sup>*</sup>** Fine-tuning for this model is currently in public preview.
3837

articles/ai-services/openai/includes/fine-tuning-rest.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,6 @@ The following models support fine-tuning:
3131
- `gpt-35-turbo` (1106)
3232
- `gpt-35-turbo` (0125)
3333
- `gpt-4` (0613)**<sup>*</sup>**
34-
- `gpt-4o-mini` (2024-07-18)**<sup>*</sup>**
3534

3635
**<sup>*</sup>** Fine-tuning for this model is currently in public preview.
3736

articles/ai-services/openai/includes/fine-tuning-studio.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -30,7 +30,6 @@ The following models support fine-tuning:
3030
- `gpt-35-turbo` (1106)
3131
- `gpt-35-turbo` (0125)
3232
- `gpt-4` (0613)**<sup>*</sup>**
33-
- `gpt-4o-mini` (2024-07-18)**<sup>*</sup>**
3433

3534
**<sup>*</sup>** Fine-tuning for this model is currently in public preview.
3635

articles/ai-services/openai/quotas-limits.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.custom:
1010
- ignite-2023
1111
- references_regions
1212
ms.topic: conceptual
13-
ms.date: 07/25/2024
13+
ms.date: 07/31/2024
1414
ms.author: mbullwin
1515
---
1616

@@ -62,9 +62,9 @@ The following sections provide you with a quick guide to the default quotas and
6262
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
6363
|---|---|:---:|:---:|
6464
|`gpt-4o`|Enterprise agreement | 30 M | 180 K |
65-
|`gpt-4o-mini` | Enterprise agreement | 15 M | 90 K |
65+
|`gpt-4o-mini` | Enterprise agreement | 50 M | 300 K |
6666
|`gpt-4o` |Default | 450 K | 2.7 K |
67-
|`gpt-4o-mini` | Default | 250 K | 1.5 K |
67+
|`gpt-4o-mini` | Default | 2 M | 12 K |
6868

6969
M = million | K = thousand
7070

@@ -73,9 +73,9 @@ M = million | K = thousand
7373
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
7474
|---|---|:---:|:---:|
7575
|`gpt4o`|Enterprise agreement | 1 M | 6 K |
76-
|`gpt-4o-mini` | Enterprise agreement | 7.5 M | 45 K |
76+
|`gpt-4o-mini` | Enterprise agreement | 2 M | 12 K |
7777
|`gpt4o`|Default | 150 K | 900 |
78-
|`gpt-4o-mini` | Default | 125 K | 750 |
78+
|`gpt-4o-mini` | Default | 450 K | 2.7 K |
7979

8080
M = million | K = thousand
8181

articles/ai-services/openai/whats-new.md

Lines changed: 3 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.custom:
1010
- ignite-2023
1111
- references_regions
1212
ms.topic: whats-new
13-
ms.date: 07/25/2024
13+
ms.date: 07/31/2024
1414
recommendations: false
1515
---
1616

@@ -20,15 +20,13 @@ This article provides a summary of the latest releases and major documentation u
2020

2121
## July 2024
2222

23-
### GPT-4o mini preview model available for deployment & fine-tuning (preview)
23+
### GPT-4o mini preview model available for deployment
2424

2525
GPT-4o mini is the latest Azure OpenAI model first [announced on July 18, 2024](https://azure.microsoft.com/blog/openais-fastest-model-gpt-4o-mini-is-now-available-on-azure-ai/):
2626

2727
*"GPT-4o mini allows customers to deliver stunning applications at a lower cost with blazing speed. GPT-4o mini is significantly smarter than GPT-3.5 Turbo—scoring 82% on Measuring Massive Multitask Language Understanding (MMLU) compared to 70%—and is more than 60% cheaper.1 The model delivers an expanded 128K context window and integrates the improved multilingual capabilities of GPT-4o, bringing greater quality to languages from around the world."*
2828

29-
- The model is currently available for both [standard and global standard deployment](./how-to/deployment-types.md) in the East US and Sweden Central regions.
30-
31-
- Fine-tuning for GPT-4o mini is in public preview and is currently available in Sweden Central.
29+
The model is currently available for both [standard and global standard deployment](./how-to/deployment-types.md) in the East US and Sweden Central regions.
3230

3331
For information on model quota, consult the [quota and limits page](./quotas-limits.md) and for the latest info on model availability refer to the [models page](./concepts/models.md).
3432

0 commit comments

Comments
 (0)