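Remove GPT-4o mini fine-tuning references and update gpt-4o-mini quota limits

- Remove the `gpt-4o-mini` (2024-07-18) row and its public preview footnote from models.md.
- Remove `gpt-4o-mini` (2024-07-18) from the supported-model lists in the Python, REST, and Studio fine-tuning includes.
- Update the `gpt-4o-mini` tokens-per-minute and requests-per-minute limits in quotas-limits.md.
- Retitle the July 2024 entry in whats-new.md to cover deployment only and drop the fine-tuning bullet.
- Set ms.date to 07/31/2024 in quotas-limits.md and whats-new.md.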

mrbullwinkle · mrbullwinkle · commit 0ba20fa90d6a · 2024-07-30T21:39:53.000-04:00
diff --git a/articles/ai-services/openai/concepts/models.md b/articles/ai-services/openai/concepts/models.md
@@ -277,10 +277,8 @@ These models can only be used with Embedding API requests.
 | `gpt-35-turbo` (1106) | East US2 <br> North Central US <br> Sweden Central <br> Switzerland West | Input: 16,385<br> Output: 4,096 |  Sep 2021|
 | `gpt-35-turbo` (0125)  | East US2 <br> North Central US <br> Sweden Central <br> Switzerland West | 16,385 | Sep 2021 |
 | `gpt-4` (0613) <sup>**1**</sup> | North Central US <br> Sweden Central | 8192 | Sep 2021 |
-| `gpt-4o-mini` <sup>**2**</sup> (2024-07-18) | Sweden Central | Input: 128,000 <br> Output: 16,384  <br> Training example context length: 64,536 | Oct 2023 |
 
 **<sup>1</sup>** GPT-4 fine-tuning is currently in public preview. See our [GPT-4 fine-tuning safety evaluation guidance](/azure/ai-services/openai/how-to/fine-tuning?tabs=turbo%2Cpython-new&pivots=programming-language-python#safety-evaluation-gpt-4-fine-tuning---public-preview) for more information.
-**<sup>2</sup>** GPT-4o mini fine-tuning is currently in public preview.
 
 ### Whisper models
 
diff --git a/articles/ai-services/openai/includes/fine-tuning-python.md b/articles/ai-services/openai/includes/fine-tuning-python.md
@@ -31,8 +31,7 @@ The following models support fine-tuning:
 - `gpt-35-turbo` (0613)
 - `gpt-35-turbo` (1106)
 - `gpt-35-turbo` (0125)
-- `gpt-4` (0613)**<sup>*</sup>** 
-- `gpt-4o-mini` (2024-07-18)**<sup>*</sup>**
+- `gpt-4` (0613)**<sup>*</sup>**
 
 **<sup>*</sup>** Fine-tuning for this model is currently in public preview.
 
diff --git a/articles/ai-services/openai/includes/fine-tuning-rest.md b/articles/ai-services/openai/includes/fine-tuning-rest.md
@@ -31,7 +31,6 @@ The following models support fine-tuning:
 - `gpt-35-turbo` (1106)
 - `gpt-35-turbo` (0125)
 - `gpt-4` (0613)**<sup>*</sup>** 
-- `gpt-4o-mini` (2024-07-18)**<sup>*</sup>**
 
 **<sup>*</sup>** Fine-tuning for this model is currently in public preview.
 
diff --git a/articles/ai-services/openai/includes/fine-tuning-studio.md b/articles/ai-services/openai/includes/fine-tuning-studio.md
@@ -30,7 +30,6 @@ The following models support fine-tuning:
 - `gpt-35-turbo` (1106)
 - `gpt-35-turbo` (0125)
 - `gpt-4` (0613)**<sup>*</sup>** 
-- `gpt-4o-mini` (2024-07-18)**<sup>*</sup>**
 
 **<sup>*</sup>** Fine-tuning for this model is currently in public preview.
 
diff --git a/articles/ai-services/openai/quotas-limits.md b/articles/ai-services/openai/quotas-limits.md
@@ -10,7 +10,7 @@ ms.custom:
   - ignite-2023
   - references_regions
 ms.topic: conceptual
-ms.date: 07/25/2024
+ms.date: 07/31/2024
 ms.author: mbullwin
 ---
 
@@ -62,9 +62,9 @@ The following sections provide you with a quick guide to the default quotas and
 | Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
 |---|---|:---:|:---:|
 |`gpt-4o`|Enterprise agreement | 30 M | 180 K |
-|`gpt-4o-mini` | Enterprise agreement | 15 M | 90 K |
+|`gpt-4o-mini` | Enterprise agreement | 50 M | 300 K |
 |`gpt-4o` |Default | 450 K | 2.7 K |
-|`gpt-4o-mini` | Default | 250 K | 1.5 K  |
+|`gpt-4o-mini` | Default | 2 M | 12 K  |
 
 M = million | K = thousand
 
@@ -73,9 +73,9 @@ M = million | K = thousand
 | Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |
 |---|---|:---:|:---:|
 |`gpt4o`|Enterprise agreement | 1 M | 6 K |
-|`gpt-4o-mini` | Enterprise agreement | 7.5 M | 45 K |
+|`gpt-4o-mini` | Enterprise agreement | 2 M | 12 K |
 |`gpt4o`|Default | 150 K | 900 |
-|`gpt-4o-mini` | Default | 125 K | 750 |
+|`gpt-4o-mini` | Default | 450 K | 2.7 K |
 
 M = million | K = thousand
 
diff --git a/articles/ai-services/openai/whats-new.md b/articles/ai-services/openai/whats-new.md
@@ -10,7 +10,7 @@ ms.custom:
   - ignite-2023
   - references_regions
 ms.topic: whats-new
-ms.date: 07/25/2024
+ms.date: 07/31/2024
 recommendations: false
 ---
 
@@ -20,15 +20,13 @@ This article provides a summary of the latest releases and major documentation u
 
 ## July 2024
 
-### GPT-4o mini preview model available for deployment & fine-tuning (preview)
+### GPT-4o mini preview model available for deployment
 
 GPT-4o mini is the latest Azure OpenAI model first [announced on July 18, 2024](https://azure.microsoft.com/blog/openais-fastest-model-gpt-4o-mini-is-now-available-on-azure-ai/):
 
 *"GPT-4o mini allows customers to deliver stunning applications at a lower cost with blazing speed. GPT-4o mini is significantly smarter than GPT-3.5 Turbo—scoring 82% on Measuring Massive Multitask Language Understanding (MMLU) compared to 70%—and is more than 60% cheaper.1 The model delivers an expanded 128K context window and integrates the improved multilingual capabilities of GPT-4o, bringing greater quality to languages from around the world."*
 
-- The model is currently available for both [standard and global standard deployment](./how-to/deployment-types.md) in the East US and Sweden Central regions.
-
-- Fine-tuning for GPT-4o mini is in public preview and is currently available in Sweden Central.
+The model is currently available for both [standard and global standard deployment](./how-to/deployment-types.md) in the East US and Sweden Central regions.
 
 For information on model quota, consult the [quota and limits page](./quotas-limits.md) and for the latest info on model availability refer to the [models page](./concepts/models.md).