Skip to content

Commit 81a9e41

Browse files
Merge pull request #4435 from mrbullwinkle/mrb_04_30_2025_freshness_001
[Azure OpenAI] Freshness updates
2 parents c08a969 + eb4ee32 commit 81a9e41

File tree

8 files changed

+22
-22
lines changed

8 files changed

+22
-22
lines changed

articles/ai-services/openai/concepts/legacy-models.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about the deprecated models in Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 01/31/2025
7+
ms.date: 04/30/2025
88
ms.custom: references_regions, build-2023, build-2023-dataai
99
manager: nitinme
1010
author: mrbullwinkle

articles/ai-services/openai/concepts/model-versions.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about model versions in Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 01/09/2025
7+
ms.date: 04/30/2025
88
manager: nitinme
99
author: mrbullwinkle #ChrisHMSFT
1010
ms.author: mbullwin #chrhoder

articles/ai-services/openai/how-to/deployment-types.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ author: mrbullwinkle
66
manager: nitinme
77
ms.service: azure-ai-openai
88
ms.topic: how-to
9-
ms.date: 01/24/2025
9+
ms.date: 04/30/2025
1010
ms.author: mbullwin
1111
---
1212

articles/ai-services/openai/how-to/dynamic-quota.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ author: mrbullwinkle
66
manager: nitinme
77
ms.service: azure-ai-openai
88
ms.topic: how-to
9-
ms.date: 01/31/2025
9+
ms.date: 04/30/2025
1010
ms.author: mbullwin
1111
---
1212

articles/ai-services/openai/how-to/evaluations.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ manager: nitinme
66
ms.service: azure-ai-openai
77
ms.topic: how-to
88
ms.custom: references_regions
9-
ms.date: 01/29/2025
9+
ms.date: 04/30/2025
1010
author: mrbullwinkle
1111
ms.author: mbullwin
1212
recommendations: false

articles/ai-services/openai/how-to/migration-javascript.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.author: mbullwin
77
ms.service: azure-ai-openai
88
ms.custom: devx-track-python
99
ms.topic: how-to
10-
ms.date: 01/31/2025
10+
ms.date: 04/30/2025
1111
manager: nitinme
1212
---
1313

articles/ai-services/openai/how-to/quota.md

Lines changed: 15 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ author: mrbullwinkle
66
manager: nitinme
77
ms.service: azure-ai-openai
88
ms.topic: how-to
9-
ms.date: 01/09/2025
9+
ms.date: 04/30/2025
1010
ms.author: mbullwin
1111
---
1212

@@ -28,11 +28,11 @@ Quota provides the flexibility to actively manage the allocation of rate limits
2828
Azure OpenAI's quota feature enables assignment of rate limits to your deployments, up-to a global limit called your *quota*. Quota is assigned to your subscription on a per-region, per-model basis in units of **Tokens-per-Minute (TPM)**. When you onboard a subscription to Azure OpenAI, you'll receive default quota for most available models. Then, you'll assign TPM to each deployment as it is created, and the available quota for that model will be reduced by that amount. You can continue to create deployments and assign them TPM until you reach your quota limit. Once that happens, you can only create new deployments of that model by reducing the TPM assigned to other deployments of the same model (thus freeing TPM for use), or by requesting and being approved for a model quota increase in the desired region.
2929

3030
> [!NOTE]
31-
> With a quota of 240,000 TPM for GPT-35-Turbo in East US, a customer can create a single deployment of 240 K TPM, 2 deployments of 120 K TPM each, or any number of deployments in one or multiple Azure OpenAI resources as long as their TPM adds up to less than 240 K total in that region.
31+
> With a quota of 240,000 TPM for GPT-4o in East US, a customer can create a single deployment of 240 K TPM, 2 deployments of 120 K TPM each, or any number of deployments in one or multiple Azure OpenAI resources as long as their TPM adds up to less than 240 K total in that region.
3232
3333
When a deployment is created, the assigned TPM will directly map to the tokens-per-minute rate limit enforced on its inferencing requests. A **Requests-Per-Minute (RPM)** rate limit will also be enforced whose value is set proportionally to the TPM assignment using the following ratio:
3434

35-
6 RPM per 1000 TPM.
35+
6 RPM per 1000 TPM. (This ratio can vary by model for more information, see [quota, and limits](../quotas-limits.md#o-series-rate-limits).)
3636

3737
The flexibility to distribute TPM globally within a subscription and region has allowed Azure OpenAI Service to loosen other restrictions:
3838

@@ -145,10 +145,10 @@ This is only a subset of the available request body parameters. For the full lis
145145
#### Example request
146146

147147
```Bash
148-
curl -X PUT https://management.azure.com/subscriptions/00000000-0000-0000-0000-000000000000/resourceGroups/resource-group-temp/providers/Microsoft.CognitiveServices/accounts/docs-openai-test-001/deployments/gpt-35-turbo-test-deployment?api-version=2023-05-01 \
148+
curl -X PUT https://management.azure.com/subscriptions/00000000-0000-0000-0000-000000000000/resourceGroups/resource-group-temp/providers/Microsoft.CognitiveServices/accounts/docs-openai-test-001/deployments/gpt-4o-test-deployment?api-version=2023-05-01 \
149149
-H "Content-Type: application/json" \
150150
-H 'Authorization: Bearer YOUR_AUTH_TOKEN' \
151-
-d '{"sku":{"name":"Standard","capacity":10},"properties": {"model": {"format": "OpenAI","name": "gpt-35-turbo","version": "0613"}}}'
151+
-d '{"sku":{"name":"Standard","capacity":10},"properties": {"model": {"format": "OpenAI","name": "gpt-4o","version": "2024-11-20"}}}'
152152
```
153153

154154
> [!NOTE]
@@ -215,7 +215,7 @@ az login
215215
By setting sku-capacity to 10 in the command below this deployment will be set with a 10K TPM limit.
216216

217217
```azurecli
218-
az cognitiveservices account deployment create -g test-resource-group -n test-resource-name --deployment-name test-deployment-name --model-name gpt-35-turbo --model-version "0613" --model-format OpenAI --sku-capacity 10 --sku-name "Standard"
218+
az cognitiveservices account deployment create -g test-resource-group -n test-resource-name --deployment-name test-deployment-name --model-name gpt-4o --model-version "2024-11-20" --model-format OpenAI --sku-capacity 10 --sku-name "Standard"
219219
```
220220

221221
### Usage
@@ -272,8 +272,8 @@ $cognitiveServicesDeploymentParams = @{
272272
Name = 'test-deployment-name'
273273
Properties = @{
274274
Model = @{
275-
Name = 'gpt-35-turbo'
276-
Version = '0613'
275+
Name = 'gpt-4o'
276+
Version = '2024-11-20'
277277
Format = 'OpenAI'
278278
}
279279
}
@@ -301,7 +301,7 @@ Get-AzCognitiveServicesUsage -Location eastus
301301

302302
This command runs in the context of the currently active subscription for Azure PowerShell. Use `Set-AzContext` to [modify the active subscription](/powershell/azure/manage-subscriptions-azureps#change-the-active-subscription).
303303

304-
For more details on `New-AzCognitiveServicesAccountDeployment` and `Get-AzCognitiveServicesUsage`, consult the [Azure PowerShell reference documentation](/powershell/module/az.cognitiveservices/).
304+
For more information on `New-AzCognitiveServicesAccountDeployment` and `Get-AzCognitiveServicesUsage`, see [Azure PowerShell reference documentation](/powershell/module/az.cognitiveservices/).
305305

306306
# [Azure Resource Manager](#tab/arm)
307307

@@ -324,8 +324,8 @@ For more details on `New-AzCognitiveServicesAccountDeployment` and `Get-AzCognit
324324
"properties": {
325325
"model": {
326326
"format": "OpenAI",
327-
"name": "gpt-35-turbo",
328-
"version": "0613" // Version 0613 of gpt-35-turbo will be used
327+
"name": "gpt-4o",
328+
"version": "2024-11-20"
329329
}
330330
}
331331
}
@@ -350,8 +350,8 @@ resource arm_je_std_deployment 'Microsoft.CognitiveServices/accounts/deployments
350350
properties: {
351351
model: {
352352
format: 'OpenAI'
353-
name: 'gpt-35-turbo'
354-
version: '0613' // gpt-35-turbo version 0613 will be used
353+
name: 'gpt-4o'
354+
version: '2024-11-20'
355355
}
356356
}
357357
}
@@ -425,8 +425,8 @@ resource "azapi_resource" "TERRAFORM-AOAI-STD-DEPLOYMENT" {
425425
properties = {
426426
model = {
427427
format = "OpenAI",
428-
name = "gpt-35-turbo",
429-
version = "0613" # Deploy gpt-35-turbo version 0613
428+
name = "gpt-4o",
429+
version = "2024-11-20"
430430
}
431431
}
432432
})

articles/ai-services/openai/reference.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: Learn how to use Azure OpenAI's REST API. In this article, you lear
55
manager: nitinme
66
ms.service: azure-ai-openai
77
ms.topic: conceptual
8-
ms.date: 01/29/2025
8+
ms.date: 04/30/2025
99
author: mrbullwinkle
1010
ms.author: mbullwin
1111
recommendations: false

0 commit comments

Comments
 (0)