MicrosoftDocs
diff --git a/‎articles/ai-services/openai/api-version-deprecation.md
Lines changed: 3 additions & 3 deletions b/‎articles/ai-services/openai/api-version-deprecation.md
Lines changed: 3 additions & 3 deletions
diff --git a/‎articles/ai-services/openai/concepts/model-retirements.md
Lines changed: 4 additions & 4 deletions b/‎articles/ai-services/openai/concepts/model-retirements.md
Lines changed: 4 additions & 4 deletions
diff --git a/‎articles/ai-services/openai/concepts/models.md
Lines changed: 1 addition & 1 deletion b/‎articles/ai-services/openai/concepts/models.md
Lines changed: 1 addition & 1 deletion
diff --git a/‎articles/ai-services/openai/concepts/provisioned-throughput.md
Lines changed: 2 additions & 3 deletions b/‎articles/ai-services/openai/concepts/provisioned-throughput.md
Lines changed: 2 additions & 3 deletions
diff --git a/‎articles/ai-services/openai/includes/gpt-4-turbo.md
Lines changed: 1 addition & 1 deletion b/‎articles/ai-services/openai/includes/gpt-4-turbo.md
Lines changed: 1 addition & 1 deletion
diff --git a/‎articles/ai-services/openai/includes/model-matrix/quota.md
Lines changed: 1 addition & 1 deletion b/‎articles/ai-services/openai/includes/model-matrix/quota.md
Lines changed: 1 addition & 1 deletion
diff --git a/‎articles/ai-services/openai/includes/model-matrix/standard-gpt-4.md
Lines changed: 18 additions & 16 deletions b/‎articles/ai-services/openai/includes/model-matrix/standard-gpt-4.md
Lines changed: 18 additions & 16 deletions
@@ -5,7 +5,7 @@ services: cognitive-services
 manager: nitinme
 ms.service: azure-ai-openai
 ms.topic: conceptual 
-ms.date: 03/28/2024
+ms.date: 05/02/2024
 author: mrbullwinkle
 ms.author: mbullwin
 recommendations: false
@@ -14,14 +14,14 @@ ms.custom:
 
 # Azure OpenAI API preview lifecycle
 
-This article is to help you understand the support lifecycle for the Azure OpenAI API previews. New preview APIs target a monthly release cadence. After July 1, 2024, the latest three preview APIs will remain supported while older APIs will no longer be supported unless support is explictly indicated.
+This article is to help you understand the support lifecycle for the Azure OpenAI API previews. New preview APIs target a monthly release cadence. After July 1, 2024, the latest three preview APIs will remain supported while older APIs will no longer be supported unless support is explicitly indicated.
 
 > [!NOTE]
 > The `2023-06-01-preview` API will remain supported at this time, as `DALL-E 2` is only available in this API version. `DALL-E 3` is supported in the latest API releases. The `2023-10-01-preview` API will also remain supported at this time.
 
 ## Latest preview API release
 
-Azure OpenAI API version [2024-03-01-preview](https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/data-plane/AzureOpenAI/inference/preview/2024-03-01-preview/inference.json)
+Azure OpenAI API version [2024-04-01-preview](https://github.com/Azure/azure-rest-api-specs/blob/main/specification/cognitiveservices/data-plane/AzureOpenAI/inference/preview/2024-04-01-preview/inference.json)
 is currently the latest preview release.
 
 This version contains support for all the latest Azure OpenAI features including:
 
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
 description: Learn about the model deprecations and retirements in Azure OpenAI.
 ms.service: azure-ai-openai
 ms.topic: conceptual
-ms.date: 04/24/2024
+ms.date: 05/01/2024
 ms.custom: 
 manager: nitinme
 author: mrbullwinkle
@@ -66,9 +66,9 @@ These models are currently available for use in Azure OpenAI Service.
 | `gpt-35-turbo` | 0125 | No earlier than Feb 22, 2025 |
 | `gpt-4`<br>`gpt-4-32k` | 0314 | No earlier than July 13, 2024 |
 | `gpt-4`<br>`gpt-4-32k` | 0613 | No earlier than Sep 30, 2024 |
-| `gpt-4` | 1106-preview | To be upgraded to `gpt-4` Version: `2024-04-09`, starting on June 10, 2024, or later **<sup>1</sup>** |
-| `gpt-4` | 0125-preview |To be upgraded to `gpt-4` Version: `2024-04-09`, starting on June 10, 2024, or later  **<sup>1</sup>**  |
-| `gpt-4` | vision-preview | To be upgraded to `gpt-4` Version: `2024-04-09`, starting on June 10, 2024, or later  **<sup>1</sup>** |
+| `gpt-4` | 1106-preview | To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on June 10, 2024, or later **<sup>1</sup>** |
+| `gpt-4` | 0125-preview |To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on June 10, 2024, or later  **<sup>1</sup>**  |
+| `gpt-4` | vision-preview | To be upgraded to `gpt-4` Version: `turbo-2024-04-09`, starting on June 10, 2024, or later  **<sup>1</sup>** |
 | `gpt-3.5-turbo-instruct` | 0914 | No earlier than Sep 14, 2025 |
 | `text-embedding-ada-002` | 2 | No earlier than April 3, 2025 |
 | `text-embedding-ada-002` | 1 | No earlier than April 3, 2025 |
 
@@ -74,7 +74,7 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
 
 > [!IMPORTANT]
 >
-> - `gpt-4` versions 1106-Preview and 0125-Preview will be upgraded with a stable version of `gpt-4` in the future. The deployment upgrade of `gpt-4` 1106-Preview to `gpt-4` 0125-Preview scheduled for March 8, 2024 is no longer taking place.  Deployments of `gpt-4` versions 1106-Preview and 0125-Preview set to "Auto-update to default" and "Upgrade when expired" will start to be upgraded after the stable version is released.  For each deployment, a model version upgrade takes place with no interruption in service for API calls.  Upgrades are staged by region and the full upgrade process is expected to take 2 weeks. Deployments of `gpt-4` versions 1106-Preview and 0125-Preview set to "No autoupgrade" will not be upgraded and will stop operating when the preview version is upgraded in the region.
+> - `gpt-4` versions 1106-Preview and 0125-Preview will be upgraded with a stable version of `gpt-4` in the future. Deployments of `gpt-4` versions 1106-Preview and 0125-Preview set to "Auto-update to default" and "Upgrade when expired" will start to be upgraded after the stable version is released.  For each deployment, a model version upgrade takes place with no interruption in service for API calls.  Upgrades are staged by region and the full upgrade process is expected to take 2 weeks. Deployments of `gpt-4` versions 1106-Preview and 0125-Preview set to "No autoupgrade" will not be upgraded and will stop operating when the preview version is upgraded in the region. See [Azure OpenAI model retirements and deprecations](./model-retirements.md) for more information on the timing of the upgrade.
 
 ## GPT-3.5
 
 
@@ -68,13 +68,12 @@ az cognitiveservices account deployment create \
 
 ### Quota
 
-Provisioned throughput quota represents a specific amount of total throughput you can deploy. Quota in the Azure OpenAI Service is managed at the subscription level. All Azure OpenAI resources within the subscription share this quota. 
+Provisioned throughput quota represents a specific amount of total throughput you can deploy. Quota in the Azure OpenAI Service is managed at the subscription level. All Azure OpenAI resources within the subscription share this quota.
 
-Quota is specified in Provisioned throughput units and is specific to a (deployment type, model, region) triplet. Quota isn't interchangeable. Meaning you can't use quota for GPT-4 to deploy GPT-35-turbo. You can raise a support request to move quota across deployment types, models, or regions but the swap isn't guaranteed.
+Quota is specified in Provisioned throughput units and is specific to a (deployment type, model, region) triplet. Quota isn't interchangeable. Meaning you can't use quota for GPT-4 to deploy GPT-3.5-Turbo.
 
 While we make every attempt to ensure that quota is deployable, quota doesn't represent a guarantee that the underlying capacity is available. The service assigns capacity during the deployment operation and if capacity is unavailable the deployment fails with an out of capacity error.
 
-
 ### Determining the number of PTUs needed for a workload
 
 PTUs represent an amount of model processing capacity. Similar to your computer or databases, different workloads or requests to the model will consume different amounts of underlying processing capacity. The conversion from call shape characteristics (prompt size, generation size and call rate) to PTUs is complex and non-linear. To simplify this process, you can use the [Azure OpenAI Capacity calculator](https://oai.azure.com/portal/calculator) to size specific workload shapes. 
 
@@ -21,7 +21,7 @@ This is the replacement for the following preview models:
 ### Differences between OpenAI and Azure OpenAI GPT-4 Turbo with Vision GA model
 
 - OpenAI's version of the latest `0409` turbo model supports JSON mode and function calling for all inference requests.
-- Azure OpenAI's version of the latest `turbo-2024-04-09` currently doesn't support the use of JSON mode and function calling when making inference requests with image (vision) input. Text based input requests do support JSON mode and function calling.
+- Azure OpenAI's version of the latest `turbo-2024-04-09` currently doesn't support the use of JSON mode and function calling when making inference requests with image (vision) input. Text based input requests (requests without `image_url` and inline images) do support JSON mode and function calling.
 
 ### Differences from gpt-4 vision-preview
 
 
@@ -32,6 +32,6 @@ Quota for standard deployments is described in of terms of [Tokens-Per-Minute (T
 | uksouth          | -       | -           | 80 K          | -               | 240 K          | -                       | 350 K                    | -                        | -                        | -             | -                        | -             | -                        | -                         | -                              | -                              |
 | westeurope       | -       | -           | -             | -               | 240 K          | -                       | 240 K                    | -                        | -                        | -             | -                        | -             | -                        | -                         | -                              | -                              |
 | westus           | -       | -           | 80 K          | 30 K            | 300 K          | -                       | 350 K                    | -                        | -                        | -             | -                        | -             | -                        | -                         | -                              | -                              |
-| westus3          | -       | -           | -             | -               | -              | -                       | 350 K                    | -                        | -                        | -             | -                        | -             | -                        | -                         | -                              | -                              |
+| westus3          | -       | -           | 80 K          | -               | -              | -                       | 350 K                    | -                        | -                        | -             | -                        | -             | -                        | -                         | -                              | -                              |
 
 1 K = 1000 Tokens-Per-Minute (TPM). The relationship between TPM and Requests Per Minute (RPM) is [currently defined as 6 RPM per 1000 TPM](../../how-to/quota.md#understanding-rate-limits).
@@ -8,19 +8,21 @@ ms.topic: include
 ms.date: 04/29/2024
 ---
 
-| **Region**   | **gpt-4**, **0613**   | **gpt-4**, **1106-Preview**   | **gpt-4**, **0125-Preview**   |**gpt-4**, **vision-preview**  | **gpt-4**, **turbo-2024-04-09**  | **gpt-4-32k**, **0613**   |
-|:-----------------|:-------------------:|:---------------------------:|:---------------------------:|:-----------------------------:|:-----------------------------:|:-----------------------:|
-| australiaeast    | ✅                | ✅                           | -                           | ✅                           | -                             | ✅                    |
-| canadaeast       | ✅                | ✅                           | -                           | -                            | -                             | ✅                    |
-| eastus           | -                 | -                             | ✅                         | -                            | -                            | -                   |
-| eastus2          | -                 | ✅                            | -                          | -                            | ✅                             | -                   |
-| francecentral    | ✅                | ✅                           | -                           | -                           | -                               | ✅                    |
-| japaneast        | -                 | -                             | -                           | ✅                          | -                               | -                   |
-| northcentralus   | -                 | -                             | ✅                         | -                            | -                               | -                   |
-| norwayeast       | -                 | ✅                            | -                          | -                            | -                               | -                   |
-| southcentralus   | -                 | -                             | ✅                         | -                            | -                               | -                   |
-| southindia       | -                 | ✅                            | -                          | -                            | -                               | -                   |
-| swedencentral    | ✅                | ✅                           | -                           | ✅                          | ✅                              | ✅                    |
-| switzerlandnorth | ✅                | -                             | -                          | ✅                           | -                               | ✅                    |
-| uksouth          | -                 | ✅                            | ✅                        | -                             | -                               | -                   |
-| westus           | -                | ✅                            | -                          | ✅                            | -                               | -                   |
+| **Region**   | **gpt-4**, **0613**   | **gpt-4**, **1106-Preview**   | **gpt-4**, **0125-Preview**   | **gpt-4**, **vision-preview**   | **gpt-4**, **turbo-2024-04-09**   | **gpt-4-32k**, **0613**   |
+|:-----------------|:-------------------:|:---------------------------:|:---------------------------:|:-----------------------------:|:-------------------------------:|:-----------------------:|
+| australiaeast    | ✅                | ✅                        | -                       | ✅                          | -                           | ✅                    |
+| canadaeast       | ✅                | ✅                        | -                       | -                         | -                           | ✅                    |
+| eastus           | -               | -                       | ✅                        | -                         | -                           | -                   |
+| eastus2          | -               | ✅                        | -                       | -                         | ✅                            | -                   |
+| francecentral    | ✅                | ✅                        | -                       | -                         | -                           | ✅                    |
+| japaneast        | -               | -                       | -                       | ✅                          | -                           | -                   |
+| northcentralus   | -               | -                       | ✅                        | -                         | -                           | -                   |
+| norwayeast       | -               | ✅                        | -                       | -                         | -                           | -                   |
+| southcentralus   | -               | -                       | ✅                        | -                         | -                           | -                   |
+| southindia       | -               | ✅                        | -                       | -                         | -                           | -                   |
+| swedencentral    | ✅                | ✅                        | -                       | ✅                          | ✅                            | ✅                    |
+| switzerlandnorth | ✅                | -                       | -                       | ✅                          | -                           | ✅                    |
+| uksouth          | -               | ✅                        | ✅                        | -                         | -                           | -                   |
+| westus           | -               | ✅                        | -                       | ✅                          | -                           | -                   |
+| westus3          | -               | ✅                        | -                       | -                         | -                           | -                   |
+
Original file line number	Diff line number	Diff line change
`@@ -74,7 +74,7 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope`
`74`	`74`
`75`	`75`	`> [!IMPORTANT]`
`76`	`76`	`>`
`77`		-> - `gpt-4` versions 1106-Preview and 0125-Preview will be upgraded with a stable version of `gpt-4` in the future. The deployment upgrade of `gpt-4` 1106-Preview to `gpt-4` 0125-Preview scheduled for March 8, 2024 is no longer taking place. Deployments of `gpt-4` versions 1106-Preview and 0125-Preview set to "Auto-update to default" and "Upgrade when expired" will start to be upgraded after the stable version is released. For each deployment, a model version upgrade takes place with no interruption in service for API calls. Upgrades are staged by region and the full upgrade process is expected to take 2 weeks. Deployments of `gpt-4` versions 1106-Preview and 0125-Preview set to "No autoupgrade" will not be upgraded and will stop operating when the preview version is upgraded in the region.
	`77`	+> - `gpt-4` versions 1106-Preview and 0125-Preview will be upgraded with a stable version of `gpt-4` in the future. Deployments of `gpt-4` versions 1106-Preview and 0125-Preview set to "Auto-update to default" and "Upgrade when expired" will start to be upgraded after the stable version is released. For each deployment, a model version upgrade takes place with no interruption in service for API calls. Upgrades are staged by region and the full upgrade process is expected to take 2 weeks. Deployments of `gpt-4` versions 1106-Preview and 0125-Preview set to "No autoupgrade" will not be upgraded and will stop operating when the preview version is upgraded in the region. See [Azure OpenAI model retirements and deprecations](./model-retirements.md) for more information on the timing of the upgrade.
`78`	`78`
`79`	`79`	`## GPT-3.5`
`80`	`80`