articles/ai-services/openai/concepts/model-retirements.md
+7 -2 (7 additions & 2 deletions)
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
description: Learn about the model deprecations and retirements in Azure OpenAI.
ms.service: azure-ai-openai
ms.topic: conceptual
-ms.date: 10/02/2024
+ms.date: 10/25/2024
ms.custom:
manager: nitinme
author: mrbullwinkle
@@ -91,6 +91,8 @@ These models are currently available for use in Azure OpenAI Service.
| Model | Version | Retirement date | Suggested replacements |
| ---- | ---- | ---- | --- |
+|`babbage-002`| 1 | Deprecation Date: November 15, 2024 <br>Retirement Date: January 27, 2025 ||
+|`davinci-002`| 1 | Deprecation Date: November 15, 2024 <br>Retirement Date: January 27, 2025 ||
|`dall-e-2`| 2 | January 27, 2025 |`dall-e-3`|
|`dall-e-3`| 3 | No earlier than April 30, 2025 ||
|`gpt-35-turbo`| 0301 | January 27, 2025<br><br> Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 13, 2024. |`gpt-35-turbo` (0125) <br><br> `gpt-4o-mini`|
@@ -158,9 +160,12 @@ If you're an existing customer looking for information about these models, see [
| code-search-babbage-code-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
| code-search-babbage-text-001 | July 6, 2023 | June 14, 2024 | text-embedding-3-small |
-
## Retirement and deprecation history
+## October 25, 2024
+
+* `babbage-002` & `davinci-002` deprecation date: November 15, 2024 and retirement date: January 27, 2025.
+
## September 12, 2024
* `gpt-35-turbo` (0301), (0613), (1106) and `gpt-35-turbo-16k` (0613) auto-update to default upgrade date updated to November 13, 2024.
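
If you prefer that a deployment stay pinned to its current version rather than following the auto-update-to-default behavior referenced above, one option is to set the deployment's version-upgrade behavior through the Cognitive Services management REST API. The following is a minimal sketch, assuming the `versionUpgradeOption` property, the `2023-05-01` API version, and placeholder subscription, resource group, resource, and deployment names; verify the property name and allowed values against the management API reference before relying on it.

```python
# Sketch: pin a gpt-35-turbo deployment so it is not auto-upgraded to the default
# version. The `versionUpgradeOption` property, its values, and the api-version
# are assumptions to verify; resource names and sku capacity are placeholders.
import requests
from azure.identity import DefaultAzureCredential

SUB, RG, ACCOUNT, DEPLOYMENT = "<subscription-id>", "<resource-group>", "<aoai-resource>", "gpt35-deployment"
url = (
    f"https://management.azure.com/subscriptions/{SUB}/resourceGroups/{RG}"
    f"/providers/Microsoft.CognitiveServices/accounts/{ACCOUNT}/deployments/{DEPLOYMENT}"
)
token = DefaultAzureCredential().get_token("https://management.azure.com/.default").token
body = {
    "sku": {"name": "Standard", "capacity": 30},  # placeholder sku/capacity
    "properties": {
        "model": {"format": "OpenAI", "name": "gpt-35-turbo", "version": "0125"},
        # Assumed value meaning: keep the pinned version until its retirement date.
        "versionUpgradeOption": "NoAutoUpgrade",
    },
}
resp = requests.put(
    url,
    params={"api-version": "2023-05-01"},
    headers={"Authorization": f"Bearer {token}"},
    json=body,
)
resp.raise_for_status()
print(resp.json()["properties"].get("versionUpgradeOption"))
```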
@@ -357,16 +357,40 @@ You can also use the OpenAI text to speech voices via Azure AI Speech. To learn
## Model summary table and region availability
-> [!NOTE]
-> This article primarily covers model/region availability that applies to all Azure OpenAI customers with deployment types of **Standard**. Some select customers have access to model/region combinations that are not listed in the unified table below. For more information on Provisioned deployments, see our [Provisioned guidance](./provisioned-throughput.md).
+### Models by deployment type
+
+Azure OpenAI provides customers with choices on the hosting structure that fits their business and usage patterns. The service offers two main types of deployment:
+
+- **Standard** is offered with a global deployment option, routing traffic globally to provide higher throughput.
+- **Provisioned** is also offered with a global deployment option, allowing customers to purchase and deploy provisioned throughput units across Azure global infrastructure.
+
+All deployments can perform the exact same inference operations; however, the billing, scale, and performance are substantially different. To learn more about Azure OpenAI deployment types, see our [deployment types guide](../how-to/deployment-types.md).
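
Since the deployment type is carried on a deployment's SKU, one way to check which type your existing deployments use is to list them through the management API. The sketch below is a minimal illustration, assuming the deployments list endpoint, the `2023-05-01` API version, and SKU names such as `GlobalStandard` and `ProvisionedManaged`; the resource identifiers are placeholders.

```python
# Sketch: list deployments under an Azure OpenAI resource and print each SKU,
# which reflects the deployment type (for example Standard, GlobalStandard,
# ProvisionedManaged). Endpoint shape, api-version, and SKU names are assumptions.
import requests
from azure.identity import DefaultAzureCredential

SUB, RG, ACCOUNT = "<subscription-id>", "<resource-group>", "<aoai-resource>"
url = (
    f"https://management.azure.com/subscriptions/{SUB}/resourceGroups/{RG}"
    f"/providers/Microsoft.CognitiveServices/accounts/{ACCOUNT}/deployments"
)
token = DefaultAzureCredential().get_token("https://management.azure.com/.default").token
resp = requests.get(
    url,
    params={"api-version": "2023-05-01"},
    headers={"Authorization": f"Bearer {token}"},
)
resp.raise_for_status()
for dep in resp.json().get("value", []):
    model = dep["properties"]["model"]
    print(f'{dep["name"]}: sku={dep["sku"]["name"]}, '
          f'model={model["name"]} ({model.get("version", "default")})')
```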
In addition to the regions above, which are available to all Azure OpenAI customers, some select pre-existing customers have been granted access to versions of GPT-4 in additional regions:
@@ -406,23 +426,14 @@ In addition to the regions above which are available to all Azure OpenAI custome
### GPT-3.5 models
-> [!IMPORTANT]
-> The NEW `gpt-35-turbo (0125)` model has various improvements, including higher accuracy at responding in requested formats and a fix for a bug which caused a text encoding issue for non-English language function calls.
-
-GPT-3.5 Turbo is used with the Chat Completion API. GPT-3.5 Turbo version 0301 can also be used with the Completions API, though this is not recommended. GPT-3.5 Turbo versions 0613 and 1106 only support the Chat Completions API.
-
-GPT-3.5 Turbo version 0301 is the first version of the model released. Version 0613 is the second version of the model and adds function calling support.
-
See [model versions](../concepts/model-versions.md) to learn about how Azure OpenAI Service handles model version upgrades, and [working with models](../how-to/working-with-models.md) to learn how to view and configure the model version settings of your GPT-3.5 Turbo deployments.
`babbage-002` and `davinci-002` are not trained to follow instructions. Querying these base models should only be done as a point of reference against a fine-tuned version, to evaluate the progress of your training (see the sketch below).
`gpt-35-turbo` - fine-tuning of this model is limited to a subset of regions, and is not available in every region where the base model is available.
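
As a rough illustration of the note above about treating base models only as a reference point, the sketch below sends the same prompt to a base `babbage-002` deployment and to a hypothetical fine-tuned deployment and prints both completions side by side. The deployment names, environment variables, and API version are assumptions; it uses the `openai` Python package's `AzureOpenAI` client with the Completions API.

```python
# Sketch: compare a base model deployment against a fine-tuned deployment on the
# same prompt. Deployment names, endpoint variables, and api_version are
# placeholders/assumptions, not values from this article.
import os
from openai import AzureOpenAI

client = AzureOpenAI(
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version="2024-02-01",  # assumed GA API version
)

prompt = "Classify the sentiment of: 'The battery life is fantastic.' ->"

# Hypothetical deployment names: one base model, one fine-tuned variant.
for deployment in ("babbage-002-base", "babbage-002-finetuned"):
    completion = client.completions.create(
        model=deployment,   # Azure OpenAI takes the *deployment* name here
        prompt=prompt,
        max_tokens=5,
        temperature=0,
    )
    print(deployment, "=>", completion.choices[0].text.strip())
```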
| Model ID | Fine-Tuning Regions | Max Request (tokens) | Training Data (up to) |
@@ -468,20 +509,7 @@ These models can only be used with Embedding API requests.
**<sup>1</sup>** GPT-4 is currently in public preview.
-### Whisper models
-
-| Model ID | Model Availability | Max Request (audio file size) |
-| --- | --- | :---: |
-|`whisper`| East US 2 <br> North Central US <br> Norway East <br> South India <br> Sweden Central <br> West Europe | 25 MB |
-
-### Text to speech models (Preview)
-
-| Model ID | Model Availability |
-| --- | --- | :---: |
-|`tts-1`| North Central US <br> Sweden Central |
-|`tts-1-hd`| North Central US <br> Sweden Central |
-
-### Assistants (Preview)
+## Assistants (Preview)
For Assistants, you need a combination of a supported model and a supported region. Certain tools and capabilities require the latest models. The following models are available in the Assistants API, SDK, Azure AI Studio, and Azure OpenAI Studio. The following table is for pay-as-you-go. For information on Provisioned Throughput Unit (PTU) availability, see [provisioned throughput](./provisioned-throughput.md). The listed models and regions can be used with both Assistants v1 and v2. You can use [global standard models](#global-standard-model-availability) if they are supported in the regions listed below.
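
A minimal sketch of creating an assistant against a supported model deployment, using the `openai` Python package's `AzureOpenAI` client; the deployment name, API version, and tool selection are assumptions rather than values taken from this article.

```python
# Sketch: create an assistant with the code interpreter tool against a supported
# model deployment. Deployment name and api_version are assumptions.
import os
from openai import AzureOpenAI

client = AzureOpenAI(
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version="2024-05-01-preview",  # assumed Assistants-capable preview version
)

assistant = client.beta.assistants.create(
    name="data-helper",
    instructions="You are a helpful assistant that analyzes CSV files.",
    model="gpt-4o-deployment",  # the *deployment* name in the Azure case
    tools=[{"type": "code_interpreter"}],
)
print(assistant.id)
```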
articles/ai-services/openai/concepts/provisioned-throughput.md
+2 -2 (2 additions & 2 deletions)
@@ -49,7 +49,7 @@ To help with simplifying the sizing effort, the following table outlines the TPM
| Input TPM per PTU | 2,500 | 37,000 |
| Output TPM per PTU | 833 | 12,333 |
-\**For a full list see the [AOAI Studio calcualator](https://oai.azure.com/portal/calculator)
+For a full list see the [AOAI Studio calculator](https://oai.azure.com/portal/calculator).
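
As a back-of-the-envelope companion to the TPM-per-PTU figures above, the sketch below estimates a PTU count from expected input and output TPM and rounds up to a deployment increment. The minimum and increment values are placeholders and the simple max-of-two-ratios formula is an assumption; real sizing depends on factors the calculator accounts for, so treat the calculator linked above as authoritative.

```python
import math

def estimate_ptus(input_tpm: float, output_tpm: float,
                  input_tpm_per_ptu: float, output_tpm_per_ptu: float,
                  minimum: int = 15, increment: int = 5) -> int:
    """Rough PTU estimate: take the larger of the input- and output-driven
    requirements, then round up to a deployment increment and apply a minimum.
    `minimum` and `increment` are placeholders; check the calculator/docs for
    the actual values for your model."""
    raw = max(input_tpm / input_tpm_per_ptu, output_tpm / output_tpm_per_ptu)
    stepped = math.ceil(raw / increment) * increment
    return max(minimum, stepped)

# Example using the per-PTU figures from the first column of the table above.
print(estimate_ptus(input_tpm=150_000, output_tpm=20_000,
                    input_tpm_per_ptu=2_500, output_tpm_per_ptu=833))  # -> 60
```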
## Key concepts
@@ -114,7 +114,7 @@ In Azure OpenAI Studio, the deployment experience identifies when a region lacks
Details on the new deployment experience can be found in the Azure OpenAI [Provisioned get started guide](../how-to/provisioned-get-started.md).
-The new [model capacities API](/rest/api/aiservices/accountmanagement/model-capacities/list?view=rest-aiservices-accountmanagement-2024-04-01-preview&tabs=HTTP&preserve-view=true) can be used to programmatically identify the maximum sized deployment of a specified model. The API consideres both the your quota and service capacity in the region.
+The new [model capacities API](/rest/api/aiservices/accountmanagement/model-capacities/list?view=rest-aiservices-accountmanagement-2024-04-01-preview&tabs=HTTP&preserve-view=true) can be used to programmatically identify the maximum sized deployment of a specified model. The API considers both your quota and service capacity in the region.
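
A sketch of calling that API from Python follows. The endpoint path, query parameters (`modelFormat`, `modelName`, `modelVersion`), API version, and response field names reflect one reading of the linked reference and should be treated as assumptions to verify there; the model name/version values are placeholders.

```python
# Sketch: query per-region capacity for a given model/version via the model
# capacities API. Query parameter and response field names are assumptions to
# check against the linked REST reference.
import requests
from azure.identity import DefaultAzureCredential

SUBSCRIPTION_ID = "<subscription-id>"
url = (f"https://management.azure.com/subscriptions/{SUBSCRIPTION_ID}"
       "/providers/Microsoft.CognitiveServices/modelCapacities")
token = DefaultAzureCredential().get_token("https://management.azure.com/.default").token
resp = requests.get(
    url,
    headers={"Authorization": f"Bearer {token}"},
    params={
        "api-version": "2024-04-01-preview",
        "modelFormat": "OpenAI",
        "modelName": "gpt-4o",        # placeholder model
        "modelVersion": "2024-05-13", # placeholder version
    },
)
resp.raise_for_status()
for item in resp.json().get("value", []):
    props = item.get("properties", {})
    # "availableCapacity" is an assumed field name; inspect the payload.
    print(item.get("location"), props.get("availableCapacity"))
```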
If an acceptable region isn't available to support the desired model, version, and/or PTUs, customers can also try the following steps: