Merge pull request #5266 from MicrosoftDocs/main

PhilKang0704 · web-flow · commit cfe2517783a1 · 2025-05-29T13:30:06.000+08:00
5/29/2025 11:00 AM IST Publish
diff --git a/articles/ai-services/language-service/overview.md b/articles/ai-services/language-service/overview.md
@@ -6,7 +6,7 @@ author: laujan
 manager: nitinme
 ms.service: azure-ai-language
 ms.topic: overview
-ms.date: 03/05/2025
+ms.date: 05/28/2025
 ms.author: lajanuar
 ---
 
@@ -24,6 +24,7 @@ The Language service also provides several new features as well, which can eithe
 * Customizable, which means you train an AI model using our tools to fit your data specifically.
 
 Language features are also utilized in [agent templates](https://github.com/azure-ai-foundry/foundry-samples/tree/main/samples/agent-catalog):
+
 * [Intent routing agent](https://github.com/azure-ai-foundry/foundry-samples/tree/main/samples/agent-catalog/msft-agent-samples/foundry-agent-service-sdk/intent-routing-agent) detects user intent and provides exact answering. Perfect for deterministically intent routing and exact question answering with human controls.
 * [Exact question answering agent](https://github.com/azure-ai-foundry/foundry-samples/tree/main/samples/agent-catalog/msft-agent-samples/foundry-agent-service-sdk/exact-qna-agent) answers high-value predefined questions deterministically to ensure consistent and accurate responses.
 
diff --git a/articles/ai-services/openai/how-to/batch.md b/articles/ai-services/openai/how-to/batch.md
@@ -6,7 +6,7 @@ manager: nitinme
 ms.service: azure-ai-openai
 ms.custom: references_regions
 ms.topic: how-to
-ms.date: 04/14/2025
+ms.date: 05/28/2025
 author: mrbullwinkle
 ms.author: mbullwin
 recommendations: false
diff --git a/articles/ai-services/openai/how-to/provisioned-throughput-onboarding.md b/articles/ai-services/openai/how-to/provisioned-throughput-onboarding.md
@@ -3,7 +3,7 @@ title:  Understanding costs associated with provisioned throughput units (PTU)
 description: Learn about provisioned throughput costs and billing in Azure OpenAI. 
 ms.service: azure-ai-openai
 ms.topic: conceptual 
-ms.date: 05/20/2025
+ms.date: 05/28/2025
 manager: nitinme
 author: aahill 
 ms.author: aahi 
@@ -77,14 +77,14 @@ The amount of throughput (measured in tokens per minute or TPM) a deployment get
 
 For example, for `gpt-4.1:2025-04-14`, 1 output token counts as 4 input tokens towards your utilization limit which matches the [pricing](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/). Older models use a different ratio and for a deeper understanding on how different ratios of input and output tokens impact the throughput your workload needs, see the [Azure OpenAI capacity calculator](https://ai.azure.com/resource/calculator).
 
-|Topic| **gpt-4.1** | **gpt-4.1-mini** | **gpt-4.1-nano** | **o3** | **o3-mini** | **o1** | **gpt-4o** | **gpt-4o-mini** |
-| --- | --- |  --- |  --- | --- | --- | --- | --- | --- |
-|Global & data zone provisioned minimum deployment|15|15| 15 | 15 |15|15|15|15|
-|Global & data zone provisioned scale increment|5|5| 5 | 5 |5|5|5|5|
-|Regional provisioned minimum deployment|50|25| 25 |50 | 25|25|50|25|
-|Regional provisioned scale increment|50|25| 25 | 50 | 25|50|50|25|
-|Input TPM per PTU|3,000|14,900| 59,400 | 600 | 2,500|230|2,500|37,000|
-|Latency Target Value|44 Tokens Per Second|50 Tokens Per Second| 50 Tokens Per Second | 40 Tokens Per Second | 66 Tokens Per Second |25 Tokens Per Second|25 Tokens Per Second|33 Tokens Per Second|
+|Topic| **o4-mini** | **gpt-4.1** | **gpt-4.1-mini** | **gpt-4.1-nano** | **o3** | **o3-mini** | **o1** | **gpt-4o** | **gpt-4o-mini** |
+| --- |  --- | --- |  --- |  --- | --- | --- | --- | --- | --- |
+|Global & data zone provisioned minimum deployment| 15 | 15|15| 15 | 15 |15|15|15|15|
+|Global & data zone provisioned scale increment| 5 | 5|5| 5 | 5 |5|5|5|5|
+|Regional provisioned minimum deployment|25| 50|25| 25 |50 | 25|25|50|25|
+|Regional provisioned scale increment|25| 50|25| 25 | 50 | 25|50|50|25|
+|Input TPM per PTU|5,400 | 3,000|14,900| 59,400 | 600 | 2,500|230|2,500|37,000|
+|Latency Target Value| 66 Tokens Per Second | 40 Tokens Per Second|50 Tokens Per Second| 60 Tokens Per Second | 40 Tokens Per Second | 66 Tokens Per Second |25 Tokens Per Second|25 Tokens Per Second|33 Tokens Per Second|
 
 
 For a full list, see the [Azure OpenAI in Azure AI Foundry Models in Azure AI Foundry portal calculator](https://ai.azure.com/resource/calculator).
diff --git a/articles/ai-services/openai/includes/model-matrix/global-batch-datazone.md b/articles/ai-services/openai/includes/model-matrix/global-batch-datazone.md
@@ -10,16 +10,17 @@ ms.date: 02/14/2025
 ---
 
 
-| **Region**     | **o3-mini**, **2025-01-31**   | **gpt-4o**, **2024-08-06**   | **gpt-4o-mini**, **2024-07-18**   |
-|:-------------------|:---------------------------:|:--------------------------:|:-------------------------------:|
-| eastus             | ✅                        | ✅                       | ✅                            |
-| eastus2            | ✅                        | ✅                       | ✅                            |
-| francecentral      | -                       | ✅                       | ✅                            |
-| germanywestcentral | -                       | ✅                       | ✅                            |
-| northcentralus     | ✅                        | ✅                       | ✅                            |
-| polandcentral      | -                       | ✅                       | ✅                            |
-| southcentralus     | ✅                        | ✅                       | ✅                            |
-| swedencentral      | -                       | ✅                       | ✅                            |
-| westeurope         | -                       | ✅                       | ✅                            |
-| westus             | ✅                        | ✅                       | ✅                            |
-| westus3            | ✅                        | ✅                       | ✅                            |
+| **Region**     | **o4-mini**, **2025-04-16**   | **gpt-4.1**, **2025-04-14**   | **gpt-4.1-nano**, **2025-04-14**   | **gpt-4.1-mini**, **2025-04-14**   | **o3-mini**, **2025-01-31**   | **gpt-4o**, **2024-08-06**   | **gpt-4o-mini**, **2024-07-18**   |
+|:-------------------|:---------------------------:|:---------------------------:|:--------------------------------:|:--------------------------------:|:---------------------------:|:--------------------------:|:-------------------------------:|
+| eastus             | ✅                        | ✅                        | ✅                             | ✅                             | ✅                        | ✅                       | ✅                            |
+| eastus2            | ✅                        | ✅                        | ✅                             | ✅                             | ✅                        | ✅                       | ✅                            |
+| francecentral      | ✅                        | ✅                        | ✅                             | ✅                             | -                       | ✅                       | ✅                            |
+| germanywestcentral | ✅                        | ✅                        | ✅                             | ✅                             | -                       | ✅                       | ✅                            |
+| northcentralus     | ✅                        | ✅                        | ✅                             | ✅                             | ✅                        | ✅                       | ✅                            |
+| polandcentral      | ✅                        | ✅                        | ✅                             | ✅                             | -                       | ✅                       | ✅                            |
+| southcentralus     | ✅                        | ✅                        | ✅                             | ✅                             | ✅                        | ✅                       | ✅                            |
+| spaincentral       | ✅                        | ✅                        | ✅                             | ✅                             | -                       | -                      | -                           |
+| swedencentral      | ✅                        | ✅                        | ✅                             | ✅                             | -                       | ✅                       | ✅                            |
+| westeurope         | ✅                        | ✅                        | ✅                             | ✅                             | -                       | ✅                       | ✅                            |
+| westus             | ✅                        | ✅                        | ✅                             | ✅                             | ✅                        | ✅                       | ✅                            |
+| westus3            | ✅                        | ✅                        | ✅                             | ✅                             | ✅                        | ✅                       | ✅                            |
diff --git a/articles/ai-services/openai/includes/model-matrix/global-batch.md b/articles/ai-services/openai/includes/model-matrix/global-batch.md
diff --git a/articles/search/search-indexer-howto-access-private.md b/articles/search/search-indexer-howto-access-private.md