Merge pull request #6618 from MicrosoftDocs/main

learn-build-service-prod[bot] · web-flow · commit 770806278cc3 · 2025-08-16T05:05:41.000Z
Auto Publish – main to live - 2025-08-16 05:03 UTC
diff --git a/articles/ai-foundry/openai/concepts/models.md b/articles/ai-foundry/openai/concepts/models.md
@@ -55,9 +55,9 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
 
 |  Model ID  | Description | Context Window | Max Output Tokens | Training Data (up to)  |
 |  --- |  :--- |:--- |:---|:---: |
-| `gpt-5` (2025-08-07) |  - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md).  | 272,000  | 128,000 | October 24, 2024 |
-| `gpt-5-mini` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md).     | 272,000  | 128,000  | June 24, 2024 |
-| `gpt-5-nano` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md).     | 272,000 | 128,000 | May 31, 2024 |
+| `gpt-5` (2025-08-07) |  - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md).  | 400,000<br><br>Input: 272,000<br>Output: 128,000  | 128,000 | October 24, 2024 |
+| `gpt-5-mini` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md).     | 400,000<br><br>Input: 272,000<br>Output: 128,000   | 128,000  | June 24, 2024 |
+| `gpt-5-nano` (2025-08-07) | - [Reasoning](../how-to/reasoning.md) <br> - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - Structured outputs.<br> - Text and image processing. <br> - Functions, tools, and parallel tool calling. <br> [Full summary of capabilities](../how-to/reasoning.md).     | 400,000<br><br>Input: 272,000<br>Output: 128,000  | 128,000 | May 31, 2024 |
 | `gpt-5-chat` (2025-08-07)<br>**Preview** | - Chat Completions API. <br> - [Responses API](../how-to/responses.md). <br> - **Input**: Text/Image <br> - **Output**: Text only  | 128,000 | 16,384 | October 24, 2024 |
 
 ## gpt-oss
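
The updated Context Window cells for `gpt-5`, `gpt-5-mini`, and `gpt-5-nano` give a 400,000-token total alongside separate input and output limits. A minimal sketch of that arithmetic follows, assuming the two limits are checked independently (the table implies this but does not state it). The constant names and `fits_context` are hypothetical and not part of any Azure SDK.

```python
# Limits copied from the updated table rows for gpt-5, gpt-5-mini, gpt-5-nano.
CONTEXT_WINDOW = 400_000
MAX_INPUT_TOKENS = 272_000
MAX_OUTPUT_TOKENS = 128_000

# The documented total is the sum of the input and output limits.
assert MAX_INPUT_TOKENS + MAX_OUTPUT_TOKENS == CONTEXT_WINDOW


def fits_context(input_tokens: int, requested_output_tokens: int) -> bool:
    """Return True if both counts are within the documented limits.

    Assumption: input and output limits are enforced separately.
    """
    if input_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (
        input_tokens <= MAX_INPUT_TOKENS
        and requested_output_tokens <= MAX_OUTPUT_TOKENS
    )
```

Under this reading, a 300,000-token prompt would be rejected even though it is below the 400,000 total, because it exceeds the 272,000 input limit.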
diff --git a/articles/ai-foundry/openai/how-to/reasoning.md b/articles/ai-foundry/openai/how-to/reasoning.md
@@ -5,7 +5,7 @@ description: Learn how to use Azure OpenAI's advanced GPT-5 series, o3-mini, o1,
 manager: nitinme
 ms.service: azure-ai-openai
 ms.topic: include
-ms.date: 08/06/2025
+ms.date: 08/15/2025
 author: mrbullwinkle    
 ms.author: mbullwin
 ---
@@ -29,8 +29,8 @@ Azure OpenAI reasoning models are designed to tackle reasoning and problem-solvi
 | Model | Region | Limited access |
 |---|---|---|
 | `gpt-5` | East US2 & Sweden Central (Global Standard & Data Zones)  |  Request access: [gpt-5 limited access model application](https://aka.ms/oai/gpt5access). If you already have `o3 access` no request is required    |
-| `gpt-5-mini` | East US2 & Sweden Central (Global Standard  & Data Zones)  |  No access request needed to use the core capabilities of this model    |
-| `gpt-5-nano` | East US2 & Sweden Central (Global Standard  & Data Zones)  |  No access request needed to use the core capabilities of this model    |
+| `gpt-5-mini` | East US2 & Sweden Central (Global Standard  & Data Zones)  |  No access request needed.    |
+| `gpt-5-nano` | East US2 & Sweden Central (Global Standard  & Data Zones)  |  No access request needed. |
 | `o3-pro`  | East US2 & Sweden Central (Global Standard)    |  Request access: [o3 limited access model application](https://aka.ms/oai/o3access). If you already have `o3 access` no request is required. |
 | `codex-mini`  | East US2 & Sweden Central (Global Standard)    | No access request needed.    |
 | `o4-mini`  | [Model availability](../concepts/models.md#global-standard-model-availability)   | No access request needed to use the core capabilities of this model.<br><br> Request access: [o4-mini reasoning summary feature](https://aka.ms/oai/o3access)     |
@@ -49,7 +49,7 @@ Azure OpenAI reasoning models are designed to tackle reasoning and problem-solvi
 | **API Version** | [v1 preview](../api-version-lifecycle.md#api-evolution) | [v1 preview](../api-version-lifecycle.md#api-evolution) | [v1 preview](../api-version-lifecycle.md#api-evolution) |
 | **[Developer Messages](#developer-messages)** | ✅ | ✅ | ✅ | 
 | **[Structured Outputs](./structured-outputs.md)** | ✅ | ✅ | ✅ |
-| **[Context Window](../concepts/models.md#o-series-models)** |  Input: 272,000 <br> Output: 128,000 | Input: 272,000 <br> Output: 128,000 | Input: 272,000 <br> Output: 128,000 |
+| **[Context Window](../concepts/models.md#o-series-models)** |  400,000 <br><br>Input: 272,000 <br> Output: 128,000 | 400,000 <br><br> Input: 272,000 <br> Output: 128,000 |  400,000 <br><br> Input: 272,000 <br> Output: 128,000 |
 | **[Reasoning effort](#reasoning-effort)** | ✅| ✅| ✅|
 | **[Image input](./gpt-with-vision.md)** | ✅ | ✅ | ✅ |
 | Chat Completions API | ✅ | ✅ | ✅ |
@@ -58,13 +58,13 @@ Azure OpenAI reasoning models are designed to tackle reasoning and problem-solvi
 | Parallel Tool Calls<sup>1</sup> | ✅ | ✅ | ✅ |
 | `max_completion_tokens` <sup>2</sup> |  ✅ | ✅ | ✅ |
 | System Messages <sup>3</sup> | ✅ | ✅| ✅ |
-| [Reasoning summary](#reasoning-summary) <sup>4</sup> |  ✅ | - | - |
+| [Reasoning summary](#reasoning-summary) |  ✅ | ✅ | ✅ |
 | Streaming   | ✅ | ✅ | ✅|
 
 <sup>1</sup> Parallel tool calls are not supported when `reasoning_effort` is set to `minimal`<br><br>
 <sup>2</sup> Reasoning models will only work with the `max_completion_tokens` parameter when using the Chat Completions API. Use `max_output_tokens` with the Responses API. <br><br>
 <sup>3</sup> The latest reasoning models support system messages to make migration easier. You should not use both a developer message and a system message in the same API request.<br><br>
-<sup>4</sup> Access to the chain-of-thought reasoning summary is limited access only for `o3` & `o4-mini`.
+
 
 ### NEW GPT-5 reasoning features
 
@@ -95,13 +95,12 @@ For more information, we also recommend reading OpenAI's [GPT-5 prompting cookbo
 | Parallel Tool Calls | - | - | - | - | -  | -  |  - |
 | `max_completion_tokens` <sup>1</sup> |  ✅ | ✅ | ✅ | ✅ |✅ |✅ | ✅ |
 | System Messages <sup>2</sup> | ✅ | ✅| ✅ | ✅ | ✅ | ✅ | - |
-| [Reasoning summary](#reasoning-summary) <sup>3</sup> |  ✅ | - | ✅ | ✅ | -  | -  | - |
-| Streaming <sup>4</sup>  | ✅ | - | ✅ | ✅| ✅ | - | - |
+| [Reasoning summary](#reasoning-summary) |  ✅ | - | ✅ | ✅ | -  | -  | - |
+| Streaming <sup>3</sup>  | ✅ | - | ✅ | ✅| ✅ | - | - |
 
 <sup>1</sup> Reasoning models will only work with the `max_completion_tokens` parameter when using the Chat Completions API. Use `max_output_tokens` with the Responses API.<br><br>
 <sup>2</sup> The latest o<sup>&#42;</sup> series model support system messages to make migration easier. When you use a system message with `o4-mini`, `o3`, `o3-mini`, and `o1` it will be treated as a developer message. You should not use both a developer message and a system message in the same API request.
-<sup>3</sup> Access to the chain-of-thought reasoning summary is limited access only for `o3` & `o4-mini`.
-<sup>4</sup> Streaming for `o3` is limited access only.
+<sup>3</sup> Streaming for `o3` is limited access only.
 
 ---
 
@@ -432,10 +431,10 @@ Console.WriteLine($"{completion.Role}: {completion.Content[0].Text}");
 
 ## Reasoning summary
 
-When using the latest reasoning models with the [Responses API](./responses.md) you can use the reasoning summary parameter to receive summaries of the model's chain of thought reasoning. This parameter can be set to `auto`, `concise`, or `detailed`. Access to this feature requires you to [Request Access](https://aka.ms/oai/o3access).
+When using the latest reasoning models with the [Responses API](./responses.md), you can use the reasoning summary parameter to receive summaries of the model's chain-of-thought reasoning.
 
 > [!NOTE]
-> Even when enabled, reasoning summaries are not generated for every step/request. This is expected behavior.
+> Even when enabled, reasoning summaries are not guaranteed to be generated for every step/request. This is expected behavior.
 
 # [Python](#tab/py)
 
@@ -464,8 +463,8 @@ response = client.responses.create(
     model="gpt-5", # replace with model deployment name
     reasoning={
         "effort": "medium",
-        "summary": "auto" # auto, concise, or detailed 
-    }
+        "summary": "auto" # auto, concise, or detailed; gpt-5 series models do not support concise
+    },
     text={
         "verbosity": "low" # New with GPT-5 models
     }
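
The parameters touched in this hunk can be sketched as a small helper that assembles keyword arguments for a Responses API call and rejects `concise` for the gpt-5 series, as the new inline comment describes. This is an illustration only: `build_reasoning_request`, `GPT5_SERIES`, and the `model_family` argument are hypothetical names, and sending `text.verbosity` only for gpt-5 models is an assumption based on the "New with GPT-5 models" comment.

```python
GPT5_SERIES = {"gpt-5", "gpt-5-mini", "gpt-5-nano"}
SUMMARY_VALUES = {"auto", "concise", "detailed"}


def build_reasoning_request(model_family, deployment, summary="auto",
                            effort="medium", verbosity="low"):
    """Build kwargs for a Responses API request (hypothetical helper).

    model_family: base model name, e.g. "gpt-5" or "o3".
    deployment:   your model deployment name.
    """
    if summary not in SUMMARY_VALUES:
        raise ValueError(f"unknown summary value: {summary!r}")
    if model_family in GPT5_SERIES and summary == "concise":
        # Per the updated comment in the diff above.
        raise ValueError("gpt-5 series models do not support 'concise'")

    kwargs = {
        "model": deployment,
        "reasoning": {"effort": effort, "summary": summary},
    }
    if model_family in GPT5_SERIES:
        # Assumption: verbosity is only sent for GPT-5 models.
        kwargs["text"] = {"verbosity": verbosity}
    return kwargs
```

The result could then be passed on as `client.responses.create(input=..., **kwargs)`, where `client` is the client object constructed earlier in the article's Python sample.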
diff --git a/articles/search/search-security-overview.md b/articles/search/search-security-overview.md
@@ -92,7 +92,7 @@ Internal requests are secured and managed by Microsoft. You can't configure or c
 Internal traffic consists of:
 
 + Service-to-service calls for tasks like authentication and authorization through Microsoft Entra ID, resource logging sent to Azure Monitor, and [private endpoint connections](service-create-private-endpoint.md) that utilize Azure Private Link.
-+ Requests made to Azure AI services APIs for [built-in skills](cognitive-search-predefined-skills.md)
++ Requests for [built-in skills processing](cognitive-search-predefined-skills.md), with same-region requests directed to an internally hosted Azure AI multi-service resource used exclusively for built-in skills processing by Azure AI Search.
 + Requests made to the various models that support [semantic ranking](semantic-search-overview.md#availability-and-pricing).
 
 <a name="service-access-and-authentication"></a>