Commit 5695e61

Merge pull request #6975 from MicrosoftDocs/main
Auto Publish – main to live - 2025-09-08 17:07 UTC
2 parents 92e0baf + f1d693d commit 5695e61

5 files changed

+41
-57
lines changed

articles/ai-foundry/openai/concepts/model-retirements.md

Lines changed: 1 addition & 1 deletion
@@ -4,7 +4,7 @@ titleSuffix: Azure OpenAI
44
description: Learn about model deprecations and retirements in Azure OpenAI.
55
ms.service: azure-ai-openai
66
ms.topic: conceptual
7-
ms.date: 08/14/2025
7+
ms.date: 09/08/2025
88
ms.custom:
99
manager: nitinme
1010
author: mrbullwinkle

articles/ai-foundry/openai/how-to/reasoning.md

Lines changed: 9 additions & 11 deletions
@@ -5,7 +5,7 @@ description: Learn how to use Azure OpenAI's advanced GPT-5 series, o3-mini, o1,
55
manager: nitinme
66
ms.service: azure-ai-openai
77
ms.topic: include
8-
ms.date: 08/27/2025
8+
ms.date: 09/08/2025
99
author: mrbullwinkle
1010
ms.author: mbullwin
1111
---
@@ -158,17 +158,16 @@ pip install openai --upgrade
158158
If you're new to using Microsoft Entra ID for authentication see [How to configure Azure OpenAI in Azure AI Foundry Models with Microsoft Entra ID authentication](../how-to/managed-identity.md).
159159

160160
```python
161-
from openai import AzureOpenAI
161+
from openai import OpenAI
162162
from azure.identity import DefaultAzureCredential, get_bearer_token_provider
163163

164164
token_provider = get_bearer_token_provider(
165165
DefaultAzureCredential(), "https://cognitiveservices.azure.com/.default"
166166
)
167167

168-
client = AzureOpenAI(
169-
azure_endpoint = os.getenv("AZURE_OPENAI_ENDPOINT"),
170-
azure_ad_token_provider=token_provider,
171-
api_version="2025-04-01-preview"
168+
client = OpenAI(
169+
base_url = "https://YOUR-RESOURCE-NAME.openai.azure.com/openai/v1/",
170+
api_key=token_provider,
172171
)
173172

174173
response = client.chat.completions.create(
@@ -371,17 +370,16 @@ pip install openai --upgrade
371370
If you're new to using Microsoft Entra ID for authentication see [How to configure Azure OpenAI with Microsoft Entra ID authentication](../how-to/managed-identity.md).
372371

373372
```python
374-
from openai import AzureOpenAI
373+
from openai import OpenAI
375374
from azure.identity import DefaultAzureCredential, get_bearer_token_provider
376375

377376
token_provider = get_bearer_token_provider(
378377
DefaultAzureCredential(), "https://cognitiveservices.azure.com/.default"
379378
)
380379

381-
client = AzureOpenAI(
382-
azure_endpoint = os.getenv("AZURE_OPENAI_ENDPOINT"),
383-
azure_ad_token_provider=token_provider,
384-
api_version="2025-04-01-preview"
380+
client = OpenAI(
381+
base_url = "https://YOUR-RESOURCE-NAME.openai.azure.com/openai/v1/",
382+
api_key=token_provider,
385383
)
386384

387385
response = client.chat.completions.create(

articles/ai-foundry/openai/how-to/responses.md

Lines changed: 7 additions & 15 deletions
@@ -5,7 +5,7 @@ description: Learn how to use Azure OpenAI's new stateful Responses API.
55
author: mrbullwinkle
66
ms.author: mbullwin
77
manager: nitinme
8-
ms.date: 08/27/2025
8+
ms.date: 09/08/2025
99
ms.service: azure-ai-openai
1010
ms.topic: include
1111
ms.custom:
@@ -109,21 +109,17 @@ print(response.model_dump_json(indent=2))
109109

110110
# [Python (Microsoft Entra ID)](#tab/python-secure)
111111

112-
> [!NOTE]
113-
> Full v1 GA support for the OpenAI Python library with Microsoft Entra ID is coming soon. The example below will be replaced once support is added. To learn more, check out the [API lifecycle guide](../api-version-lifecycle.md#api-evolution).
114-
115112
```python
116-
from openai import AzureOpenAI
113+
from openai import OpenAI
117114
from azure.identity import DefaultAzureCredential, get_bearer_token_provider
118115

119116
token_provider = get_bearer_token_provider(
120117
DefaultAzureCredential(), "https://cognitiveservices.azure.com/.default"
121118
)
122119

123-
client = AzureOpenAI(
120+
client = OpenAI(
124121
base_url = "https://YOUR-RESOURCE-NAME.openai.azure.com/openai/v1/",
125-
azure_ad_token_provider=token_provider,
126-
api_version="preview"
122+
api_key=token_provider,
127123
)
128124

129125
response = client.responses.create(
@@ -238,21 +234,17 @@ response = client.responses.retrieve("resp_67cb61fa3a448190bcf2c42d96f0d1a8")
238234

239235
# [Python (Microsoft Entra ID)](#tab/python-secure)
240236

241-
> [!NOTE]
242-
> Full v1 GA support for the OpenAI Python library with Microsoft Entra ID is coming soon. The older preview API example below will be replaced once support is added. To learn more, check out the [API lifecycle guide](../api-version-lifecycle.md#api-evolution).
243-
244237
```python
245-
from openai import AzureOpenAI
238+
from openai import OpenAI
246239
from azure.identity import DefaultAzureCredential, get_bearer_token_provider
247240

248241
token_provider = get_bearer_token_provider(
249242
DefaultAzureCredential(), "https://cognitiveservices.azure.com/.default"
250243
)
251244

252-
client = AzureOpenAI(
245+
client = OpenAI(
253246
base_url = "https://YOUR-RESOURCE-NAME.openai.azure.com/openai/v1/",
254-
azure_ad_token_provider=token_provider,
255-
api_version="preview"
247+
api_key=token_provider,
256248
)
257249

258250
response = client.responses.retrieve("resp_67cb61fa3a448190bcf2c42d96f0d1a8")

articles/ai-foundry/openai/includes/retirement/models.md

Lines changed: 5 additions & 5 deletions
@@ -3,7 +3,7 @@ title: Model Retirement Table
33
titleSuffix: Azure OpenAI in Azure AI Foundry Models
44
description: Model retirement table for Azure OpenAI in Azure AI Foundry Models.
55
manager: nitinme
6-
ms.date: 08/14/2025
6+
ms.date: 09/08/2025
77
ms.service: azure-ai-openai
88
ms.topic: include
99
ms.custom: references_regions, build-2025
@@ -17,11 +17,11 @@ ms.custom: references_regions, build-2025
1717
| Model | Version | Lifecycle<br>Status | Retirement date | Replacement model |
1818
| --------------------------|-------------------|:----------------------|------------------------------------|--------------------------------------|
1919
| `computer-use-preview` | 2025-03-11 | Preview | No earlier than October 10, 2025 | |
20-
| `gpt-35-turbo` | 1106 | Generally Available | No earlier than October 15, 2025 | `gpt-4.1-mini` |
21-
| `gpt-35-turbo` | 0125 | Generally Available | No earlier than October 15, 2025 | `gpt-4.1-mini` |
22-
| `gpt-3.5-turbo-instruct` | 0914 | Generally Available | No earlier than October 15, 2025 | |
23-
| `gpt-4` | turbo-2024-04-09 | Generally Available | No earlier than October 15, 2025 | `gpt-4.1` |
2420
| `o1-mini` | 2024-09-12 | Generally Available | No earlier than October 27, 2025 | `o4-mini` |
21+
| `gpt-35-turbo` | 1106 | Generally Available | No earlier than November 11, 2025 | `gpt-4.1-mini` |
22+
| `gpt-35-turbo` | 0125 | Generally Available | No earlier than November 11, 2025 | `gpt-4.1-mini` |
23+
| `gpt-3.5-turbo-instruct` | 0914 | Generally Available | No earlier than November 11, 2025 | |
24+
| `gpt-4` | turbo-2024-04-09 | Generally Available | No earlier than November 11, 2025 | `gpt-4.1` |
2525
| `gpt-5-chat` | 2025-08-07 | Preview | No earlier than November 15, 2025 | |
2626
| `model-router` | 2025-05-19 | Preview | No earlier than November 30, 2025 | |
2727
| `model-router` | 2025-08-07 | Preview | No earlier than November 30, 2025 | |

articles/search/search-agentic-retrieval-how-to-retrieve.md

Lines changed: 19 additions & 25 deletions
@@ -67,7 +67,7 @@ POST https://{{search-url}}/agents/{{agent-name}}/retrieve?api-version=2025-08-0
6767
"content" : [
6868
{ "type" : "text", "text" : "You can answer questions about the Earth at night.
6969
Sources have a JSON format with a ref_id that must be cited in the answer.
70-
If you do not have the answer, respond with "I don't know"." }
70+
If you do not have the answer, respond with 'I don't know'." }
7171
]
7272
},
7373
{
@@ -77,39 +77,29 @@ POST https://{{search-url}}/agents/{{agent-name}}/retrieve?api-version=2025-08-0
7777
]
7878
}
7979
],
80-
"targetIndexParams" : [
81-
{
82-
"indexName" : "{{index-name}}",
83-
"filterAddOn" : "page_number eq '105'",
84-
"IncludeReferenceSourceData": true,
85-
"rerankerThreshold" : 2.5,
86-
"maxDocsForReranker": 50
87-
}
88-
]
80+
"knowledgeSourceParams": [
81+
{
82+
"filterAddOn": null,
83+
"knowledgeSourceName": "earth-at-night-blob-ks",
84+
"kind": "searchIndex"
85+
}
86+
]
8987
}
9088
```
9189

9290
**Key points**:
9391

92+
+ The retrieve action targets a [knowledge agent](search-agentic-retrieval-how-to-create.md). The knowledge agent specifies one or more knowledge sources and a knowledge source configuration. Review your knowledge agent definition for output and semantic ranking configuration.
93+
9494
+ `messages` articulates the messages sent to the model. The message format is similar to Azure OpenAI APIs.
9595

9696
+ `role` defines where the message came from, for example either `assistant` or `user`. The model you use determines which roles are valid.
9797

98-
+ `content` is the message sent to the LLM. It must be text in this preview.
99-
100-
+ `targetIndexParams` provide instructions on the retrieval. Currently in this preview, you can only target a single index.
101-
102-
+ `filterAddOn` lets you set an [OData filter expression](search-filters.md) for keyword or hybrid search.
98+
+ `content` is the message or prompt sent to the LLM. It must be text in this preview.
10399

104-
+ `IncludeReferenceSourceData` tells the retrieval engine to return the source content in the response. This value is initially set in the knowledge agent definition. You can override that setting in the retrieve action to return original source content in the [references section](#review-the-references-array) of the response.
100+
+ [`knowledgeSourceParams`](/rest/api/searchservice/knowledge-retrieval/retrieve?view=rest-searchservice-2025-08-01-preview#searchindexknowledgesourceparams&preserve-view=true) is optional. Specify a knowledge source if the agent has more than one, and you want to focus the retrieve action on just one knowledge source. If the knowledge agent has just one knowledge source with the configuration you want, you can omit this section.
105101

106-
+ `rerankerThreshold` and `maxDocsForReranker` are also initially set in the knowledge agent definition as defaults. You can override them in the retrieve action to configure [semantic reranker](semantic-how-to-configure.md), setting minimum thresholds and the maximum number of inputs sent to the reranker.
107-
108-
`rerankerThreshold` is the minimum semantic reranker score that's acceptable for inclusion in a response. [Reranker scores](semantic-search-overview.md#how-results-are-scored) range from 1 to 4. Plan on revising this value based on testing and what works for your content.
109-
110-
`maxDocsForReranker` dictates the maximum number of documents to consider for the final response string. Semantic reranker accepts 50 documents. If the maximum is 200, four more subqueries are added to the query plan to ensure all 200 documents are semantically ranked. If the number isn't evenly divisible by 50, the query plan rounds up to the nearest whole number.
111-
112-
The `content` portion of the response consists of the 200 chunks or less, excluding any results that fail to meet the minimum threshold of a 2.5 reranker score.
102+
A knowledge source specification on the retrieve action describes the target search index on the search service. So even if the knowledge source "kind" is Azure blob, the valid value here is `searchIndex`. In this first public preview release, `knowledgeSourceParams.kind` is always `searchIndex`.
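
The key points above can be sketched as a small request-body builder. This is a minimal illustration, not the documented client: the helper name `build_retrieve_body`, the `assistant` role for the instruction message, and the sample user question are assumptions made for this example. The property names (`messages`, `content`, `knowledgeSourceParams`, `filterAddOn`, `knowledgeSourceName`, `kind`) come from the request shown earlier.

```python
import json


def build_retrieve_body(messages, knowledge_source_name=None, filter_add_on=None):
    """Build a retrieve request body (hypothetical helper, for illustration only).

    messages: list of (role, text) tuples. Content must be text in this preview.
    knowledge_source_name: optional; omit it when the agent has a single
        knowledge source with the configuration you want.
    """
    body = {
        "messages": [
            {"role": role, "content": [{"type": "text", "text": text}]}
            for role, text in messages
        ]
    }
    if knowledge_source_name is not None:
        body["knowledgeSourceParams"] = [
            {
                "filterAddOn": filter_add_on,
                "knowledgeSourceName": knowledge_source_name,
                # In this first public preview, kind is always "searchIndex".
                "kind": "searchIndex",
            }
        ]
    return body


body = build_retrieve_body(
    [
        # Role assignment and question text are assumptions for this sketch.
        ("assistant", "You can answer questions about the Earth at night."),
        ("user", "Why do city lights look brighter in some regions?"),
    ],
    knowledge_source_name="earth-at-night-blob-ks",
)
print(json.dumps(body, indent=2))
```

Leaving out `knowledge_source_name` produces a body with no `knowledgeSourceParams` section, which matches the case where the knowledge agent has only one knowledge source.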
113103

114104
## Review the extracted response
115105

@@ -133,9 +123,13 @@ The body of the response is also structured in the chat message style format. Cu
133123

134124
**Key points**:
135125

136-
+ `content` is a JSON array. It's a single string composed of the most relevant documents (or chunks) found in the search index, given the query and chat history inputs. This array is your grounding data that a chat completion model uses to formulate a response to the user's question.
126+
+ `content.text` is a JSON array serialized as a single string. It's composed of the most relevant documents (or chunks) found in the search index, given the query and chat history inputs. This array is your grounding data that a chat completion model uses to formulate a response to the user's question.
127+
128+
This portion of the response consists of 200 chunks or fewer, excluding any results that fail to meet the minimum reranker score threshold of 2.5.
129+
130+
The string starts with the reference ID of the chunk (used for citation purposes), and any fields specified in the semantic configuration of the target index. In this example, you should assume the semantic configuration in the target index has a "title" field, a "terms" field, and a "content" field.
137131

138-
+ "text" is the only valid value for `type`, and it consists of the reference ID of the chunk (used for citation purposes), and any fields specified in the semantic configuration of the target index. In this example, you should assume the semantic configuration in the target index has a "title" field, a "terms" field, and a "content" field.
132+
+ `content.type` has one valid value in this preview: `text`.
139133

140134
> [!NOTE]
141135
> The `maxOutputSize` property on the [knowledge agent](search-agentic-retrieval-how-to-create.md) determines the length of the string. We recommend 5,000 tokens.
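
As a rough sketch of how a caller might consume the grounding data described in the key points, the example below decodes `content.text` back into a list of chunks. It assumes the string is a JSON-encoded array whose items carry `ref_id`, `title`, `terms`, and `content` keys, following the semantic configuration assumed in this example. The helper name `parse_grounding_chunks` and the sample values are hypothetical.

```python
import json


def parse_grounding_chunks(content_item):
    """Decode one response content item into a list of chunk dictionaries.

    Hypothetical helper: assumes content_item["text"] holds a JSON array
    encoded as a string, and that "text" is the only valid type in this preview.
    """
    if content_item.get("type") != "text":
        raise ValueError("Only content items of type 'text' are expected")
    chunks = json.loads(content_item["text"])
    if not isinstance(chunks, list):
        raise ValueError("Expected the decoded text to be a JSON array")
    return chunks


# Made-up sample data shaped like the fields described above.
sample_item = {
    "type": "text",
    "text": json.dumps(
        [
            {"ref_id": 0, "title": "Sample title", "terms": "sample, terms",
             "content": "First sample chunk."},
            {"ref_id": 1, "title": "Another title", "terms": "more, terms",
             "content": "Second sample chunk."},
        ]
    ),
}

chunks = parse_grounding_chunks(sample_item)
# Map each reference ID to its title so answers can cite their sources.
citations = {chunk["ref_id"]: chunk["title"] for chunk in chunks}
print(citations)
```

Keeping the reference IDs in a lookup like `citations` makes it straightforward to resolve the `ref_id` values that the chat completion model is instructed to cite.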
