Skip to content

Commit 5b20e5f

Browse files
authored
Merge branch 'main' into 438940-update-toc-kate
2 parents ae32f3a + 68014d7 commit 5b20e5f

File tree

13 files changed

+110
-52
lines changed

13 files changed

+110
-52
lines changed

articles/ai-foundry/how-to/develop/langchain.md

Lines changed: 6 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.service: azure-ai-foundry
77
ms.custom:
88
- ignite-2024
99
ms.topic: how-to
10-
ms.date: 06/24/2025
10+
ms.date: 06/26/2025
1111
ms.reviewer: fasantia
1212
ms.author: sgilley
1313
author: sdgilley
@@ -31,7 +31,7 @@ To run this tutorial, you need:
3131

3232
* An [Azure subscription](https://azure.microsoft.com).
3333

34-
* A model deployment supporting the [Model Inference API](https://aka.ms/azureai/modelinference) deployed. In this example, we use a `Mistral-medium-2505` deployment in the [Foundry Models](../../../ai-foundry/model-inference/overview.md).
34+
* A model deployment supporting the [Model Inference API](https://aka.ms/azureai/modelinference) deployed. In this example, we use a `Mistral-Large-2411` deployment in the [Foundry Models](../../../ai-foundry/model-inference/overview.md).
3535
* Python 3.9 or later installed, including pip.
3636
* LangChain installed. You can do it with:
3737

@@ -76,7 +76,7 @@ Once configured, create a client to connect with the chat model by using the `in
7676
```python
7777
from langchain.chat_models import init_chat_model
7878
79-
llm = init_chat_model(model="mistral-medium-2505", model_provider="azure_ai")
79+
llm = init_chat_model(model="Mistral-Large-2411", model_provider="azure_ai")
8080
```
8181
8282
You can also use the class `AzureAIChatCompletionsModel` directly.
@@ -97,7 +97,7 @@ from langchain_azure_ai.chat_models import AzureAIChatCompletionsModel
9797
model = AzureAIChatCompletionsModel(
9898
endpoint=os.environ["AZURE_INFERENCE_ENDPOINT"],
9999
credential=DefaultAzureCredential(),
100-
model="mistral-medium-2505",
100+
model="Mistral-Large-2411",
101101
)
102102
```
103103
@@ -115,7 +115,7 @@ from langchain_azure_ai.chat_models import AzureAIChatCompletionsModel
115115
model = AzureAIChatCompletionsModel(
116116
endpoint=os.environ["AZURE_INFERENCE_ENDPOINT"],
117117
credential=DefaultAzureCredentialAsync(),
118-
model="mistral-medium-2505",
118+
model="Mistral-Large-2411",
119119
)
120120
```
121121
@@ -169,7 +169,7 @@ chain.invoke({"language": "italian", "text": "hi"})
169169
170170
Models deployed to Azure AI Foundry support the Model Inference API, which is standard across all the models. Chain multiple LLM operations based on the capabilities of each model so you can optimize for the right model based on capabilities.
171171
172-
In the following example, we create two model clients. One is a producer and another one is a verifier. To make the distinction clear, we're using a multi-model endpoint like the [Foundry Models API](../../model-inference/overview.md) and hence we're passing the parameter `model` to use a `Mistral-Medium` and a `Mistral-Small` model, quoting the fact that **producing content is more complex than verifying it**.
172+
In the following example, we create two model clients. One is a producer and another one is a verifier. To make the distinction clear, we're using a multi-model endpoint like the [Foundry Models API](../../model-inference/overview.md) and hence we're passing the parameter `model` to use a `Mistral-Large` and a `Mistral-Small` model, quoting the fact that **producing content is more complex than verifying it**.
173173
174174
[!notebook-python[](~/azureai-samples-main/scenarios/langchain/getting-started-with-langchain-chat-models.ipynb?name=create_producer_verifier)]
175175
-58.8 KB
Loading

articles/ai-services/luis/includes/deprecation-notice.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,8 +5,8 @@ ms.author: lajanuar
55
manager: nitinme
66
ms.service: azure-ai-language
77
ms.topic: include
8-
ms.date: 06/12/2025
8+
ms.date: 06/26/2025
99
---
1010

1111
> [!IMPORTANT]
12-
> LUIS will be retired on October 1st 2025 and starting April 1st 2023 you will not be able to create new LUIS resources. We recommend [migrating your LUIS applications](../../language-service/conversational-language-understanding/how-to/migrate-from-luis.md) to [conversational language understanding](../../language-service/conversational-language-understanding/overview.md) to benefit from continued product support and multilingual capabilities.
12+
> Language Understanding Intelligent Service (LUIS) will be fully retired on March 31, 2026. LUIS resource creation isn't available. Beginning on October 31, 2025, the LUIS portal will no longer be available. We recommend [migrating your LUIS applications](../../language-service/conversational-language-understanding/how-to/migrate-from-luis.md) to [conversational language understanding](../../language-service/conversational-language-understanding/overview.md) to benefit from continued product support and multilingual capabilities.

articles/ai-services/openai/azure-government.md

Lines changed: 18 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ title: Azure OpenAI in Azure Government
33
titleSuffix: Azure OpenAI
44
description: Learn how to use Azure OpenAI in the Azure Government cloud.
55
author: challenp
6-
ms.date: 5/29/2025
6+
ms.date: 6/25/2025
77
ms.service: azure-ai-openai
88
ms.topic: how-to
99
ms.custom:
@@ -25,17 +25,21 @@ The following sections show model availability by region and deployment type. Mo
2525

2626
<br>
2727

28-
## Standard deployment model availability
29-
| **Region** | **o3-mini USGov DataZone** | **gpt-4o**, **2024-05-13** | **gpt-4o-mini**, **2024-07-18** | **gpt-35-turbo**, **0125** | **text-embedding-3-large**, **1** | **text-embedding-3-small**, **1** | **text-embedding-ada-002**, **2** |
30-
|:--------------|:--------------------------:|:--------------------------:|:-------------------------------:|:--------------------------:|:---------------------------------:|:---------------------------------:|:---------------------------------:|
31-
| usgovarizona ||||||||
32-
| usgovvirginia ||| - || - | - ||
28+
### USGov DataZone
29+
Data zone deployments are available in the same Azure OpenAI resource as all other Azure OpenAI deployment types but allow you to leverage Azure Government infrastructure to dynamically route traffic to the data center within the USGov data zone with the best availability for each request.
3330

3431
* USGov DataZone provides access to the model from both usgovarizona and usgovvirginia.
3532
* Data stored at rest remains in the designated Azure region of the resource.
36-
* Data may be processed for inferencing in either of the two Azure Government regions.
33+
* Data may be processed for inferencing in either of the two Azure Government regions.
3734

38-
Data zone standard deployments are available in the same Azure OpenAI resource as all other Azure OpenAI deployment types but allow you to leverage Azure Government infrastructure to dynamically route traffic to the data center within the USGov data zone with the best availability for each request.
35+
<br>
36+
37+
### Standard deployment model availability
38+
| **Region** | **o3-mini** | **gpt-4o**, **2024-11-20** | **gpt-4o**, **2024-05-13** | **gpt-4o-mini**, **2024-07-18** | **gpt-35-turbo**, **0125** | **text-embedding-3-large**, **1** | **text-embedding-3-small**, **1** | **text-embedding-ada-002**, **2** |
39+
|:---------------|:--------------------------:|:--------------------------:|:--------------------------:|:-------------------------------:|:--------------------------:|:---------------------------------:|:---------------------------------:|:---------------------------------:|
40+
| usgovarizona | - | - |||||||
41+
| usgovvirginia | - ||| - || - | - ||
42+
| USGov DataZone ||| - | - | - | - | - | - |
3943

4044
To request quota increases for these models, submit a request at [https://aka.ms/AOAIGovQuota](https://aka.ms/AOAIGovQuota). Note the following maximum quota limits allowed via that form:
4145

@@ -45,11 +49,12 @@ To request quota increases for these models, submit a request at [https://aka.ms
4549

4650
<br>
4751

48-
## Provisioned deployment model availability
49-
| **Region** | **gpt-4o**, **2024-05-13** | **gpt-4o-mini**, **2024-07-18** | **gpt-35-turbo**, **0125** |
50-
|:--------------|:--------------------------:|:-------------------------------:|:--------------------------:|
51-
| usgovarizona || - ||
52-
| usgovvirginia || - ||
52+
### Provisioned deployment model availability
53+
| **Region** | **gpt-4o**, **2024-11-20** | **gpt-4o**, **2024-05-13** | **gpt-4o-mini**, **2024-07-18** | **gpt-35-turbo**, **0125** |
54+
|:---------------|:--------------------------:|:--------------------------:|:-------------------------------:|:--------------------------:|
55+
| usgovarizona | - || - ||
56+
| usgovvirginia ||| - ||
57+
| USGov DataZone || - | - | - |
5358

5459
<br>
5560

articles/ai-services/openai/includes/fine-tune-models.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -45,10 +45,12 @@ ms.custom:
4545
>- Poland Central
4646
>- Southeast Asia
4747
>- South Africa North
48+
>- South Central US
4849
>- Spain Central
4950
>- Sweden Central
5051
>- Switzerland West
5152
>- Switzerland North
5253
>- UK South
54+
>- West Europe
5355
>- West US
5456
>- West US3

articles/ai-services/openai/realtime-audio-reference.md

Lines changed: 9 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -1,26 +1,25 @@
11
---
2-
title: Azure OpenAI in Azure AI Foundry Models Realtime API Reference
2+
title: Audio events reference
33
titleSuffix: Azure OpenAI
4-
description: Learn how to use the Realtime API to interact with the Azure OpenAI in real-time.
4+
description: Learn how to use events with the Realtime API and Voice Live API.
55
manager: nitinme
66
ms.service: azure-ai-openai
77
ms.topic: conceptual
8-
ms.date: 4/28/2025
8+
ms.date: 6/27/2025
99
author: eric-urban
1010
ms.author: eur
1111
recommendations: false
1212
---
1313

14-
# Realtime events reference
14+
# Audio events reference
1515

16-
[!INCLUDE [Feature preview](includes/preview-feature.md)]
16+
Realtime events are used to communicate between the client and server in real-time audio applications. The events are sent as JSON objects over various endpoints, such as WebSockets or WebRTC. The events are used to manage the conversation, audio buffers, and responses in real-time.
1717

18-
The Realtime API is a WebSocket-based API that allows you to interact with the Azure OpenAI in real-time.
18+
You can use audio client and server events with these APIs:
19+
- [Azure OpenAI Realtime API](/azure/ai-services/openai/realtime-audio-quickstart)
20+
- [Azure AI Voice Live API](/azure/ai-services/speech-service/voice-live)
1921

20-
The Realtime API (via `/realtime`) is built on [the WebSockets API](https://developer.mozilla.org/docs/Web/API/WebSockets_API) to facilitate fully asynchronous streaming communication between the end user and model. Device details like capturing and rendering audio data are outside the scope of the Realtime API. It should be used in the context of a trusted, intermediate service that manages both connections to end users and model endpoint connections. Don't use it directly from untrusted end user devices.
21-
22-
> [!TIP]
23-
> To get started with the Realtime API, see the [quickstart](realtime-audio-quickstart.md) and [how-to guide](./how-to/realtime-audio.md).
22+
Unless otherwise specified, the events described in this document are applicable to both APIs.
2423

2524
## Client events
2625

articles/ai-services/openai/toc.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -441,7 +441,7 @@ items:
441441
displayName: RAG, rag
442442
- name: Azure OpenAI monitoring data reference
443443
href: monitor-openai-reference.md
444-
- name: Realtime API (preview) events reference
444+
- name: Audio events reference
445445
href: realtime-audio-reference.md
446446
- name: Resources
447447
items:

articles/ai-services/qnamaker/includes/new-version.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,8 +5,8 @@ ms.topic: include
55
ms.custom: include file
66
ms.service: azure-ai-language
77
ms.subservice: azure-ai-qna-maker
8-
ms.date: 06/12/2025
8+
ms.date: 06/26/2025
99
---
1010

1111
> [!NOTE]
12-
> The QnA Maker service is being retired on the 31st of March, 2025. A newer version of the question and answering capability is now available as part of [Azure AI Language](../../language-service/index.yml). For question answering capabilities within the Language Service, see [question answering](../../language-service/question-answering/overview.md). Starting 1st October, 2022 you won't be able to create new QnA Maker resources. For information on migrating existing QnA Maker knowledge bases to question answering, consult the [migration guide](../../language-service/question-answering/how-to/migrate-qnamaker.md).
12+
> The QnA Maker service is being retired on the October 31, 2025 (extended from March 31, 2025). A newer version of the question and answering capability is now available as part of [Azure AI Language](../../language-service/index.yml). For question answering capabilities within the Language Service, see [question answering](../../language-service/question-answering/overview.md). As of October 1, 2022, you're no longer able to create new QnA Maker resources. Beginning on March 31, 2025, the QnA Maker portal is no longer available. For information on migrating existing QnA Maker knowledge bases to question answering, consult the [migration guide](../../language-service/question-answering/how-to/migrate-qnamaker.md).

articles/ai-services/speech-service/includes/quickstarts/voice-live-api/realtime-python.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ author: eric-urban
44
ms.author: eur
55
ms.service: azure-ai-openai
66
ms.topic: include
7-
ms.date: 5/19/2025
7+
ms.date: 6/27/2025
88
---
99

1010
## Prerequisites
@@ -151,6 +151,7 @@ For the recommended keyless authentication with Microsoft Entra ID, you need to:
151151
session_update = {
152152
"type": "session.update",
153153
"session": {
154+
"instructions": "You are a helpful AI assistant responding in natural, engaging language.",
154155
"turn_detection": {
155156
"type": "azure_semantic_vad",
156157
"threshold": 0.3,
@@ -170,7 +171,7 @@ For the recommended keyless authentication with Microsoft Entra ID, you need to:
170171
"type": "server_echo_cancellation"
171172
},
172173
"voice": {
173-
"name": "en-US-Aria:DragonHDLatestNeural",
174+
"name": "en-US-Ava:DragonHDLatestNeural",
174175
"type": "azure-standard",
175176
"temperature": 0.8,
176177
},
@@ -417,7 +418,7 @@ For the recommended keyless authentication with Microsoft Entra ID, you need to:
417418
The output of the script is printed to the console. You see messages indicating the status of the connection, audio stream, and playback. The audio is played back through your speakers or headphones.
418419

419420
```text
420-
Session created: {"type": "session.update", "session": {"turn_detection": {"type": "azure_semantic_vad", "threshold": 0.3, "prefix_padding_ms": 200, "silence_duration_ms": 200, "remove_filler_words": false, "end_of_utterance_detection": {"model": "semantic_detection_v1", "threshold": 0.1, "timeout": 4}}, "input_audio_noise_reduction": {"type": "azure_deep_noise_suppression"}, "input_audio_echo_cancellation": {"type": "server_echo_cancellation"}, "voice": {"name": "en-US-Aria:DragonHDLatestNeural", "type": "azure-standard", "temperature": 0.8}}, "event_id": ""}
421+
Session created: {"type": "session.update", "session": {"instructions": "You are a helpful AI assistant responding in natural, engaging language.","turn_detection": {"type": "azure_semantic_vad", "threshold": 0.3, "prefix_padding_ms": 200, "silence_duration_ms": 200, "remove_filler_words": false, "end_of_utterance_detection": {"model": "semantic_detection_v1", "threshold": 0.1, "timeout": 4}}, "input_audio_noise_reduction": {"type": "azure_deep_noise_suppression"}, "input_audio_echo_cancellation": {"type": "server_echo_cancellation"}, "voice": {"name": "en-US-Ava:DragonHDLatestNeural", "type": "azure-standard", "temperature": 0.8}}, "event_id": ""}
421422
Starting the chat ...
422423
Received event: {'session.created'}
423424
Press 'q' and Enter to quit the chat.

articles/ai-services/speech-service/toc.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -239,7 +239,7 @@ items:
239239
href: voice-live-quickstart.md
240240
- name: How to use Voice Live API
241241
href: voice-live-how-to.md
242-
- name: Realtime API events reference documentation
242+
- name: Audio events reference
243243
href: /azure/ai-services/openai/realtime-audio-reference?context=/azure/ai-services/speech-service/context/context
244244
- name: Intent recognition
245245
items:

0 commit comments

Comments
 (0)