articles/ai-services/speech-service/includes/quickstarts/voice-live-agents/python.md
Lines changed: 6 additions & 6 deletions
@@ -1,10 +1,10 @@
 ---
 manager: nitinme
-author: PatrickFarley
-ms.author: pafarley
+author: goergenj
+ms.author: jagoerge
 ms.service: azure-ai-openai
 ms.topic: include
-ms.date: 7/31/2025
+ms.date: 9/26/2025
 ---

 In this article, you learn how to use Azure AI Speech voice live with [Azure AI Foundry Agent Service](/azure/ai-foundry/agents/overview) using Python code.
@@ -165,7 +165,7 @@ client = AsyncAzureVoiceLive(
 endpoint = os.environ.get("AZURE_VOICE_LIVE_ENDPOINT") or "<https://your-endpoint.azure.com/>"
 agent_id = os.environ.get("AI_FOUNDRY_AGENT_ID") or "<your-agent-id>"
 project_name = os.environ.get("AI_FOUNDRY_PROJECT_NAME") or "<your-project-name>"
-api_version = os.environ.get("AZURE_VOICE_LIVE_API_VERSION") or "2025-05-01-preview"
+api_version = os.environ.get("AZURE_VOICE_LIVE_API_VERSION") or "2025-10-01"
 api_key = os.environ.get("AZURE_VOICE_LIVE_API_KEY") or "<your-api-key>"

 # For the recommended keyless authentication, get and
@@ -669,7 +669,7 @@ The `voicelive.log` file contains information about the connection to the Voice

 ```text
 2025-07-29 09:43:32,574:websockets.client:DEBUG:= connection is CONNECTING
-2025-07-29 09:43:32,825:websockets.client:DEBUG:> GET /voice-live/realtime?api-version=2025-05-01-preview&agent-project-name=contoso-proj-agentic&agent-id=<your-agent-id>&agent-access-token=<your-token>&debug=on HTTP/1.1
+2025-07-29 09:43:32,825:websockets.client:DEBUG:> GET /voice-live/realtime?api-version=2025-10-01&agent-project-name=contoso-proj-agentic&agent-id=<your-agent-id>&agent-access-token=<your-token>&debug=on HTTP/1.1
articles/ai-services/speech-service/includes/quickstarts/voice-live-agents/resource-authentication.md
Lines changed: 6 additions & 6 deletions
@@ -1,9 +1,9 @@
 ---
-author: PatrickFarley
-ms.author: pafarley
+author: goergenj
+ms.author: jagoerge
 ms.service: azure-ai-speech
 ms.topic: include
-ms.date: 7/31/2025
+ms.date: 9/26/2025
 ---

 Create a new file named `.env` in the folder where you want to run the code.
@@ -14,7 +14,7 @@ In the `.env` file, add the following environment variables for authentication:
 AZURE_VOICE_LIVE_ENDPOINT=<your_endpoint>
 AI_FOUNDRY_PROJECT_NAME=<your_project_name>
 AI_FOUNDRY_AGENT_ID=<your_agent_id>
-AZURE_VOICE_LIVE_API_VERSION=2025-05-01-preview
+AZURE_VOICE_LIVE_API_VERSION=2025-10-01
 AZURE_VOICE_LIVE_API_KEY=<your_api_key> # Only required if using API key authentication
 ```

@@ -27,7 +27,7 @@ Replace the default values with your actual project name, agent ID, API version,
 |`AZURE_VOICE_LIVE_ENDPOINT`| This value can be found in the **Keys and Endpoint** section when examining your resource from the Azure portal. |
 |`AI_FOUNDRY_PROJECT_NAME`| The name of your Azure AI Foundry project. |
 |`AI_FOUNDRY_AGENT_ID`| The ID of your Azure AI Foundry agent. |
-|`AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-05-01-preview`. |
+|`AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-10-01`. |

 Learn more about [keyless authentication](/azure/ai-services/authentication) and [setting environment variables](/azure/ai-services/cognitive-services-environment-variables).

@@ -38,7 +38,7 @@ Learn more about [keyless authentication](/azure/ai-services/authentication) and
 |`AZURE_VOICE_LIVE_ENDPOINT`| This value can be found in the **Keys and Endpoint** section when examining your resource from the Azure portal. |
 |`AI_FOUNDRY_PROJECT_NAME`| The name of your Azure AI Foundry project. |
 |`AI_FOUNDRY_AGENT_ID`| The ID of your Azure AI Foundry agent. |
-|`AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-05-01-preview`. |
+|`AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-10-01`. |
 |`AZURE_VOICE_LIVE_API_KEY`| This value can be found in the **Keys and Endpoint** section when examining your resource from the Azure portal. You can use either `KEY1` or `KEY2`.|

 Learn more about [finding API keys](/azure/ai-services/cognitive-services-environment-variables) and [setting environment variables](/azure/ai-services/cognitive-services-environment-variables).
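
The `.env` entries touched in this file are plain `KEY=value` lines, and the API key line carries a trailing `#` comment. As a minimal sketch, assuming you want to read such text without a third-party loader, the hypothetical helper `parse_env` below (it is not part of the quickstart) turns the lines into a dictionary and falls back to `2025-10-01` when no version is set, mirroring the `or` fallback in the Python sample.

```python
def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=value lines (hypothetical helper, not from the quickstart)."""
    values: dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        # Skip blank lines, full-line comments, and anything without "=".
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop a trailing " # comment". Assumption: values never contain " #".
        value = value.split(" #", 1)[0].strip()
        values[key.strip()] = value
    return values


# Sample text copied from the .env block in this diff.
SAMPLE = """\
AZURE_VOICE_LIVE_ENDPOINT=<your_endpoint>
AI_FOUNDRY_PROJECT_NAME=<your_project_name>
AI_FOUNDRY_AGENT_ID=<your_agent_id>
AZURE_VOICE_LIVE_API_VERSION=2025-10-01
AZURE_VOICE_LIVE_API_KEY=<your_api_key> # Only required if using API key authentication
"""

settings = parse_env(SAMPLE)
api_version = settings.get("AZURE_VOICE_LIVE_API_VERSION") or "2025-10-01"
```

A dedicated loader handles quoting and escapes that this sketch ignores, so treat it as an illustration of the file layout rather than a replacement.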
articles/ai-services/speech-service/includes/quickstarts/voice-live-api/resource-authentication.md
Lines changed: 6 additions & 6 deletions
@@ -1,9 +1,9 @@
 ---
-author: PatrickFarley
-ms.author: pafarley
+author: goergenj
+ms.author: jagoerge
 ms.service: azure-ai-speech
 ms.topic: include
-ms.date: 7/31/2025
+ms.date: 9/26/2025
 ---

 Create a new file named `.env` in the folder where you want to run the code.
@@ -13,7 +13,7 @@ In the `.env` file, add the following environment variables for authentication:
 ```plaintext
 AZURE_VOICE_LIVE_ENDPOINT=<your_endpoint>
 VOICE_LIVE_MODEL=<your_model>
-AZURE_VOICE_LIVE_API_VERSION=2025-05-01-preview
+AZURE_VOICE_LIVE_API_VERSION=2025-10-01
 AZURE_VOICE_LIVE_API_KEY=<your_api_key> # Only required if using API key authentication
 ```

@@ -25,7 +25,7 @@ Replace the default values with your actual endpoint, model, API version, and AP
 |--------------------------|-------------|
 |`AZURE_VOICE_LIVE_ENDPOINT`| This value can be found in the **Keys and Endpoint** section when examining your resource from the Azure portal. |
 |`AZURE_VOICE_LIVE_MODEL`| The model you want to use. For example, `gpt-4o` or `gpt-4o-mini-realtime-preview`. For more information about models availability, see the [Voice Live API overview documentation](../../../voice-live.md). |
-|`AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-05-01-preview`. |
+|`AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-10-01`. |

 Learn more about [keyless authentication](/azure/ai-services/authentication) and [setting environment variables](/azure/ai-services/cognitive-services-environment-variables).

@@ -35,7 +35,7 @@ Learn more about [keyless authentication](/azure/ai-services/authentication) and
 |--------------------------|-------------|
 |`AZURE_VOICE_LIVE_ENDPOINT`| This value can be found in the **Keys and Endpoint** section when examining your resource from the Azure portal. |
 |`AZURE_VOICE_LIVE_MODEL`| The model you want to use. For example, `gpt-4o` or `gpt-4o-mini-realtime-preview`. For more information about models availability, see the [Voice Live API overview documentation](../../../voice-live.md). |
-|`AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-05-01-preview`. |
+|`AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-10-01`. |
 |`AZURE_VOICE_LIVE_API_KEY`| This value can be found in the **Keys and Endpoint** section when examining your resource from the Azure portal. You can use either `KEY1` or `KEY2`.|

 Learn more about [finding API keys](/azure/ai-services/cognitive-services-environment-variables) and [setting environment variables](/azure/ai-services/cognitive-services-environment-variables).
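
Because this change swaps the example API version in several places, a quick check of an existing configuration can help spot a leftover preview value. The sketch below is an illustration only: `check_settings`, `OLD_VERSION`, and `NEW_VERSION` are hypothetical names, and it assumes that a missing endpoint and a lingering `2025-05-01-preview` value are the two things worth flagging. The API key is left out because the `.env` comment marks it as optional.

```python
from typing import Mapping

OLD_VERSION = "2025-05-01-preview"  # value removed in this change
NEW_VERSION = "2025-10-01"          # value added in this change


def check_settings(env: Mapping[str, str]) -> list[str]:
    """Return human-readable findings; an empty list means nothing was flagged."""
    problems: list[str] = []
    if not env.get("AZURE_VOICE_LIVE_ENDPOINT"):
        problems.append("AZURE_VOICE_LIVE_ENDPOINT is not set")
    # Same fallback style as the quickstart: use the new version when unset.
    version = env.get("AZURE_VOICE_LIVE_API_VERSION") or NEW_VERSION
    if version == OLD_VERSION:
        problems.append(f"AZURE_VOICE_LIVE_API_VERSION is still {OLD_VERSION}")
    return problems


stale = check_settings({
    "AZURE_VOICE_LIVE_ENDPOINT": "<your_endpoint>",
    "AZURE_VOICE_LIVE_API_VERSION": "2025-05-01-preview",
})
```

Passing `os.environ` instead of a literal dictionary would run the same check against the current process environment.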
-| centralindia | Cross-region<sup>1</sup> | Cross-region<sup>1</sup> |Cross-region<sup>1</sup> |Global standard | Global standard | Global standard | Global standard | - | - | - | - | - | - |
-| eastus2 | Global standard | Global standard |Global standard |Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Global standard | Regional | Regional |
-| southeastasia | - | - | - | - |- |Global standard | Global standard | - | - | - | - | Regional | Regional |
-| swedencentral | Global standard | Global standard |Global standard |Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Global standard | Regional | Regional |
-| westus2 | Cross-region<sup>2</sup> | Cross-region<sup>2</sup> |Cross-region<sup>2</sup> |Data zone standard | Data zone standard | Data zone standard | Data zone standard | - | - | - | - | Regional | Regional |
-|australiaeast| - | - |- |Global standard | Global standard | Global standard | Global standard | - | - | - | - | - | - |
-|japaneast| - | - |- |Global standard | Global standard | Global standard | Global standard | - | - | - | - | Regional | Regional |
-|eastus| - | - |- |Data zone standard | Data zone standard | Data zone standard | Data zone standard | - | - | - | - | - | - |
-|uksouth| - | - |- |Global standard | Global standard | Global standard | Global standard | - | - | - | - | - | - |
-|westeurope| - | - |- |Data zone standard | Data zone standard | Data zone standard | Data zone standard | - | - | - | - | - | - |
+| centralindia | Cross-region<sup>1</sup> | Cross-region<sup>1</sup> | Global standard | Global standard | Global standard | Global standard | - | - | - | - | - | - |
+| eastus2 | Global standard | Global standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Global standard | Regional | Regional |
+| southeastasia | - | - | - | - | Global standard | Global standard | - | - | - | - | Regional | Regional |
+| swedencentral | Global standard | Global standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Global standard | Regional | Regional |
+| westus2 | Cross-region<sup>2</sup> | Cross-region<sup>2</sup> | Data zone standard | Data zone standard | Data zone standard | Data zone standard | - | - | - | - | Regional | Regional |
+|australiaeast| - | - | Global standard | Global standard | Global standard | Global standard | - | - | - | - | - | - |
+|japaneast| - | - | Global standard | Global standard | Global standard | Global standard | - | - | - | - | Regional | Regional |
+|eastus| - | - | Data zone standard | Data zone standard | Data zone standard | Data zone standard | - | - | - | - | - | - |
+|uksouth| - | - | Global standard | Global standard | Global standard | Global standard | - | - | - | - | - | - |
+|westeurope| - | - | Data zone standard | Data zone standard | Data zone standard | Data zone standard | - | - | - | - | - | - |

 <sup>1</sup> The Azure AI Foundry resource must be in Central India. Azure AI Speech features remain in Central India. The voice live API uses Sweden Central as needed for generative AI load balancing.
articles/ai-services/speech-service/voice-live-how-to.md
Lines changed: 2 additions & 2 deletions
@@ -30,10 +30,10 @@ An [Azure AI Foundry resource](../multi-service-resource.md) is required to acce

 ### WebSocket endpoint

-The WebSocket endpoint for the voice live API is `wss://<your-ai-foundry-resource-name>.services.ai.azure.com/voice-live/realtime?api-version=2025-05-01-preview` or, for older resources, `wss://<your-ai-foundry-resource-name>.cognitiveservices.azure.com/voice-live/realtime?api-version=2025-05-01-preview`.
+The WebSocket endpoint for the voice live API is `wss://<your-ai-foundry-resource-name>.services.ai.azure.com/voice-live/realtime?api-version=2025-10-01` or, for older resources, `wss://<your-ai-foundry-resource-name>.cognitiveservices.azure.com/voice-live/realtime?api-version=2025-10-01`.
 The endpoint is the same for all models. The only difference is the required `model` query parameter, or, when using the Agent service, the `agent_id` and `project_id` parameters.

-For example, an endpoint for a resource with a custom domain would be `wss://<your-ai-foundry-resource-name>.services.ai.azure.com/voice-live/realtime?api-version=2025-05-01-preview&model=gpt-realtime`
+For example, an endpoint for a resource with a custom domain would be `wss://<your-ai-foundry-resource-name>.services.ai.azure.com/voice-live/realtime?api-version=2025-10-01&model=gpt-realtime`
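
The endpoint pattern described in this hunk can be assembled programmatically. The following is a minimal sketch, not code from the documentation: `build_voice_live_url` and its parameters are hypothetical names, `contoso-resource` is a placeholder resource name, and only the `api-version` and `model` query parameters are covered (the agent parameters are omitted).

```python
from typing import Optional
from urllib.parse import urlencode


def build_voice_live_url(
    resource_name: str,
    api_version: str = "2025-10-01",
    model: Optional[str] = None,
    legacy_domain: bool = False,
) -> str:
    """Build the voice live WebSocket URL (hypothetical helper)."""
    # Older resources use the cognitiveservices.azure.com domain.
    host_suffix = (
        "cognitiveservices.azure.com" if legacy_domain else "services.ai.azure.com"
    )
    params = {"api-version": api_version}
    if model is not None:
        params["model"] = model
    query = urlencode(params)
    return f"wss://{resource_name}.{host_suffix}/voice-live/realtime?{query}"


url = build_voice_live_url("contoso-resource", model="gpt-realtime")
legacy_url = build_voice_live_url("contoso-resource", legacy_domain=True)
```

Keeping the version as a default argument means a later version bump touches one place instead of every URL string.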
articles/ai-services/speech-service/voice-live-language-support.md
Lines changed: 4 additions & 4 deletions
@@ -22,7 +22,7 @@ The voice live API supports multiple languages and configuration options. In thi

 ## [Speech input](#tab/speechinput)

-Depending on which model is being used voice live speech input is processed either by one of the multimodal models (for example, `gpt-realtime`, `gpt-4o-realtime-preview`, `gpt-4o-mini-realtime-preview`, and `phi4-mm-realtime`) or by `azure speech to text` models.
+Depending on which model is being used voice live speech input is processed either by one of the multimodal models (for example, `gpt-realtime`, `gpt-4o-mini-realtime-preview`, and `phi4-mm-realtime`) or by `azure speech to text` models.

 ### Azure speech to text supported languages

@@ -78,11 +78,11 @@ To configure a single or multiple languages not supported by the multimodal mode
 }
 ```

-### gpt-realtime, gpt-4o-realtime-preview and gpt-4o-mini-realtime-preview supported languages
+### gpt-realtime and gpt-4o-mini-realtime-preview supported languages

 While the underlying model was trained on 98 languages, OpenAI only lists the languages that exceeded <50% word error rate (WER) which is an industry standard benchmark for speech to text model accuracy. The model returns results for languages not listed but the quality will be low.

-The following languages are supported by `gpt-realtime`, `gpt-4o-realtime-preview` and `gpt-4o-mini-realtime-preview`:
+The following languages are supported by `gpt-realtime` and `gpt-4o-mini-realtime-preview`:
 - Afrikaans
 - Arabic
 - Armenian
@@ -175,7 +175,7 @@ Multimodal models don't require a language configuration for the general process

 ## [Speech output](#tab/speechoutput)

-Depending on which model is being used voice live speech output is processed either by one of the multimodal OpenAI voices integrated into `gpt-realtime`, `gpt-4o-realtime-preview`, and `gpt-4o-mini-realtime-preview` or by `azure text to speech` voices.
+Depending on which model is being used voice live speech output is processed either by one of the multimodal OpenAI voices integrated into `gpt-realtime` and `gpt-4o-mini-realtime-preview` or by `azure text to speech` voices.