Commit e4f46a8

Merge pull request #7109 from goergenj/voicelive-customization-ga
Voicelive customization ga
2 parents 095be79 + 5ba7a08 commit e4f46a8

11 files changed, 220 additions and 109 deletions

articles/ai-services/speech-service/includes/quickstarts/voice-live-agents/python.md

Lines changed: 6 additions & 6 deletions
@@ -1,10 +1,10 @@
 ---
 manager: nitinme
-author: PatrickFarley
-ms.author: pafarley
+author: goergenj
+ms.author: jagoerge
 ms.service: azure-ai-openai
 ms.topic: include
-ms.date: 7/31/2025
+ms.date: 9/26/2025
 ---
 
 In this article, you learn how to use Azure AI Speech voice live with [Azure AI Foundry Agent Service](/azure/ai-foundry/agents/overview) using Python code.
@@ -165,7 +165,7 @@ client = AsyncAzureVoiceLive(
 endpoint = os.environ.get("AZURE_VOICE_LIVE_ENDPOINT") or "<https://your-endpoint.azure.com/>"
 agent_id = os.environ.get("AI_FOUNDRY_AGENT_ID") or "<your-agent-id>"
 project_name = os.environ.get("AI_FOUNDRY_PROJECT_NAME") or "<your-project-name>"
-api_version = os.environ.get("AZURE_VOICE_LIVE_API_VERSION") or "2025-05-01-preview"
+api_version = os.environ.get("AZURE_VOICE_LIVE_API_VERSION") or "2025-10-01"
 api_key = os.environ.get("AZURE_VOICE_LIVE_API_KEY") or "<your-api-key>"
 
 # For the recommended keyless authentication, get and
@@ -669,7 +669,7 @@ The `voicelive.log` file contains information about the connection to the Voice
 
 ```text
 2025-07-29 09:43:32,574:websockets.client:DEBUG:= connection is CONNECTING
-2025-07-29 09:43:32,825:websockets.client:DEBUG:> GET /voice-live/realtime?api-version=2025-05-01-preview&agent-project-name=contoso-proj-agentic&agent-id=<your-agent-id>&agent-access-token=<your-token>&debug=on HTTP/1.1
+2025-07-29 09:43:32,825:websockets.client:DEBUG:> GET /voice-live/realtime?api-version=2025-10-01&agent-project-name=contoso-proj-agentic&agent-id=<your-agent-id>&agent-access-token=<your-token>&debug=on HTTP/1.1
 2025-07-29 09:43:32,825:websockets.client:DEBUG:> Host: your-ai-foundry-resource.cognitiveservices.azure.com
 2025-07-29 09:43:32,825:websockets.client:DEBUG:> Upgrade: websocket
 2025-07-29 09:43:32,825:websockets.client:DEBUG:> Connection: Upgrade
@@ -864,7 +864,7 @@ def main() -> None:
 endpoint = os.environ.get("AZURE_VOICE_LIVE_ENDPOINT") or "<https://your-endpoint.azure.com/>"
 agent_id = os.environ.get("AI_FOUNDRY_AGENT_ID") or "<your-agent-id>"
 agent_connection_string = os.environ.get("AI_FOUNDRY_AGENT_CONNECTION_STRING") or "<your-agent-connection-string>"
-api_version = os.environ.get("AZURE_VOICE_LIVE_API_VERSION") or "2025-05-01-preview"
+api_version = os.environ.get("AZURE_VOICE_LIVE_API_VERSION") or "2025-10-01"
 api_key = os.environ.get("AZURE_VOICE_LIVE_API_KEY") or "<your-api-key>"
 
 # For the recommended keyless authentication, get and
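
Reviewer note: the log excerpt changed in this file shows the query string the client sends when it opens the WebSocket. As a minimal sketch (not the quickstart's own code), the same path and query could be assembled from the variables above as follows. The helper name `build_realtime_path` and the sample agent ID are hypothetical; the parameter names are copied from the logged `GET` line, and `agent-access-token` and `debug` are left out for brevity.

```python
import os
from urllib.parse import urlencode


def build_realtime_path(api_version: str, project_name: str, agent_id: str) -> str:
    """Return the request path and query string seen in the logged GET line."""
    query = urlencode(
        {
            "api-version": api_version,
            "agent-project-name": project_name,
            "agent-id": agent_id,
        }
    )
    return f"/voice-live/realtime?{query}"


# Same fallback pattern as the quickstart: "or" also covers an empty variable.
api_version = os.environ.get("AZURE_VOICE_LIVE_API_VERSION") or "2025-10-01"
print(build_realtime_path(api_version, "contoso-proj-agentic", "my-agent-id"))
```

Because the quickstart uses `os.environ.get(...) or default` rather than `os.environ.get(..., default)`, a variable that is set but empty also falls back to `2025-10-01`.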

articles/ai-services/speech-service/includes/quickstarts/voice-live-agents/resource-authentication.md

Lines changed: 6 additions & 6 deletions
@@ -1,9 +1,9 @@
 ---
-author: PatrickFarley
-ms.author: pafarley
+author: goergenj
+ms.author: jagoerge
 ms.service: azure-ai-speech
 ms.topic: include
-ms.date: 7/31/2025
+ms.date: 9/26/2025
 ---
 
 Create a new file named `.env` in the folder where you want to run the code.
@@ -14,7 +14,7 @@ In the `.env` file, add the following environment variables for authentication:
 AZURE_VOICE_LIVE_ENDPOINT=<your_endpoint>
 AI_FOUNDRY_PROJECT_NAME=<your_project_name>
 AI_FOUNDRY_AGENT_ID=<your_agent_id>
-AZURE_VOICE_LIVE_API_VERSION=2025-05-01-preview
+AZURE_VOICE_LIVE_API_VERSION=2025-10-01
 AZURE_VOICE_LIVE_API_KEY=<your_api_key> # Only required if using API key authentication
 ```
 
@@ -27,7 +27,7 @@ Replace the default values with your actual project name, agent ID, API version,
 | `AZURE_VOICE_LIVE_ENDPOINT` | This value can be found in the **Keys and Endpoint** section when examining your resource from the Azure portal. |
 | `AI_FOUNDRY_PROJECT_NAME` | The name of your Azure AI Foundry project. |
 | `AI_FOUNDRY_AGENT_ID` | The ID of your Azure AI Foundry agent. |
-| `AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-05-01-preview`. |
+| `AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-10-01`. |
 
 Learn more about [keyless authentication](/azure/ai-services/authentication) and [setting environment variables](/azure/ai-services/cognitive-services-environment-variables).
 
@@ -38,7 +38,7 @@ Learn more about [keyless authentication](/azure/ai-services/authentication) and
 | `AZURE_VOICE_LIVE_ENDPOINT` | This value can be found in the **Keys and Endpoint** section when examining your resource from the Azure portal. |
 | `AI_FOUNDRY_PROJECT_NAME` | The name of your Azure AI Foundry project. |
 | `AI_FOUNDRY_AGENT_ID` | The ID of your Azure AI Foundry agent. |
-| `AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-05-01-preview`. |
+| `AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-10-01`. |
 | `AZURE_VOICE_LIVE_API_KEY` | This value can be found in the **Keys and Endpoint** section when examining your resource from the Azure portal. You can use either `KEY1` or `KEY2`.|
 
 Learn more about [finding API keys](/azure/ai-services/cognitive-services-environment-variables) and [setting environment variables](/azure/ai-services/cognitive-services-environment-variables).
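
Reviewer note: the `.env` sample in this file carries an inline comment after the API key value, so whatever reads the file must strip it. The sketch below is a standard-library illustration of that parsing, not the loader the quickstart uses; `parse_env_text` is a hypothetical helper, and it assumes that no real value contains the sequence space-plus-`#`.

```python
from typing import Dict


def parse_env_text(text: str) -> Dict[str, str]:
    """Parse KEY=VALUE lines, ignoring blank lines and '#' comments."""
    values: Dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop a trailing " # comment" such as the one on the API key line.
        value = value.split(" #", 1)[0].strip()
        values[key.strip()] = value
    return values


sample = """\
AZURE_VOICE_LIVE_ENDPOINT=<your_endpoint>
AI_FOUNDRY_PROJECT_NAME=<your_project_name>
AI_FOUNDRY_AGENT_ID=<your_agent_id>
AZURE_VOICE_LIVE_API_VERSION=2025-10-01
AZURE_VOICE_LIVE_API_KEY=<your_api_key> # Only required if using API key authentication
"""

settings = parse_env_text(sample)
print(settings["AZURE_VOICE_LIVE_API_VERSION"])
```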

articles/ai-services/speech-service/includes/quickstarts/voice-live-api/python.md

Lines changed: 2 additions & 2 deletions
@@ -602,14 +602,14 @@ The sample code in this quickstart uses either Microsoft Entra ID or an API key
 "--model",
 help="VoiceLive model to use",
 type=str,
-default=os.environ.get("VOICE_LIVE_MODEL", "gpt-4o-realtime-preview"),
+default=os.environ.get("VOICE_LIVE_MODEL", "gpt-realtime"),
 )
 
 parser.add_argument(
 "--voice",
 help="Voice to use for the assistant",
 type=str,
-default=os.environ.get("VOICE_LIVE_VOICE", "en-US-AvaNeural"),
+default=os.environ.get("VOICE_LIVE_VOICE", "en-US-Ava:DragonHDLatestNeural"),
 help="Voice to use for the assistant. E.g. alloy, echo, fable, en-US-AvaNeural, en-US-GuyNeural",
 )
 
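
Reviewer note: in the hunk above, the `--voice` call to `add_argument` carries two `help=` keywords in its context lines, and Python rejects a repeated keyword argument as a syntax error. A minimal sketch of the two options with a single `help` each, using the new defaults from this commit, might look like this. The `build_parser` wrapper and the parser description are hypothetical, and keeping the longer help string is an assumption about the authors' intent.

```python
import argparse
import os


def build_parser() -> argparse.ArgumentParser:
    """Build a parser with the --model and --voice options from the hunk."""
    parser = argparse.ArgumentParser(description="Voice live sample options")
    parser.add_argument(
        "--model",
        help="VoiceLive model to use",
        type=str,
        default=os.environ.get("VOICE_LIVE_MODEL", "gpt-realtime"),
    )
    parser.add_argument(
        "--voice",
        # Only one help= keyword; this keeps the longer of the two strings.
        help="Voice to use for the assistant. E.g. alloy, echo, fable, en-US-AvaNeural, en-US-GuyNeural",
        type=str,
        default=os.environ.get("VOICE_LIVE_VOICE", "en-US-Ava:DragonHDLatestNeural"),
    )
    return parser


args = build_parser().parse_args(["--voice", "alloy"])
print(args.model, args.voice)
```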

articles/ai-services/speech-service/includes/quickstarts/voice-live-api/resource-authentication.md

Lines changed: 6 additions & 6 deletions
@@ -1,9 +1,9 @@
 ---
-author: PatrickFarley
-ms.author: pafarley
+author: goergenj
+ms.author: jagoerge
 ms.service: azure-ai-speech
 ms.topic: include
-ms.date: 7/31/2025
+ms.date: 9/26/2025
 ---
 
 Create a new file named `.env` in the folder where you want to run the code.
@@ -13,7 +13,7 @@ In the `.env` file, add the following environment variables for authentication:
 ```plaintext
 AZURE_VOICE_LIVE_ENDPOINT=<your_endpoint>
 VOICE_LIVE_MODEL=<your_model>
-AZURE_VOICE_LIVE_API_VERSION=2025-05-01-preview
+AZURE_VOICE_LIVE_API_VERSION=2025-10-01
 AZURE_VOICE_LIVE_API_KEY=<your_api_key> # Only required if using API key authentication
 ```
 
@@ -25,7 +25,7 @@ Replace the default values with your actual endpoint, model, API version, and AP
 |--------------------------|-------------|
 | `AZURE_VOICE_LIVE_ENDPOINT` | This value can be found in the **Keys and Endpoint** section when examining your resource from the Azure portal. |
 | `AZURE_VOICE_LIVE_MODEL` | The model you want to use. For example, `gpt-4o` or `gpt-4o-mini-realtime-preview`. For more information about models availability, see the [Voice Live API overview documentation](../../../voice-live.md). |
-| `AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-05-01-preview`. |
+| `AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-10-01`. |
 
 Learn more about [keyless authentication](/azure/ai-services/authentication) and [setting environment variables](/azure/ai-services/cognitive-services-environment-variables).
 
@@ -35,7 +35,7 @@ Learn more about [keyless authentication](/azure/ai-services/authentication) and
 |--------------------------|-------------|
 | `AZURE_VOICE_LIVE_ENDPOINT` | This value can be found in the **Keys and Endpoint** section when examining your resource from the Azure portal. |
 | `AZURE_VOICE_LIVE_MODEL` | The model you want to use. For example, `gpt-4o` or `gpt-4o-mini-realtime-preview`. For more information about models availability, see the [Voice Live API overview documentation](../../../voice-live.md). |
-| `AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-05-01-preview`. |
+| `AZURE_VOICE_LIVE_API_VERSION`| The API version you want to use. For example, `2025-10-01`. |
 | `AZURE_VOICE_LIVE_API_KEY` | This value can be found in the **Keys and Endpoint** section when examining your resource from the Azure portal. You can use either `KEY1` or `KEY2`.|
 
 Learn more about [finding API keys](/azure/ai-services/cognitive-services-environment-variables) and [setting environment variables](/azure/ai-services/cognitive-services-environment-variables).

articles/ai-services/speech-service/includes/release-notes/release-notes-tts.md

Lines changed: 2 additions & 2 deletions
@@ -15,9 +15,9 @@ Our new “DragonV2.1” model brings improvements to the naturalness of speech,
 ### June 2025 release
 
 #### VoiceLive API update
-- Support more GenAI models: GPT-4.1, GPT-4.1 Mini and GPT-4.1 Nano, Phi-4 mini and Phi-4 Multimodal models are now natively supported.
+- Support more GenAI models: GPT-4.1, GPT-4.1 Mini, Phi-4 mini and Phi-4 Multimodal models are now natively supported.
 - Support more customization capabilities
-- Azure Semantic VAD is extended to support GPT-4o-Realtime and GPT-4o-Mini-Realtime.
+- Azure Semantic VAD is extended to support GPT-Realtime and GPT-4o-Mini-Realtime.
 - Availability in more regions
 
 #### Public preview of Voice Conversion feature on selected en-US voices

articles/ai-services/speech-service/regions.md

Lines changed: 18 additions & 13 deletions
@@ -2,12 +2,12 @@
 title: Regions - Speech service
 titleSuffix: Azure AI services
 description: A list of available regions and endpoints for the Speech service, including speech to text, text to speech, and speech translation.
-author: PatrickFarley
+author: goergenj
 manager: nitinme
 ms.service: azure-ai-speech
 ms.topic: conceptual
-ms.date: 7/1/2025
-ms.author: pafarley
+ms.date: 9/16/2025
+ms.author: jagoerge
 ms.custom: references_regions
 #Customer intent: As a developer, I want to learn about the available regions and endpoints for the Speech service.
 ---
@@ -174,15 +174,20 @@ The regions in these tables support most of the core features of the Speech serv
 
 # [Voice live](#tab/voice-live)
 
-| **Region** | **gpt-4o-realtime-preview** | **gpt-4o-mini-realtime-preview** | **gpt-4o** | **gpt-4o-mini** | **gpt-4.1** | **gpt-4.1-mini** | **gpt-4.1-nano** | **phi4-mm-realtime** | **phi4-mini** |
-|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|
-| centralindia | Cross-region<sup>1</sup> | Cross-region<sup>1</sup> | Global standard | Global standard | - | Global standard | Global standard | - | - |
-| eastus2 | Global standard | Global standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Regional | Regional |
-| southeastasia | - | - | - | - | - | Global standard | Global standard | - | - |
-| swedencentral | Global standard | Global standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Regional | Regional |
-| westus2 | Cross-region<sup>2</sup> | Cross-region<sup>2</sup> | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Regional | Regional |
-
-<sup>1</sup> The Azure AI Foundry resource must be in Central India. Azure AI Speech features remain in Central India. The voice live API uses Sweden Central as needed for generative AI load balancing.
+| **Region** | **gpt-realtime** | **gpt-4o-mini-realtime-preview** (Preview) | **gpt-4o** | **gpt-4o-mini** | **gpt-4.1** | **gpt-4.1-mini** | **gpt-5** (Preview) | **gpt-5-mini** (Preview) | **gpt-5-nano** (Preview) | **gpt-5-chat** (Preview) | **phi4-mm-realtime** (Preview) | **phi4-mini** (Preview) |
+|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|
+| centralindia | Cross-region<sup>1</sup> | Cross-region<sup>1</sup> | Global standard | Global standard | Global standard | Global standard | - | - | - | - | - | - |
+| eastus2 | Global standard | Global standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Global standard | Regional | Regional |
+| southeastasia | - | - | - | - | Global standard | Global standard | - | - | - | - | Regional | Regional |
+| swedencentral | Global standard | Global standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Data zone standard | Global standard | Regional | Regional |
+| westus2 | Cross-region<sup>2</sup> | Cross-region<sup>2</sup> | Data zone standard | Data zone standard | Data zone standard | Data zone standard | - | - | - | - | Regional | Regional |
+|australiaeast| - | - | Global standard | Global standard | Global standard | Global standard | - | - | - | - | - | - |
+|japaneast| - | - | Global standard | Global standard | Global standard | Global standard | - | - | - | - | Regional | Regional |
+|eastus| - | - | Data zone standard | Data zone standard | Data zone standard | Data zone standard | - | - | - | - | - | - |
+|uksouth| - | - | Global standard | Global standard | Global standard | Global standard | - | - | - | - | - | - |
+|westeurope| - | - | Data zone standard | Data zone standard | Data zone standard | Data zone standard | - | - | - | - | - | - |
+
+<sup>1</sup> The Azure AI Foundry resource must be in Central India. Azure AI Speech features remain in Central India. The voice live API uses Sweden Central as needed for generative AI load balancing.
 
 <sup>2</sup> The Azure AI Foundry resource must be in West US 2. Azure AI Speech features remain in West US 2. The voice live API uses East US 2 as needed for generative AI load balancing.
 
@@ -262,7 +267,7 @@ The regions in these tables support most of the core features of the Speech serv
 
 # [Scenarios](#tab/scenarios)
 
-| **Region** | **Pronunciation assessment** | **Speaker recognition** | **Voice assistants** |
+| **Region** | **Pronunciation assessment** | **Speaker recognition** | **Voice assistants** |
 |-----|-----|-----|
 | australiaeast ||| |
 | brazilsouth || | |

articles/ai-services/speech-service/toc.yml

Lines changed: 4 additions & 2 deletions
@@ -236,14 +236,16 @@ items:
 items:
 - name: Voice live overview
 href: voice-live.md
+- name: Voice live language support
+href: voice-live-language-support.md
 - name: Voice live with Foundry models quickstart
 href: voice-live-quickstart.md
 - name: Voice live with Foundry agents quickstart
 href: voice-live-agents-quickstart.md
 - name: How to use voice live
 href: voice-live-how-to.md
-- name: Voice live language support
-href: voice-live-language-support.md
+- name: How to customize voice live input and output
+href: voice-live-how-to-customize.md
 - name: Audio events reference
 href: /azure/ai-foundry/openai/realtime-audio-reference?context=/azure/ai-services/speech-service/context/context
 - name: Keyword recognition
