Skip to content

Commit 1492c5e

Browse files
xitzhangXiting ZhangCopilot
authored
[VoiceLive]Add English-Specific Detection (#43048)
* [VoiceLive] Add async function-calling agent sample * add phrase list * fix typo * Update sdk/ai/azure-ai-voicelive/samples/async_function_calling_sample.py Co-authored-by: Copilot <[email protected]> * Update sdk/ai/azure-ai-voicelive/samples/async_function_calling_sample.py Co-authored-by: Copilot <[email protected]> * update * fix typo * update changelog * update * remove breaking change section * update changelog * fix change log * revert changelog I lost * update version and change log * enable type verification * update * update models * update Dict * update changelog * Update sdk/ai/azure-ai-voicelive/CHANGELOG.md Co-authored-by: Copilot <[email protected]> * Add CachedTokenDetails --------- Co-authored-by: Xiting Zhang <[email protected]> Co-authored-by: Copilot <[email protected]>
1 parent d77526a commit 1492c5e

File tree

4 files changed

+220
-92
lines changed

4 files changed

+220
-92
lines changed

sdk/ai/azure-ai-voicelive/CHANGELOG.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,13 +1,15 @@
11
# Release History
22

3-
## 1.0.0b3 (Unreleased)
3+
## 1.0.0b3 (2025-09-17)
44

55
### Features Added
66

77
- **Transcription improvement**: Added phrase list
88
- **New Voice Types**: Added `AzurePlatformVoice` and `LLMVoice` classes
99
- **Enhanced Speech Detection**: Added `AzureSemanticVadServer` class
1010
- **Improved Function Calling**: Enhanced async function calling sample with better error handling
11+
- **English-Specific Detection**: Added `AzureSemanticDetectionEn` class for optimized English-only semantic end-of-utterance detection
12+
- **English-Specific Voice Activity Detection**: Added `AzureSemanticVadEn` class for enhanced English-only voice activity detection
1113

1214
### Breaking Changes
1315

sdk/ai/azure-ai-voicelive/apiview-properties.json

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -18,10 +18,13 @@
1818
"azure.ai.voicelive.models.AzurePlatformVoice": "VoiceLive.AzurePlatformVoice",
1919
"azure.ai.voicelive.models.EOUDetection": "VoiceLive.EOUDetection",
2020
"azure.ai.voicelive.models.AzureSemanticDetection": "VoiceLive.AzureSemanticDetection",
21+
"azure.ai.voicelive.models.AzureSemanticDetectionEn": "VoiceLive.AzureSemanticDetectionEn",
2122
"azure.ai.voicelive.models.AzureSemanticDetectionMultilingual": "VoiceLive.AzureSemanticDetectionMultilingual",
2223
"azure.ai.voicelive.models.AzureSemanticVad": "VoiceLive.AzureSemanticVad",
24+
"azure.ai.voicelive.models.AzureSemanticVadEn": "VoiceLive.AzureSemanticVadEn",
2325
"azure.ai.voicelive.models.AzureSemanticVadServer": "VoiceLive.AzureSemanticVadServer",
2426
"azure.ai.voicelive.models.AzureStandardVoice": "VoiceLive.AzureStandardVoice",
27+
"azure.ai.voicelive.models.CachedTokenDetails": "VoiceLive.CachedTokenDetails",
2528
"azure.ai.voicelive.models.ClientEvent": "VoiceLive.ClientEvent",
2629
"azure.ai.voicelive.models.ClientEventConversationItemCreate": "VoiceLive.ClientEventConversationItemCreate",
2730
"azure.ai.voicelive.models.ClientEventConversationItemDelete": "VoiceLive.ClientEventConversationItemDelete",
@@ -81,7 +84,7 @@
8184
"azure.ai.voicelive.models.ServerEventConversationItemRetrieved": "VoiceLive.ServerEventConversationItemRetrieved",
8285
"azure.ai.voicelive.models.ServerEventConversationItemTruncated": "VoiceLive.ServerEventConversationItemTruncated",
8386
"azure.ai.voicelive.models.ServerEventError": "VoiceLive.ServerEventError",
84-
"azure.ai.voicelive.models.ServerEventErrorDetails": "VoiceLive.ServerEventError.error.anonymous",
87+
"azure.ai.voicelive.models.ServerEventErrorDetails": "VoiceLive.ServerEventErrorDetails",
8588
"azure.ai.voicelive.models.ServerEventInputAudioBufferCleared": "VoiceLive.ServerEventInputAudioBufferCleared",
8689
"azure.ai.voicelive.models.ServerEventInputAudioBufferCommitted": "VoiceLive.ServerEventInputAudioBufferCommitted",
8790
"azure.ai.voicelive.models.ServerEventInputAudioBufferSpeechStarted": "VoiceLive.ServerEventInputAudioBufferSpeechStarted",
@@ -94,7 +97,7 @@
9497
"azure.ai.voicelive.models.ServerEventResponseContentPartDone": "VoiceLive.ServerEventResponseContentPartDone",
9598
"azure.ai.voicelive.models.ServerEventResponseCreated": "VoiceLive.ServerEventResponseCreated",
9699
"azure.ai.voicelive.models.ServerEventResponseDone": "VoiceLive.ServerEventResponseDone",
97-
"azure.ai.voicelive.models.ServerEventResponseFunctionCallArgumentsDelta": "VoiceLive.ServerEventResponseFunctionCallArgumentsDelta",
100+
"azure.ai.voicelive.models.ServerEventResponseFunctionCallArgumentsDelta": "VoiceLive.ServerEventResponseFunctionCallArgumentsDelta",
98101
"azure.ai.voicelive.models.ServerEventResponseFunctionCallArgumentsDone": "VoiceLive.ServerEventResponseFunctionCallArgumentsDone",
99102
"azure.ai.voicelive.models.ServerEventResponseOutputItemAdded": "VoiceLive.ServerEventResponseOutputItemAdded",
100103
"azure.ai.voicelive.models.ServerEventResponseOutputItemDone": "VoiceLive.ServerEventResponseOutputItemDone",
@@ -122,9 +125,9 @@
122125
"azure.ai.voicelive.models.ResponseStatus": "VoiceLive.ResponseStatus",
123126
"azure.ai.voicelive.models.OAIVoice": "VoiceLive.OAIVoice",
124127
"azure.ai.voicelive.models.Phi4mmVoice": "VoiceLive.Phi4mmVoice",
128+
"azure.ai.voicelive.models.AudioFormat": "VoiceLive.AudioFormat",
125129
"azure.ai.voicelive.models.Modality": "VoiceLive.Modality",
126130
"azure.ai.voicelive.models.AnimationOutputType": "VoiceLive.AnimationOutputType",
127-
"azure.ai.voicelive.models.AudioFormat": "VoiceLive.AudioFormat",
128131
"azure.ai.voicelive.models.AudioTimestampType": "VoiceLive.AudioTimestampType",
129132
"azure.ai.voicelive.models.ToolType": "VoiceLive.ToolType",
130133
"azure.ai.voicelive.models.ToolChoiceLiteral": "VoiceLive.ToolChoiceLiteral",

sdk/ai/azure-ai-voicelive/azure/ai/voicelive/models/__init__.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -25,6 +25,7 @@
2525
AzureSemanticVadServer,
2626
AzureStandardVoice,
2727
AzureVoice,
28+
CachedTokenDetails,
2829
ClientEvent,
2930
ClientEventConversationItemCreate,
3031
ClientEventConversationItemDelete,
@@ -160,6 +161,7 @@
160161
"AzureSemanticVadServer",
161162
"AzureStandardVoice",
162163
"AzureVoice",
164+
"CachedTokenDetails",
163165
"ClientEvent",
164166
"ClientEventConversationItemCreate",
165167
"ClientEventConversationItemDelete",

0 commit comments

Comments
 (0)