
Commit eec115c

Authored by Xiting Zhang (xitzhang) and Copilot
[VoiceLive] Remove sync APIs, enhanced semantic detection and video background support (#43131)
* [VoiceLive] Add async function-calling agent sample
* add phrase list
* fix typo
* Update sdk/ai/azure-ai-voicelive/samples/async_function_calling_sample.py (Co-authored-by: Copilot <[email protected]>)
* Update sdk/ai/azure-ai-voicelive/samples/async_function_calling_sample.py (Co-authored-by: Copilot <[email protected]>)
* update
* fix typo
* update changelog
* update
* remove breaking change section
* update changelog
* fix change log
* revert changelog I lost
* update version and change log
* enable type verification
* update
* [VoiceLive] Release 1.0.0b4
* [VoiceLive] Enhanced semantic detection, video background support, and model consistency improvements
* Remove sync API
* Rename `AudioInputTranscriptionSettings` to `AudioInputTranscriptionOptions`
* typo
* remove indentation
* fix diarization
* update
* update
* Update sdk/ai/azure-ai-voicelive/azure/ai/voicelive/models/_models.py (Co-authored-by: Copilot <[email protected]>)
* Update sdk/ai/azure-ai-voicelive/azure/ai/voicelive/models/_models.py (Co-authored-by: Copilot <[email protected]>)
* update models
* update
* add unit tests
* remove useless reference
* new line for pylint check

Co-authored-by: Xiting Zhang <[email protected]>
Co-authored-by: Copilot <[email protected]>
1 parent e8918b0 commit eec115c

27 files changed: +4085 −2062 lines

sdk/ai/azure-ai-voicelive/CHANGELOG.md

Lines changed: 94 additions & 0 deletions
@@ -1,5 +1,99 @@
 # Release History

+## 1.0.0b5 (Unreleased)
+
+### Features Added
+
+- **Enhanced Semantic Detection Type Safety**: Added new `EouThresholdLevel` enum for better type safety in end-of-utterance detection:
+  - `LOW` for low sensitivity threshold level
+  - `MEDIUM` for medium sensitivity threshold level
+  - `HIGH` for high sensitivity threshold level
+  - `DEFAULT` for default sensitivity threshold level
+- **Improved Semantic Detection Configuration**: Enhanced semantic detection classes with better type annotations:
+  - The `threshold_level` parameter now supports both string values and the `EouThresholdLevel` enum
+  - Cleaner type definitions for `AzureSemanticDetection`, `AzureSemanticDetectionEn`, and `AzureSemanticDetectionMultilingual`
+  - Improved documentation for threshold level parameters
+- **Comprehensive Unit Test Suite**: Added extensive unit test coverage with 200+ test cases covering:
+  - All enum types and their functionality
+  - Model creation, validation, and serialization
+  - Async connection functionality with proper mocking
+  - Client event handling and workflows
+  - Voice configuration across all supported types
+  - Message handling with content part hierarchy
+  - Integration scenarios and real-world usage patterns
+  - Recent changes validation and backwards compatibility
+- **API Version Update**: Updated to API version `2025-10-01` (from `2025-05-01-preview`)
+- **Enhanced Type Safety**: Added new `AzureVoiceType` enum with values for better Azure voice type categorization:
+  - `AZURE_CUSTOM` for custom voice configurations
+  - `AZURE_STANDARD` for standard voice configurations
+  - `AZURE_PERSONAL` for personal voice configurations
+- **Improved Message Handling**: Added `MessageRole` enum for better role type safety in message items
+- **Enhanced Model Documentation**: Comprehensive documentation improvements across all models:
+  - Added detailed docstrings for model classes and their parameters
+  - Enhanced enum value documentation with descriptions
+  - Improved type annotations and parameter descriptions
+- **Enhanced Semantic Detection**: Added improved configuration options for all semantic detection classes:
+  - Added `threshold_level` parameter with options `"low"`, `"medium"`, `"high"`, `"default"` (recommended over the deprecated `threshold`)
+  - Added `timeout_ms` parameter for timeout configuration in milliseconds (recommended over the deprecated `timeout`)
+- **Video Background Support**: Added new `Background` model for video background customization:
+  - Support for solid color backgrounds in hex format (e.g., `#00FF00FF`)
+  - Support for image URL backgrounds
+  - Mutually exclusive color and image URL options
+- **Enhanced Video Parameters**: Extended the `VideoParams` model with:
+  - `background` parameter for configuring video backgrounds using the new `Background` model
+  - `gop_size` parameter for Group of Pictures (GOP) size control, affecting compression efficiency and seeking performance
+- **Improved Type Safety**: Added `TurnDetectionType` enum for better type safety and IntelliSense support
+- **Package Structure Modernization**: Simplified package initialization with namespace package support
+- **Enhanced Error Handling**: Added `ConnectionError` and `ConnectionClosed` exception classes to the async API for better WebSocket error management
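The `EouThresholdLevel` values and the string-or-enum `threshold_level` parameter described above can be sketched as follows. This is an illustrative stand-in, not the SDK's implementation; the lowercase wire values are assumed from the documented `"low"`/`"medium"`/`"high"`/`"default"` options.

```python
from enum import Enum

class EouThresholdLevel(str, Enum):
    """Local sketch of the enum described in the changelog; the string
    values are assumed to match the documented option names."""
    LOW = "low"
    MEDIUM = "medium"
    HIGH = "high"
    DEFAULT = "default"

def normalize_threshold_level(level):
    """Accept either a plain string or an EouThresholdLevel member,
    mirroring how the changelog says `threshold_level` behaves."""
    return EouThresholdLevel(level).value

# Both spellings resolve to the same wire value:
print(normalize_threshold_level("high"))                  # high
print(normalize_threshold_level(EouThresholdLevel.HIGH))  # high
```

Because the enum mixes in `str`, code that compares against the literal strings keeps working while enum members gain IDE completion.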
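The mutual-exclusivity rule on the new `Background` model can be sketched with a local dataclass; the field names `color` and `image_url` are assumptions taken from the bullet points above, not the real SDK type.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Background:
    """Illustrative stand-in for the `Background` model: a solid color
    (hex, e.g. "#00FF00FF") or an image URL, never both."""
    color: Optional[str] = None
    image_url: Optional[str] = None

    def __post_init__(self):
        # Enforce the documented "mutually exclusive" constraint eagerly,
        # so invalid configurations fail at construction time.
        if self.color is not None and self.image_url is not None:
            raise ValueError("color and image_url are mutually exclusive")

green = Background(color="#00FF00FF")                           # solid color
pic = Background(image_url="https://example.com/backdrop.png")  # image URL
```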
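The new `ConnectionError`/`ConnectionClosed` pair enables retry logic like the sketch below. The classes here are local stand-ins (the real exceptions' constructor signatures are not shown in this commit), and note that the SDK's `ConnectionError` deliberately shares its name with Python's builtin.

```python
import asyncio

class ConnectionError(Exception):
    """Sketch of the base WebSocket error type named in the changelog
    (shadows the builtin, as the SDK's own class does in its namespace)."""

class ConnectionClosed(ConnectionError):
    """Sketch of the 'connection closed' error; attributes are assumed."""
    def __init__(self, code: int, reason: str):
        super().__init__(f"closed: {code} {reason}")
        self.code = code
        self.reason = reason

async def run_with_reconnect(open_session, attempts: int = 3) -> str:
    """Retry an async session when the peer closes the connection."""
    for attempt in range(attempts):
        try:
            return await open_session()
        except ConnectionClosed:
            if attempt == attempts - 1:
                raise
            await asyncio.sleep(0)  # placeholder backoff; real code would wait
    raise ConnectionError("unreachable")

# Simulated session that fails once, then succeeds:
calls = {"n": 0}
async def flaky():
    calls["n"] += 1
    if calls["n"] == 1:
        raise ConnectionClosed(1006, "abnormal closure")
    return "ok"

print(asyncio.run(run_with_reconnect(flaky)))  # ok
```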
+### Breaking Changes
+
+- **Cross-Language Package Identity Update**: Updated the package ID from `VoiceLive` to `VoiceLive.WebSocket` for better cross-language consistency
+- **Model Refactoring**:
+  - Renamed `UserContentPart` to `MessageContentPart` for a clearer content part hierarchy
+  - All message items now require a `content` field with a list of `MessageContentPart` objects
+  - `OutputTextContentPart` now inherits from `MessageContentPart` instead of being standalone
+- **Enhanced Type Safety**:
+  - Azure voice classes now use `AzureVoiceType` enum discriminators instead of string literals
+  - Message role discriminators now use `MessageRole` enum values for better type safety
+- **Removed Deprecated Parameters**: Completely removed deprecated parameters from the semantic detection classes:
+  - Removed the `threshold` parameter from all semantic detection classes (`AzureSemanticDetection`, `AzureSemanticDetectionEn`, `AzureSemanticDetectionMultilingual`)
+  - Removed the `timeout` parameter from all semantic detection classes
+  - Users must now use the `threshold_level` and `timeout_ms` parameters respectively
+- **Removed Synchronous API**: Completely removed synchronous WebSocket operations to focus exclusively on async patterns:
+  - Removed the sync `connect()` function and sync `VoiceLiveConnection` class from the main patch implementation
+  - Removed the sync `basic_voice_assistant.py` sample (only the async version remains)
+  - Simplified the sync patch to a minimal structure with empty exports
+  - All functionality is now available only through async patterns
+- **Updated Dependencies**: Modified package dependencies to reflect the async-only architecture:
+  - Moved `aiohttp>=3.9.0,<4.0.0` from optional to required dependency
+  - Removed the `websockets` optional dependency, as the sync API no longer exists
+  - Removed the optional dependency groups `websockets`, `aiohttp`, and `all-websockets`
+- **Model Renames**:
+  - Renamed `AudioInputTranscriptionSettings` to `AudioInputTranscriptionOptions` for consistency with naming conventions
+  - Renamed `AzureMultilingualSemanticVad` to `AzureSemanticVadMultilingual` for naming consistency with other multilingual variants
+- **Enhanced Type Safety**: Turn detection discriminator types now use enum values instead of string literals for better type safety
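Because the deprecated keywords are now removed rather than merely discouraged, old call sites fail at the parameter level. A hypothetical guard illustrates the migration surface; `semantic_detection_config` and the `type` discriminator string are inventions for this sketch, not SDK API.

```python
from typing import Optional

def semantic_detection_config(*, threshold_level: str = "default",
                              timeout_ms: Optional[int] = None, **deprecated):
    """Hypothetical builder mirroring the 1.0.0b5 parameter surface:
    `threshold`/`timeout` are gone, so reject them with a pointer to
    their replacements."""
    for old, new in (("threshold", "threshold_level"), ("timeout", "timeout_ms")):
        if old in deprecated:
            raise TypeError(f"'{old}' was removed in 1.0.0b5; use '{new}' instead")
    config = {"type": "azure_semantic_detection",  # discriminator value assumed
              "threshold_level": threshold_level}
    if timeout_ms is not None:
        config["timeout_ms"] = timeout_ms
    return config

# New-style call succeeds; an old-style `threshold=0.5` call raises TypeError.
print(semantic_detection_config(threshold_level="high", timeout_ms=1500))
```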
+### Bug Fixes
+
+- **Serialization Improvements**: Fixed a type casting issue in the serialization utilities for better enum handling and type safety
+
+### Other Changes
+
+- **Testing Infrastructure**: Added a comprehensive unit test suite with extensive coverage:
+  - 8 main test files with 200+ individual test methods
+  - Tests for all enums, models, async operations, client events, voice configurations, and message handling
+  - Integration tests covering real-world scenarios and recent changes
+  - Proper mocking for async WebSocket connections
+  - Backwards compatibility validation
+  - Test coverage for all recent changes and enhancements
+- **API Documentation**: Updated API view properties to reflect model structure changes, new enums, and the cross-language package identity
+- **Documentation Updates**: Comprehensive updates to all markdown documentation:
+  - Updated README.md to reflect the async-only nature, with updated examples and installation instructions
+  - Updated the samples README.md to remove sync sample references
+  - Enhanced BASIC_VOICE_ASSISTANT.md with a comprehensive async implementation guide
+  - Added MIGRATION_GUIDE.md for users upgrading from previous versions
 ## 1.0.0b4 (2025-09-19)

 ### Features Added

sdk/ai/azure-ai-voicelive/README.md

Lines changed: 45 additions & 77 deletions
@@ -7,6 +7,8 @@ typed server events (including audio) for responsive, interruptible conversation

 > **Status:** Preview. APIs are subject to change.

+> **Important:** As of version 1.0.0b5, this SDK is **async-only**. The synchronous API has been removed to focus exclusively on async patterns. All examples and samples use `async`/`await` syntax.
+
 ---

 Getting started
@@ -25,21 +27,14 @@ Getting started
 # Base install (core client only)
 python -m pip install azure-ai-voicelive

-# For synchronous streaming (uses websockets)
-python -m pip install "azure-ai-voicelive[websockets]"
-
 # For asynchronous streaming (uses aiohttp)
 python -m pip install "azure-ai-voicelive[aiohttp]"

-# For both sync + async scenarios (recommended if unsure)
-python -m pip install "azure-ai-voicelive[all-websockets]" pyaudio python-dotenv
+# For voice samples (includes audio processing)
+python -m pip install "azure-ai-voicelive[aiohttp]" pyaudio python-dotenv
 ```

-WebSocket streaming features require additional dependencies.
-Install them with:
-pip install "azure-ai-voicelive[websockets]"      # for sync
-pip install "azure-ai-voicelive[aiohttp]"         # for async
-pip install "azure-ai-voicelive[all-websockets]"  # for both
+The SDK now exclusively provides async-only WebSocket connections using `aiohttp`.

 ### Authenticate
@@ -58,50 +53,65 @@ AZURE_VOICELIVE_ENDPOINT="your-endpoint"
 Then, use the key in your code:

 ```python
+import asyncio
 from azure.core.credentials import AzureKeyCredential
 from azure.ai.voicelive import connect

-connection = connect(
-    endpoint="your-endpoint",
-    credential=AzureKeyCredential("your-api-key"),
-    model="gpt-4o-realtime-preview"
-)
+async def main():
+    async with connect(
+        endpoint="your-endpoint",
+        credential=AzureKeyCredential("your-api-key"),
+        model="gpt-4o-realtime-preview"
+    ) as connection:
+        # Your async code here
+        pass
+
+asyncio.run(main())
 ```

 #### AAD Token Authentication

 For production applications, AAD authentication is recommended:

 ```python
-from azure.identity import DefaultAzureCredential
+import asyncio
+from azure.identity.aio import DefaultAzureCredential
 from azure.ai.voicelive import connect

-credential = DefaultAzureCredential()
+async def main():
+    credential = DefaultAzureCredential()
+
+    async with connect(
+        endpoint="your-endpoint",
+        credential=credential,
+        model="gpt-4o-realtime-preview"
+    ) as connection:
+        # Your async code here
+        pass

-connection = connect(
-    endpoint="your-endpoint",
-    credential=credential,
-    model="gpt-4o-realtime-preview"
-)
+asyncio.run(main())
 ```

 ---

 Key concepts
 ------------

-- **VoiceLiveConnection** – Manages an active WebSocket connection to the service
+- **VoiceLiveConnection** – Manages an active async WebSocket connection to the service
 - **Session Management** – Configure conversation parameters:
-  - **SessionResource** – Update session parameters (voice, formats, VAD)
+  - **SessionResource** – Update session parameters (voice, formats, VAD) with async methods
   - **RequestSession** – Strongly-typed session configuration
   - **ServerVad** – Configure voice activity detection
   - **AzureStandardVoice** – Configure voice settings
 - **Audio Handling**:
-  - **InputAudioBufferResource** – Manage audio input to the service
-  - **OutputAudioBufferResource** – Control audio output from the service
+  - **InputAudioBufferResource** – Manage audio input to the service with async methods
+  - **OutputAudioBufferResource** – Control audio output from the service with async methods
 - **Conversation Management**:
-  - **ResponseResource** – Create or cancel model responses
-  - **ConversationResource** – Manage conversation items
+  - **ResponseResource** – Create or cancel model responses with async methods
+  - **ConversationResource** – Manage conversation items with async methods
+- **Error Handling**:
+  - **ConnectionError** – Base exception for WebSocket connection errors
+  - **ConnectionClosed** – Raised when the WebSocket connection is closed
 - **Strongly-Typed Events** – Process service events with type safety:
   - `SESSION_UPDATED`, `RESPONSE_AUDIO_DELTA`, `RESPONSE_DONE`
   - `INPUT_AUDIO_BUFFER_SPEECH_STARTED`, `INPUT_AUDIO_BUFFER_SPEECH_STOPPED`
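The typed events listed above are consumed by iterating the connection. The sketch below mimics that shape with a stub async generator standing in for a live `VoiceLiveConnection`, so only the iteration pattern, not the real client, is shown.

```python
import asyncio
from dataclasses import dataclass

# Event type names are taken from the list above; the stub connection is a
# stand-in that yields them the way a real async connection would.
@dataclass
class ServerEvent:
    type: str

async def fake_connection():
    for name in ("SESSION_UPDATED", "RESPONSE_AUDIO_DELTA", "RESPONSE_DONE"):
        yield ServerEvent(type=name)

async def main():
    seen = []
    async for event in fake_connection():
        seen.append(event.type)
        if event.type == "RESPONSE_DONE":  # stop once the response completes
            break
    return seen

print(asyncio.run(main()))  # ['SESSION_UPDATED', 'RESPONSE_AUDIO_DELTA', 'RESPONSE_DONE']
```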
@@ -112,25 +122,25 @@ Key concepts
 Examples
 --------

-### Basic async Voice Assistant (Featured Sample)
+### Basic Voice Assistant (Featured Sample)

-The Basic async Voice Assistant sample demonstrates full-featured voice interaction with:
+The Basic Voice Assistant sample demonstrates full-featured voice interaction with:

 - Real-time speech streaming
 - Server-side voice activity detection
 - Interruption handling
 - High-quality audio processing

 ```bash
 # Run the basic voice assistant sample
-# Requires [aiohttp] for async (easiest: [all-websockets])
+# Requires [aiohttp] for async
 python samples/basic_voice_assistant_async.py

 # With custom parameters
 python samples/basic_voice_assistant_async.py --model gpt-4o-realtime-preview --voice alloy --instructions "You're a helpful assistant"
 ```

-### Minimal async example
+### Minimal example

 ```python
 import asyncio
@@ -172,44 +182,6 @@ async def main():
 asyncio.run(main())
 ```

-### Minimal sync example
-
-```python
-from azure.core.credentials import AzureKeyCredential
-from azure.ai.voicelive import connect
-from azure.ai.voicelive.models import (
-    RequestSession, Modality, InputAudioFormat, OutputAudioFormat, ServerVad, ServerEventType
-)
-
-API_KEY = "your-api-key"
-ENDPOINT = "your-endpoint"
-MODEL = "gpt-4o-realtime-preview"
-
-with connect(
-    endpoint=ENDPOINT,
-    credential=AzureKeyCredential(API_KEY),
-    model=MODEL
-) as conn:
-    session = RequestSession(
-        modalities=[Modality.TEXT, Modality.AUDIO],
-        instructions="You are a helpful assistant.",
-        input_audio_format=InputAudioFormat.PCM16,
-        output_audio_format=OutputAudioFormat.PCM16,
-        turn_detection=ServerVad(
-            threshold=0.5,
-            prefix_padding_ms=300,
-            silence_duration_ms=500
-        ),
-    )
-    conn.session.update(session=session)
-
-    # Process events
-    for evt in conn:
-        print(f"Event: {evt.type}")
-        if evt.type == ServerEventType.RESPONSE_DONE:
-            break
-```

 Available Voice Options
 -----------------------
@@ -279,12 +251,8 @@ Troubleshooting
 Verify `AZURE_VOICELIVE_ENDPOINT`, network rules, and that your credential has access.

 - **Missing WebSocket dependencies:**
-  If you see:
-  WebSocket streaming features require additional dependencies.
-  Install them with:
-  pip install "azure-ai-voicelive[websockets]"      # for sync
-  pip install "azure-ai-voicelive[aiohttp]"         # for async
-  pip install "azure-ai-voicelive[all-websockets]"  # for both
+  If you see import errors, make sure you have installed the package:
+  pip install "azure-ai-voicelive[aiohttp]"

 - **Auth failures:**
   For API key, double-check `AZURE_VOICELIVE_API_KEY`. For AAD, ensure the identity is authorized.
Lines changed: 1 addition & 1 deletion
@@ -1,3 +1,3 @@
 {
-  "apiVersion": "2025-05-01-preview"
+  "apiVersion": "2025-10-01"
 }

sdk/ai/azure-ai-voicelive/apiview-properties.json

Lines changed: 12 additions & 8 deletions
@@ -1,27 +1,28 @@
 {
-  "CrossLanguagePackageId": "VoiceLive",
+  "CrossLanguagePackageId": "VoiceLive.WebSocket",
   "CrossLanguageDefinitionId": {
     "azure.ai.voicelive.models.AgentConfig": "VoiceLive.AgentConfig",
     "azure.ai.voicelive.models.Animation": "VoiceLive.Animation",
     "azure.ai.voicelive.models.ConversationRequestItem": "VoiceLive.ConversationRequestItem",
     "azure.ai.voicelive.models.MessageItem": "VoiceLive.MessageItem",
     "azure.ai.voicelive.models.AssistantMessageItem": "VoiceLive.AssistantMessageItem",
     "azure.ai.voicelive.models.AudioEchoCancellation": "VoiceLive.AudioEchoCancellation",
-    "azure.ai.voicelive.models.AudioInputTranscriptionSettings": "VoiceLive.AudioInputTranscriptionSettings",
+    "azure.ai.voicelive.models.AudioInputTranscriptionOptions": "VoiceLive.AudioInputTranscriptionOptions",
     "azure.ai.voicelive.models.AudioNoiseReduction": "VoiceLive.AudioNoiseReduction",
     "azure.ai.voicelive.models.AvatarConfig": "VoiceLive.AvatarConfig",
     "azure.ai.voicelive.models.AzureVoice": "VoiceLive.AzureVoice",
     "azure.ai.voicelive.models.AzureCustomVoice": "VoiceLive.AzureCustomVoice",
-    "azure.ai.voicelive.models.TurnDetection": "VoiceLive.TurnDetection",
-    "azure.ai.voicelive.models.AzureMultilingualSemanticVad": "VoiceLive.AzureMultilingualSemanticVad",
     "azure.ai.voicelive.models.AzurePersonalVoice": "VoiceLive.AzurePersonalVoice",
     "azure.ai.voicelive.models.EOUDetection": "VoiceLive.EOUDetection",
     "azure.ai.voicelive.models.AzureSemanticDetection": "VoiceLive.AzureSemanticDetection",
     "azure.ai.voicelive.models.AzureSemanticDetectionEn": "VoiceLive.AzureSemanticDetectionEn",
     "azure.ai.voicelive.models.AzureSemanticDetectionMultilingual": "VoiceLive.AzureSemanticDetectionMultilingual",
+    "azure.ai.voicelive.models.TurnDetection": "VoiceLive.TurnDetection",
     "azure.ai.voicelive.models.AzureSemanticVad": "VoiceLive.AzureSemanticVad",
     "azure.ai.voicelive.models.AzureSemanticVadEn": "VoiceLive.AzureSemanticVadEn",
+    "azure.ai.voicelive.models.AzureSemanticVadMultilingual": "VoiceLive.AzureSemanticVadMultilingual",
     "azure.ai.voicelive.models.AzureStandardVoice": "VoiceLive.AzureStandardVoice",
+    "azure.ai.voicelive.models.Background": "VoiceLive.Background",
     "azure.ai.voicelive.models.CachedTokenDetails": "VoiceLive.CachedTokenDetails",
     "azure.ai.voicelive.models.ClientEvent": "VoiceLive.ClientEvent",
     "azure.ai.voicelive.models.ClientEventConversationItemCreate": "VoiceLive.ClientEventConversationItemCreate",
@@ -48,7 +49,7 @@
     "azure.ai.voicelive.models.Tool": "VoiceLive.Tool",
     "azure.ai.voicelive.models.FunctionTool": "VoiceLive.FunctionTool",
     "azure.ai.voicelive.models.IceServer": "VoiceLive.IceServer",
-    "azure.ai.voicelive.models.UserContentPart": "VoiceLive.UserContentPart",
+    "azure.ai.voicelive.models.MessageContentPart": "VoiceLive.MessageContentPart",
     "azure.ai.voicelive.models.InputAudioContentPart": "VoiceLive.InputAudioContentPart",
     "azure.ai.voicelive.models.InputTextContentPart": "VoiceLive.InputTextContentPart",
     "azure.ai.voicelive.models.InputTokenDetails": "VoiceLive.InputTokenDetails",
@@ -123,19 +124,22 @@
     "azure.ai.voicelive.models.ClientEventType": "VoiceLive.ClientEventType",
     "azure.ai.voicelive.models.ItemType": "VoiceLive.ItemType",
     "azure.ai.voicelive.models.ItemParamStatus": "VoiceLive.ItemParamStatus",
+    "azure.ai.voicelive.models.MessageRole": "VoiceLive.MessageRole",
     "azure.ai.voicelive.models.Modality": "VoiceLive.Modality",
     "azure.ai.voicelive.models.OAIVoice": "VoiceLive.OAIVoice",
+    "azure.ai.voicelive.models.AzureVoiceType": "VoiceLive.AzureVoiceType",
     "azure.ai.voicelive.models.PersonalVoiceModels": "VoiceLive.PersonalVoiceModels",
     "azure.ai.voicelive.models.OutputAudioFormat": "VoiceLive.OutputAudioFormat",
     "azure.ai.voicelive.models.ToolType": "VoiceLive.ToolType",
     "azure.ai.voicelive.models.AnimationOutputType": "VoiceLive.AnimationOutputType",
     "azure.ai.voicelive.models.InputAudioFormat": "VoiceLive.InputAudioFormat",
+    "azure.ai.voicelive.models.TurnDetectionType": "VoiceLive.TurnDetectionType",
+    "azure.ai.voicelive.models.EouThresholdLevel": "VoiceLive.EouThresholdLevel",
     "azure.ai.voicelive.models.AudioTimestampType": "VoiceLive.AudioTimestampType",
     "azure.ai.voicelive.models.ToolChoiceLiteral": "VoiceLive.ToolChoiceLiteral",
-    "azure.ai.voicelive.models.ServerEventType": "VoiceLive.ServerEventType",
+    "azure.ai.voicelive.models.ResponseStatus": "VoiceLive.ResponseStatus",
     "azure.ai.voicelive.models.ResponseItemStatus": "VoiceLive.ResponseItemStatus",
-    "azure.ai.voicelive.models.MessageRole": "VoiceLive.MessageRole",
     "azure.ai.voicelive.models.ContentPartType": "VoiceLive.ContentPartType",
-    "azure.ai.voicelive.models.ResponseStatus": "VoiceLive.ResponseStatus"
+    "azure.ai.voicelive.models.ServerEventType": "VoiceLive.ServerEventType"
   }
 }
