|
| 1 | +--- |
| 2 | +title: Data, privacy, and security for Voice live |
| 3 | +titleSuffix: Azure AI services |
| 4 | +description: This document details issues for data, privacy, and security for Voice live. |
| 5 | +author: PatrickFarley |
| 6 | +ms.author: pafarley |
| 7 | +manager: nitinme |
| 8 | +ms.service: azure-ai-speech |
| 9 | +ms.topic: article |
| 10 | +ms.date: 09/29/2025 |
| 11 | +--- |
| 12 | + |
| 13 | +# Data, privacy, and security for Azure AI Voice Live API |
| 14 | + |
| 15 | +[!INCLUDE [non-english-translation](../../includes/non-english-translation.md)] |
| 16 | + |
| 17 | +> [!NOTE] |
| 18 | +> This article is provided for informational purposes only and not for the purpose of providing legal advice. We strongly recommend seeking specialist legal advice when implementing Speech Services. |
| 19 | +
|
| 20 | +This article provides details regarding how data provided by you to the Azure AI Voice Live API ("Voice Live API") is processed, used, and stored. |
| 21 | + |
| 22 | +Voice Live API is a fully managed service designed to empower developers to securely build, deploy, and scale high-quality, and extensible speech to speech experience for their voice agents. With Voice Live API, developers can choose from a list of different natively supported language models like GPT-Realtime, GPT-4.1, GPT-4o and GPT-4o-mini; incorporate an agent they have built using the Azure AI Foundry Agent Service to give the agent speech-in and speech-out capabilities; or bring their own model of choice deployed in Azure AI Foundry. |
| 23 | + |
| 24 | +Voice Live API stores and processes data to provide the service and to monitor for violations of the applicable [Product Terms](https://www.microsoft.com/licensing/terms/). See also [the Microsoft Products and Services Data Protection Addendum](https://aka.ms/DPA), which governs data processing by the Azure AI services, including Voice Live API. Voice Live API is an Azure service;[ learn more about applicable Azure compliance offerings](/compliance/regulatory/offering-home). |
| 25 | + |
| 26 | +> [!IMPORTANT] |
| 27 | +> Your prompts (inputs), completions (outputs), and your training data: |
| 28 | +> |
| 29 | +> - are NOT available to other customers. |
| 30 | +> - are NOT available to OpenAI or other model providers. |
| 31 | +> - are NOT used to improve OpenAI models or other model providers’ models. |
| 32 | +> - are NOT used to train, retrain, or improve Azure OpenAI Service or Azure AI Speech foundation models. |
| 33 | +> - are NOT used to improve any Microsoft or third-party products or services without your permission or instruction. |
| 34 | +> |
| 35 | +> With Voice Live API, your fine-tuned speech models are available exclusively for your use. |
| 36 | +
|
| 37 | +The language models provided with Voice Live API are operated by Microsoft as an Azure service. If you choose to bring your own agent created with [Azure AI Foundry Agent Service](/azure/ai-foundry/agents/overview) or bring your deployed model in [Azure AI Foundry Models](/azure/ai-foundry/concepts/foundry-models-overview) to Voice Live API, additional information on data, privacy, and security is available at [Data, privacy, and security for Azure AI Foundry Agent Service](/azure/ai-foundry/responsible-ai/agents/data-privacy-security) and [Data, privacy, and security for use of models through the model catalog in Azure AI Foundry](/azure/ai-foundry/how-to/concept-data-privacy). |
| 38 | + |
| 39 | +## What data does Azure AI Voice live API process? |
| 40 | + |
| 41 | +Voice Live API processes the following types of data: |
| 42 | + |
| 43 | +- **Prompts and output**. Prompts are submitted by the user, and content is generated by the GenAI model selected and converted to audio with or without avatar by Voice Live API. |
| 44 | +- **Uploaded data**. You can provide your own data for use with Voice Live API using your own Azure Storage account or a configured data store, for example, your custom lexicon file to improve the pronunciation of the text to speech output, or your text and speech data to fine-tune the speech to text, text to speech and avatar model. |
| 45 | +- **External data**. When you use the tools that support function calling, the service processes the outputs of those tools. |
| 46 | + |
| 47 | +> [!IMPORTANT] |
| 48 | +> Custom neural voice ("custom voice") and custom avatar are available with [limited access](/azure/ai-foundry/responsible-ai/speech-service/text-to-speech/limited-access?tabs=cnv). Learn more about data processing, storage and retention for [custom text to speech (custom voice)](/azure/ai-foundry/responsible-ai/speech-service/text-to-speech/data-privacy-security?tabs=custom-neural-voice#recorded-acknowledgement-statement-verification) and [custom avatar](/azure/ai-foundry/responsible-ai/speech-service/text-to-speech/data-privacy-security?tabs=custom-avatar#video-acknowledgement-statement-verification). |
| 49 | +
|
| 50 | +## How does Azure AI Voice live API process data? |
| 51 | + |
| 52 | +The diagram below shows the data processing workflow for Voice Live API. It depicts how the API handles prompts (user audio input) through inferencing to produce content (agent audio output or video output with avatar), as well as how data from external tools is ingested into the service. |
| 53 | + |
| 54 | +:::image type="content" source="media/voice-live-diagram.png" alt-text="Diagram of the Voice live scenario."::: |
| 55 | + |
| 56 | +When these features are enabled by the user, Voice Live API processes audio input for noise suppression, echo cancellation, voice activity detection, and end of utterance detection , prior to sending the audio for speech recognition and language generation. For speech-to-speech models, audio output is generated directly from the language model. If a text-based language model is specified, Voice Live API converts the text response into audio. When an avatar is selected, the service streams the avatar and returns both the audio response and the avatar together. |
| 57 | + |
| 58 | +When you bring your own model deployed in Azure AI Foundry or an agent built with Azure AI Foundry Agent Service to Voice Live API, the service interacts with the specified model endpoints to process your input prompts transcribed from audio and generate text output responses which may be further used or processed by Voice Live API for audio and avatar video generation. Data is processed for model inferencing in accordance with the terms that apply to the relevant model. Learn more at [Data, privacy, and security for Azure OpenAI Service](/azure/ai-foundry/responsible-ai/openai/data-privacy) and [Data, privacy, and security for use of models through the model catalog in AI Foundry portal](/azure/ai-studio/how-to/concept-data-privacy). |
| 59 | + |
| 60 | +To reduce the risk of harmful use of Voice Live API, the service includes [content filtering](/azure/ai-foundry/openai/concepts/content-filter) support. The outputs processed by the service will be filtered in accordance with any content filtering that has been applied to the natively supported models, or the model deployment used by your Foundry Agent. |
| 61 | + |
| 62 | + |
| 63 | +## Data storage and retention |
| 64 | + |
| 65 | +While Voice Live API itself does not store or retain customer data, the features (for example, custom voice, custom avatar, AI Foundry Agent) it interacts with may store customer data as the feature requires. Check data storage for [custom voice](/azure/ai-foundry/responsible-ai/speech-service/text-to-speech/data-privacy-security?tabs=custom-neural-voice#data-storage-and-retention), [custom avatar](/azure/ai-foundry/responsible-ai/speech-service/text-to-speech/data-privacy-security?tabs=custom-avatar#data-storage-and-retention), [AI Foundry Agents](/azure/ai-foundry/responsible-ai/agents/data-privacy-security#data-storage-for-azure-ai-agent-service-features), and [Azure OpenAI](/azure/ai-foundry/responsible-ai/openai/data-privacy?tabs=azure-portal#data-storage-for-azure-openai-service-features) if you are using these components. Learn more about [locations of processing for ‘global’ and ‘data zone’ deployments](/azure/ai-foundry/responsible-ai/openai/data-privacy?tabs=azure-portal#understanding-location-of-processing-for-global-and-data-zone-deployment-types). |
| 66 | + |
| 67 | +Users can opt into a logging feature per debugging assistance from Microsoft engineers, when there is a [support ticket](/azure/ai-services/cognitive-services-support-options?context=%2Fazure%2Fai-services%2Fspeech-service%2Fcontext%2Fcontext#create-an-azure-support-request) filed. With this logging feature, users’ speech data is secured and stored in Azure storage managed by Microsoft within the same resource region. Microsoft’s debugging engineers are authorized Microsoft employees who access the data via point wise queries using request IDs, Secure Access Workstations (SAWs), and Just-In-Time (JIT) request approval granted by team managers. These logs are automatically removed in 30 days after generated. |
| 68 | + |
| 69 | +To learn more about Microsoft's privacy and security commitments visit the [Microsoft Trust Center](https://www.microsoft.com/TrustCenter/CloudServices/Azure/default.aspx). |
0 commit comments