Commit adacc72

Merge pull request #5315 from eric-urban/eur/custom-voice-terms
Custom voice terminology updates
2 parents e3b675c + cd42986 commit adacc72

45 files changed, +362 -376 lines changed

articles/ai-services/speech-service/bring-your-own-storage-speech-resource.md

Lines changed: 3 additions & 4 deletions
Original file line number | Diff line number | Diff line change
@@ -26,13 +26,12 @@ BYOS can be used with several Azure AI services. For Speech, it can be used in t
2626

2727
- [Batch transcription](batch-transcription.md)
2828
- Real-time transcription with [audio and transcription result logging](logging-audio-transcription.md) enabled
29-
- [Custom speech](custom-speech-overview.md) (Custom models for Speech recognition)
29+
- [Custom speech](custom-speech-overview.md) - Fine-tuning of speech to text models with custom data.
3030

3131
**Text to speech**
3232

3333
- [Audio Content Creation](how-to-audio-content-creation.md)
34-
- [Custom neural voice](custom-neural-voice.md) (Custom models for Speech synthesizing)
35-
34+
- [Custom voice](custom-neural-voice.md) - Fine-tuning of text to speech models with custom data.
3635

3736
One Speech resource – Storage account combination can be used for all four scenarios simultaneously in all combinations.
3837

@@ -436,7 +435,7 @@ For more information, see [Prevent anonymous public read access to containers an
436435

437436
**Configure Azure Storage firewall**
438437

439-
Custom neural voice uses [User delegation SAS](/azure/storage/common/storage-sas-overview#user-delegation-sas) to read the data for custom neural voice model training. It requires allowing external network traffic access to the Storage account.
438+
Custom voice uses [User delegation SAS](/azure/storage/common/storage-sas-overview#user-delegation-sas) to read the data for professional voice fine-tuning. It requires allowing external network traffic access to the Storage account.
440439

441440
1. Go to the [Azure portal](https://portal.azure.com/) and sign in to your Azure account.
442441
1. Select the Storage account.

articles/ai-services/speech-service/call-center-overview.md

Lines changed: 1 addition & 1 deletion
@@ -47,7 +47,7 @@ You might want to further customize and fine-tune the experience for your produc
4747
| Speech customization | Description |
4848
| -------------- | ----------- |
4949
| [Custom speech](./custom-speech-overview.md) | A speech to text feature used to evaluate and improve the speech recognition accuracy of use-case specific entities (such as alpha-numeric customer, case, and contract IDs, license plates, and names). You can also train a custom model with your own product names and industry terminology. |
50-
| [Custom neural voice](./custom-neural-voice.md) | A text to speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. |
50+
| [Custom voice](./custom-neural-voice.md) | A text to speech feature that lets you create a one-of-a-kind, customized, synthetic voice for your applications. |
5151

5252
### Language service
5353

Lines changed: 26 additions & 26 deletions
@@ -1,7 +1,7 @@
11
---
2-
title: Custom neural voice lite - Speech service
2+
title: Custom voice lite - Speech service
33
titleSuffix: Azure AI services
4-
description: Use custom neural voice lite to demo and evaluate custom neural voice before investing in professional recordings to create a higher-quality voice.
4+
description: Use custom voice lite to demo and evaluate custom voice before investing in professional recordings to create a higher-quality voice.
55
author: eric-urban
66
manager: nitinme
77
ms.service: azure-ai-speech
@@ -10,57 +10,57 @@ ms.date: 5/12/2025
1010
ms.author: eur
1111
---
1212

13-
# Custom neural voice lite
13+
# Custom voice lite
1414

15-
Azure AI Speech provides two custom neural voice (CNV) project types: CNV lite and CNV professional.
15+
Custom voice lite is a project type where you can demo and evaluate custom voice before investing in professional recordings to create a higher-quality voice. No application is required for demo and evaluation purposes. However, Microsoft restricts and selects the recording and testing samples for use with custom voice lite. You must apply for full access to professional voice fine-tuning in order to deploy and use the custom voice lite model for business purposes. In that case, request access on the [intake form](https://aka.ms/customneural).
1616

17-
- Custom neural voice (CNV) professional allows you to upload your training data collected through professional recording studios and create a higher-quality voice that is nearly indistinguishable from its human samples. CNV professional access is limited based on eligibility and usage criteria. Request access on the [intake form](https://aka.ms/customneural).
18-
- Custom neural voice (CNV) lite is a project type where can demo and evaluate custom neural voice before investing in professional recordings to create a higher-quality voice. No application is required for demo and evaluation purposes. However, Microsoft restricts and selects the recording and testing samples for use with CNV lite. You must apply for full access to CNV professional in order to deploy and use the CNV lite model for business purpose. In that case, request access on the [intake form](https://aka.ms/customneural).
17+
> [!NOTE]
18+
> Custom voice lite is only available in the [Speech Studio](https://aka.ms/speechstudio/customvoice). It isn't available through the Azure AI Foundry portal, REST API, or SDKs.
1919
20-
With a CNV lite project, you record your voice online by reading 20-50 pre-defined scripts provided by Microsoft. After you've recorded at least 20 samples, you can start to train a model. Once the model is trained successfully, you can review the model and check out 20 output samples produced with another set of pre-defined scripts.
20+
With a custom voice lite project, you record your voice online by reading 20-50 pre-defined scripts provided by Microsoft. After you've recorded at least 20 samples, you can start to train a model. Once the model is trained successfully, you can review the model and check out 20 output samples produced with another set of pre-defined scripts.
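The recording thresholds described above (at least 20 samples to start training, up to 50 scripts provided) can be sketched as a small check. This is only an illustration of the numbers in the text: custom voice lite is driven through Speech Studio, so `MIN_SAMPLES`, `MAX_SAMPLES`, `can_start_training`, and `scripts_remaining` are hypothetical names and not part of any Speech API.

```python
# Thresholds taken from the text: 20-50 pre-defined scripts,
# training can start once at least 20 samples are recorded.
MIN_SAMPLES = 20
MAX_SAMPLES = 50


def can_start_training(recorded_samples: int) -> bool:
    """Return True once enough samples are recorded to start training."""
    if recorded_samples < 0:
        raise ValueError("recorded_samples must not be negative")
    return recorded_samples >= MIN_SAMPLES


def scripts_remaining(recorded_samples: int) -> int:
    """Return how many of the provided scripts are still unrecorded."""
    if recorded_samples < 0:
        raise ValueError("recorded_samples must not be negative")
    return max(0, MAX_SAMPLES - recorded_samples)


if __name__ == "__main__":
    for count in (12, 20, 50):
        print(count, can_start_training(count), scripts_remaining(count))
```

A helper like this might be useful in a checklist or planning script for a recording session; it does not talk to Speech Studio.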
2121

22-
See the [supported languages](language-support.md?tabs=tts) for custom neural voice.
22+
See the [supported languages](language-support.md?tabs=tts) for custom voice.
2323

2424
## Compare project types
2525

26-
The following table summarizes key differences between the CNV lite and CNV professional project types.
26+
The following table summarizes key differences between custom voice lite and professional voice fine-tuning.
2727

28-
|**Items**|**Lite**| **Pro**|
28+
|**Items**|**Lite**| **Professional**|
2929
|---------------|---------------|---------------|
3030
|Target scenarios |Demonstration or evaluation |Professional scenarios like brand and character voices for chat bots, or audio content reading.|
3131
|Training data |Record online using Speech Studio |Bring your own data. Recording in a professional studio is recommended. |
3232
|Scripts for recording |Provided in Speech Studio |Use your own scripts that match the use case scenario. Microsoft provides [example scripts](https://github.com/Azure-Samples/Cognitive-Speech-TTS/tree/master/CustomVoice/script) for reference. |
3333
|Required data size |20-50 utterances |300-2000 utterances|
3434
|Training time |Less than one compute hour| Approximately 20-40 compute hours |
3535
|Voice quality |Moderate quality|High quality |
36-
|Availability |Anyone can record samples online and train a model for demo and evaluation purpose. Full access to custom neural voice is required if you want to deploy the CNV lite model for business use. |Data upload isn't restricted, but you can only train and deploy a CNV professional model after access is approved. CNV professional access is limited based on eligibility and usage criteria. Request access on the [intake form](https://aka.ms/customneural).|
37-
|Pricing |Per unit prices apply equally for both the CNV lite and CNV professional projects. Check the [pricing details here](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/). |Per unit prices apply equally for both the CNV lite and CNV professional projects. Check the [pricing details here](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/). |
36+
|Availability |Anyone can record samples online and train a model for demo and evaluation purposes. Full access to custom voice is required if you want to deploy the custom voice lite model for business use. |Data upload isn't restricted, but you can only fine-tune a professional voice after access is approved. Professional voice fine-tuning is limited based on eligibility and usage criteria. Request access on the [intake form](https://aka.ms/customneural).|
37+
|Pricing |Per unit prices apply equally for both custom voice lite and professional voice fine-tuning. Check the [pricing details here](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/). |Per unit prices apply equally for both custom voice lite and professional voice fine-tuning. Check the [pricing details here](https://azure.microsoft.com/pricing/details/cognitive-services/speech-services/). |
3838

39-
## Create a custom neural voice lite project
39+
## Create a custom voice lite project
4040

41-
To create a custom neural voice lite project, follow these steps:
41+
To create a custom voice lite project, follow these steps:
4242

4343
1. Sign in to the [Speech Studio](https://aka.ms/speechstudio/customvoice).
4444
1. Select the subscription and Speech resource to work with.
4545
1. Select **Custom voice** > **Create a project**.
46-
1. Select **Custom neural voice lite** > **Next**. To create a custom neural voice professional project instead, see [Create a project for custom neural voice](professional-voice-create-project.md).
46+
1. Select **Custom neural voice lite** > **Next**. To create a custom voice professional project instead, see the [professional voice fine-tuning documentation](professional-voice-create-project.md).
4747

4848
1. Follow the instructions provided by the wizard to create your project.
4949

5050
> [!IMPORTANT]
51-
> The CNV lite project expires after 90 days unless the [verbal statement](#submit-verbal-statement) recorded by the voice talent is submitted.
51+
> The custom voice lite project expires after 90 days unless the [verbal statement](#submit-verbal-statement) recorded by the voice talent is submitted.
5252
5353
1. Select the new project by name or select **Go to project**. You see these menu items in the left panel: **Record and build**, **Review model**, and **Deploy model**.
5454

55-
:::image type="content" source="media/custom-voice/lite/lite-project-get-started.png" alt-text="Screenshot with an overview of the CNV lite record, train, test, and deploy workflow.":::
55+
:::image type="content" source="media/custom-voice/lite/lite-project-get-started.png" alt-text="Screenshot with an overview of the custom voice lite record, train, test, and deploy workflow.":::
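The 90-day expiry noted in the steps above can be tracked with a short date calculation. This is a minimal sketch that assumes the 90 days are counted from the project creation date, which the text does not state explicitly; `EXPIRY_DAYS` and `project_expiry` are hypothetical names used only for illustration.

```python
from datetime import date, timedelta
from typing import Optional

# From the text: the project expires after 90 days unless the
# voice talent's verbal statement is submitted.
EXPIRY_DAYS = 90


def project_expiry(created_on: date, statement_submitted: bool) -> Optional[date]:
    """Return the expiry date, or None if the verbal statement was submitted.

    Assumption: the 90 days start on the project creation date.
    """
    if statement_submitted:
        return None
    return created_on + timedelta(days=EXPIRY_DAYS)


if __name__ == "__main__":
    print(project_expiry(date(2025, 5, 12), statement_submitted=False))
```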
5656

5757

58-
## Record and build a CNV lite model
58+
## Record and build a custom voice lite model
5959

6060
Record at least 20 voice samples (up to 50) with provided scripts online. Voice samples recorded here are used to create a synthetic version of your voice.
6161

6262
> [!NOTE]
63-
> Custom neural voice training is currently only available in some regions. See footnotes in the [regions](regions.md#regions) table for more information.
63+
> Custom voice training is currently only available in some regions. See footnotes in the [regions](regions.md#regions) table for more information.
6464
6565
Here are some tips to help you record your voice samples:
6666
- Use a good microphone. Increase the clarity of your samples by using a high-quality microphone. Speak about 8 inches away from the microphone to avoid mouth noises.
@@ -69,7 +69,7 @@ Here are some tips to help you record your voice samples:
6969
- Record in one take. To keep a consistent energy level, record all sentences in one session.
7070
- Pronounce each word correctly, and speak clearly.
7171

72-
To record and build a CNV lite model, follow these steps:
72+
To record and build a custom voice lite model, follow these steps in [Speech Studio](https://aka.ms/speechstudio/customvoice):
7373

7474
1. Select **Custom voice** > Your project name > **Record and build**.
7575
1. Select **Get started**.
@@ -90,7 +90,7 @@ The training process takes approximately one compute hour. You can check the pro
9090

9191
## Review model
9292

93-
To review the CNV lite model and listen to your own synthetic voice, follow these steps:
93+
To review the custom voice lite model and listen to your own synthetic voice, follow these steps:
9494

9595
1. Select **Custom voice** > Your project name > **Review model**. Here you can review the voice model name, model language, sample data size, and training progress. The voice name is composed of the word "Neural" appended to your project name.
9696
1. Select the voice model name to review the model details and listen to the sample text to speech results.
@@ -101,7 +101,7 @@ To review the CNV lite model and listen to your own synthetic voice, follow thes
101101

102102
A verbal statement recorded by the voice talent is required before you can [deploy the model](#deploy-model) for your business use.
103103

104-
To submit the voice talent verbal statement, follow these steps:
104+
To submit the voice talent verbal statement, follow these steps in [Speech Studio](https://aka.ms/speechstudio/customvoice):
105105

106106
1. Select **Custom voice** > Your project name > **Deploy model** > **Manage your voice talent**.
107107
:::image type="content" source="media/custom-voice/lite/lite-voice-talent-consent.png" alt-text="Screenshot of the record voice talent consent dashboard.":::
@@ -113,20 +113,20 @@ To submit the voice talent verbal statement, follow these steps:
113113

114114
## Deploy model
115115

116-
To deploy your voice model and use it in your applications, you must get the full access to custom neural voice. Request access on the [intake form](https://aka.ms/customneural). Within approximately 10 business days, you receive an email with the approval status. A [verbal statement](#submit-verbal-statement) recorded by the voice talent is also required before you can deploy the model for your business use.
116+
To deploy your custom voice lite model and use it in your applications, you must get full access to custom voice. Request access on the [intake form](https://aka.ms/customneural). Within approximately 10 business days, you receive an email with the approval status. A [verbal statement](#submit-verbal-statement) recorded by the voice talent is also required before you can deploy the model for your business use.
117117

118-
To deploy a CNV lite model, follow these steps:
118+
To deploy a custom voice lite model, follow these steps in [Speech Studio](https://aka.ms/speechstudio/customvoice):
119119

120120
1. Select **Custom voice** > Your project name > **Deploy model** > **Deploy model**.
121121
1. Select a voice model name and then select **Next**.
122122
1. Enter a name and description for your endpoint and then select **Next**.
123123
1. Select the checkbox to agree to the terms of use and then select **Next**.
124124
1. Select **Deploy** to deploy the model.
125125

126-
From here, you can use the CNV lite voice model similarly as you would use a CNV professional voice model. For example, you can [suspend or resume](professional-voice-deploy-endpoint.md) an endpoint after it's created, to limit spend and conserve resources that aren't in use. You can also access the voice in the [Audio Content Creation](how-to-audio-content-creation.md) tool in the [Speech Studio](https://aka.ms/speechstudio/audiocontentcreation).
126+
From here, you can use the custom voice lite model the same way you would use a professional voice model. For example, you can [suspend or resume](professional-voice-deploy-endpoint.md) an endpoint after it's created, to limit spend and conserve resources that aren't in use. You can also access the voice in the [Audio Content Creation](how-to-audio-content-creation.md) tool in the [Speech Studio](https://aka.ms/speechstudio/audiocontentcreation).
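As a rough illustration of using the deployed voice, the sketch below builds an SSML request body that names the voice. It relies on the statement in the "Review model" section that the voice name is the project name with "Neural" appended. Whether spaces or other characters in the project name are altered is not stated, so this assumes a plain concatenation. `lite_voice_name` and `build_ssml` are hypothetical helpers, and sending the SSML to the service, including targeting your deployed endpoint, is not shown.

```python
from xml.sax.saxutils import escape, quoteattr


def lite_voice_name(project_name: str) -> str:
    """Derive the voice name: "Neural" appended to the project name.

    Assumption: plain concatenation with no other normalization.
    """
    return f"{project_name}Neural"


def build_ssml(voice_name: str, text: str, lang: str = "en-US") -> str:
    """Build a minimal SSML document that selects the given voice."""
    return (
        '<speak version="1.0" '
        'xmlns="http://www.w3.org/2001/10/synthesis" '
        f"xml:lang={quoteattr(lang)}>"
        f"<voice name={quoteattr(voice_name)}>{escape(text)}</voice>"
        "</speak>"
    )


if __name__ == "__main__":
    # "Contoso" is a placeholder project name.
    voice = lite_voice_name("Contoso")
    print(build_ssml(voice, "Hello & welcome to the demo."))
```

Escaping the text and quoting the attributes keeps the SSML well-formed when the input contains characters such as `&` or `<`.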
127127

128128
## Next steps
129129

130-
* [Create a CNV professional project](professional-voice-create-project.md)
130+
* [Fine-tune a professional voice](professional-voice-create-project.md)
131131
* [Try the text to speech quickstart](get-started-text-to-speech.md)
132132
* [Learn more about speech synthesis](how-to-speech-synthesis.md)
