Skip to content

Commit bd1355b

Browse files
committed
Merge branch 'voicelive-customization-ga' of https://github.com/goergenj/azure-ai-docs-pr-jagoerge into voicelive-customization-ga
2 parents dc11cd4 + 8fd4a3e commit bd1355b

File tree

1 file changed

+12
-2
lines changed

1 file changed

+12
-2
lines changed

articles/ai-services/speech-service/voice-live-how-to-customize.md

Lines changed: 12 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -22,7 +22,7 @@ Voice live provides multiple options to optimize performance and quality by usin
2222
- Phrase-list: A lightweight just-in-time customization based on a list of words or phrases provided as part of the session configuration to help improve recognition quality. See [Improve recognition accuracy with phrase list](./improve-accuracy-phrase-list) to learn more.
2323
- Custom Speech: With custom speech, you can evaluate and improve the accuracy of speech recognition for your applications and products and fine-tune the recognition quality to your business needs. See [What is custom speech?](./custom-speech-overview) to learn more.
2424
- Speech output customization:
25-
- Custom Lexicon: Custom Lexicon allows you to easily customize pronunciation for both standard Azure text to speech voices and custom voices to improve speech output synthesization quality. See [custom lexicon for text to speech](./speech-synthesis-markup-pronunciation.md#custom-lexicon) to learn more.
25+
- Custom lexicon: Custom lexicon allows you to easily customize pronunciation for both standard Azure text to speech voices and custom voices to improve speech synthesis accuracy for your use case. See [custom lexicon for text to speech](./speech-synthesis-markup-pronunciation.md#custom-lexicon) to learn more.
2626
- Custom voice: Custom voice lets you create a one-of-a-kind, customized, synthetic voice for your applications. With custom voice, you can build a highly natural-sounding voice for your brand or characters by providing human speech samples as fine-tuning data. See [What is custom voice?](./custom-neural-voice) to learn more.
2727
- Custom avatar: Custom text to speech avatar allows you to create a customized, one-of-a-kind synthetic talking avatar for your application. With custom text to speech avatar, you can build a unique and natural-looking avatar for your product or brand by providing video recording data of your selected actors. See [What is custom text to speech avatar?](./text-to-speech-avatar/what-is-custom-text-to-speech-avatar) to learn more.
2828

@@ -69,7 +69,8 @@ Example session configuration with custom speech models. In this case, if the de
6969
```
7070

7171
> [!NOTE]
72-
> In order to use a custom speech model with voice live API the model must be available on the same Azure AI Foundry resource you are using to call the voice live API. If you trained the model on a different Azure AI Foundry or Azure AI Speech resource you have to copy the model to the resource you are using to call the voice live API.
72+
> In order to use a custom speech model with voice live API, the model must be available on the same Azure AI Foundry resource you are using to call the voice live API. If you trained the model on a different Azure AI Foundry or Azure AI Speech resource you have to copy the model to the resource you are using to call the voice live API.
73+
> You pay separately for custom speech training and model hosting.
7374
7475
## Speech output customization
7576

@@ -103,12 +104,21 @@ You can use a custom voice for audio output. For information about how to create
103104
}
104105
```
105106

107+
> [!NOTE]
108+
> In order to use a custom voice model with voice live API, the model must be available on the same Azure AI Foundry resource you are using to call the voice live API. If you trained the model on a different Azure AI Foundry or Azure AI Speech resource you have to copy the model to the resource you are using to call the voice live API.
109+
> You pay separately for custom voice training and model hosting.
110+
106111
### Azure custom avatar
107112

108113
[Text to speech avatar](./text-to-speech-avatar/what-is-text-to-speech-avatar.md) converts text into a digital video of a photorealistic human (either a standard avatar or a [custom text to speech avatar](./text-to-speech-avatar/what-is-custom-text-to-speech-avatar.md)) speaking with a natural-sounding voice.
109114

110115
The configuration for a custom avatar does not differ from the configuration of a standard avatar. Please refer to [How to use the voice live API - Azure text to speech avatar](./voice-live-how-to#azure-text-to-speech-avatar) for a detailed example.
111116

117+
> [!NOTE]
118+
> In order to use a custom voice model with voice live API, the model must be available on the same Azure AI Foundry resource you are using to call the voice live API. If you trained the model on a different Azure AI Foundry or Azure AI Speech resource you have to copy the model to the resource you are using to call the voice live API.
119+
> You pay separately for custom avatar training and model hosting.
120+
121+
112122
## Related content
113123

114124
- Try out the [voice live API quickstart](./voice-live-quickstart.md)

0 commit comments

Comments
 (0)