articles/ai-services/speech-service/speech-synthesis-markup-voice.md
Lines changed: 4 additions & 4 deletions
@@ -6,7 +6,7 @@ author: eric-urban
 manager: nitinme
 ms.service: azure-ai-speech
 ms.topic: how-to
-ms.date: 3/10/2025
+ms.date: 7/9/2025
 ms.author: eur
 ms.custom: references_regions
 #Customer intent: As a developer, I want to learn how to use Speech Synthesis Markup Language (SSML) elements to customize what my Speech service voice sounds like.
@@ -559,9 +559,9 @@ This SSML snippet illustrates how to request blend shapes with your synthesized

 ## Voice conversion element

-Voice conversion is the process of transforming the voice characteristics of a given audio to a target voice speaker. After voice conversion, the resulting audio reserves source audio's linguistic content and prosody while the voice timbre sounds like the target speaker. For more information, see [voice conversion](./voice-conversion.md).
+Voice conversion (preview) is the process of transforming the voice characteristics of a given audio to a target voice speaker. After voice conversion, the resulting audio reserves source audio's linguistic content and prosody while the voice timbre sounds like the target speaker. For more information, see [voice conversion](./voice-conversion.md).

-Use the `<mstts:voiceconversion>` tag via Speech Synthesis Markup Language (SSML) to specify the source audio URL and the target voice for the conversion. For a complete list of supported target voices, see [supported voices for voice conversion](./language-support.md#voice-conversion).
+Use the `<mstts:voiceconversion>` tag via Speech Synthesis Markup Language (SSML) to specify the source audio URL and the target voice for the conversion. For a complete list of supported target voices, see [supported voices for voice conversion](./language-support.md?tabs=tts#voice-conversion).

 The following table describes the usage of the `mstts:voiceconversion` element's attributes:
@@ -573,7 +573,7 @@ Here's how the voice conversion works:
 - The source audio is a prerecorded audio file that contains the spoken words and prosody.
 - Text content: The final synthesized speech follows the spoken words in the source audio.
 - Prosody and rhythm: The speech maintains the timing and intonation from the source.
-- The `<voice>` tag specifies the target voice used for the output audio. For information about the supported target voices, see [supported voices for voice conversion](./language-support.md#voice-conversion).
+- The `<voice>` tag specifies the target voice used for the output audio. For information about the supported target voices, see [supported voices for voice conversion](./language-support.md?tabs=tts#voice-conversion).
 - The output audio keeps the timbre (tone and voice quality) of the target voice, but follows the text and speaking style of the source audio.
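
The bullets in this hunk describe how the `<voice>` tag (target voice) and the `<mstts:voiceconversion>` element (source audio) combine in one SSML document. A minimal Python sketch of assembling such a document is shown here. The `url` attribute name, the `mstts` namespace URI, the sample voice name, and the helper `build_voice_conversion_ssml` are assumptions for illustration and do not appear in this diff; the attribute table in the article is the authoritative source for the real names.

```python
import xml.etree.ElementTree as ET

SSML_NS = "http://www.w3.org/2001/10/synthesis"
# Assumed namespace URI for the mstts prefix; verify against the article.
MSTTS_NS = "http://www.w3.org/2001/mstts"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def build_voice_conversion_ssml(source_audio_url, target_voice, lang="en-US"):
    """Return an SSML string asking for voice conversion (hypothetical helper).

    source_audio_url: location of the prerecorded source audio.
    target_voice: name of the voice whose timbre the output should have.
    """
    # Serialize SSML elements without a prefix and mstts elements as "mstts:".
    ET.register_namespace("", SSML_NS)
    ET.register_namespace("mstts", MSTTS_NS)

    speak = ET.Element(
        f"{{{SSML_NS}}}speak",
        {"version": "1.0", f"{{{XML_NS}}}lang": lang},
    )
    # The <voice> tag selects the target voice for the output audio.
    voice = ET.SubElement(speak, f"{{{SSML_NS}}}voice", {"name": target_voice})
    # Assumption: the source audio location is passed in a "url" attribute.
    ET.SubElement(voice, f"{{{MSTTS_NS}}}voiceconversion", {"url": source_audio_url})
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    # Placeholder URL and voice name; substitute values supported by your resource.
    print(build_voice_conversion_ssml(
        "https://example.com/source.wav",
        "en-US-AvaMultilingualNeural",
    ))
```

The resulting string could then be passed to whichever Speech SDK or REST call accepts SSML input. Building the document with an XML library rather than string concatenation keeps the source URL correctly escaped.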
 Voice conversion is the process of transforming the voice characteristics of a given audio to a target voice speaker. After voice conversion, the resulting audio reserves source audio's linguistic content and prosody while the voice timbre sounds like the target speaker.

@@ -31,13 +33,13 @@ Voice conversion (or voice changer or speech to speech conversion) is built on s
 |**High speaker similarity**| Captures the timbre and vocal identity of the target speaker.<br>Generates audio that accurately matches the target voice. |
 |**Prosody preservation**| Maintains rhythm, stress, and intonation of source audio.<br>Preserves expressive and emotional qualities. |
-|**Multilingual support**| Enables multilingual voice conversion.<br>Supports 91 locales (same as standard text to speech locale support).<br>See [supported voices for voice conversion](./language-support.md#voice-conversion) for the complete list. |
+|**Multilingual support**| Enables multilingual voice conversion.<br>Supports 91 locales (same as standard text to speech locale support).<br>See [supported voices for voice conversion](./language-support.md?tabs=tts#voice-conversion) for the complete list. |

 ## Use voice conversion

 You can use Azure AI Speech voice conversion with either the Speech SDK or text to speech REST APIs.

-Use the `<mstts:voiceconversion>` tag via Speech Synthesis Markup Language (SSML) to specify the source audio URL and the target voice for the conversion. For a complete list of supported target voices, see [supported voices for voice conversion](./language-support.md#voice-conversion).
+Use the `<mstts:voiceconversion>` tag via Speech Synthesis Markup Language (SSML) to specify the source audio URL and the target voice for the conversion. For a complete list of supported target voices, see [supported voices for voice conversion](./language-support.md?tabs=tts#voice-conversion).