Commit 3eb4d3a

Merge pull request #281932 from sally-baolian/patch-275
Update personal-voice-how-to-use.md
2 parents d434a7a + c14bc44 commit 3eb4d3a

1 file changed

articles/ai-services/speech-service/personal-voice-how-to-use.md

Lines changed: 37 additions & 1 deletion
@@ -46,9 +46,45 @@ Here's example SSML in a request for text to speech with the voice name and the
 You can use the SSML via the [Speech SDK](./get-started-text-to-speech.md) or [REST API](rest-text-to-speech.md).
 
 * **Real-time speech synthesis**: Use the [Speech SDK](./get-started-text-to-speech.md) or [REST API](rest-text-to-speech.md) to convert text to speech.
-* When you use Speech SDK, don't set Endpoint Id, just like prebuild voice.
+* When you use Speech SDK, don't set Endpoint ID, just like prebuild voice.
 * When you use REST API, please use prebuilt neural voices endpoint.
 
+## Supported and unsupported SSML elements for personal voice
+
+For detailed information on the supported and unsupported SSML elements for Phoenix and Dragon models, refer to the following table. For instructions on how to use SSML elements, refer to the [SSML document structure and events](speech-synthesis-markup-structure.md).
+
+| Element | Description | Supported in Phoenix | Supported in Dragon |
+|-------------------------|-----------------------------------------------------------------------------|----------------------|---------------------|
+| `<voice>` | Specifies the voice and optional effects (`eq_car` and `eq_telecomhp8k`). | Yes | Yes |
+| `<mstts:express-as>` | Specifies speaking styles and roles. | No | No |
+| `<mstts:ttsembedding>` | Specifies the `speakerProfileId` property for a personal voice. | Yes | No |
+| `<lang xml:lang>` | Specifies the speaking language. | Yes | Yes |
+| `<prosody>` | Adjusts pitch, contour, range, rate, and volume. | | |
+|&nbsp;&nbsp;&nbsp;`pitch` | Indicates the baseline pitch for the text. | No | No |
+| &nbsp;&nbsp;&nbsp;`contour`| Represents changes in pitch. | No | No |
+| &nbsp;&nbsp;&nbsp;`range` | Represents the range of pitch for the text. | No | No |
+| &nbsp;&nbsp;&nbsp;`rate` | Indicates the speaking rate of the text. | Yes | Yes |
+| &nbsp;&nbsp;&nbsp;`volume`| Indicates the volume level of the speaking voice. | No | No |
+| `<emphasis>` | Adds or removes word-level stress for the text. | No | No |
+| `<audio>` | Embeds prerecorded audio into an SSML document. | Yes | No |
+| `<mstts:audioduration>` | Specifies the duration of the output audio. | No | No |
+| `<mstts:backgroundaudio>`| Adds background audio to your SSML documents or mixes an audio file with text to speech. | Yes | No |
+| `<phoneme>` | Specifies phonetic pronunciation in SSML documents. | | |
+| &nbsp;&nbsp;&nbsp;`ipa` | One of the phonetic alphabets. | Yes | No |
+| &nbsp;&nbsp;&nbsp;`sapi` | One of the phonetic alphabets. | No | No |
+| &nbsp;&nbsp;&nbsp;`ups` | One of the phonetic alphabets. | Yes | No |
+| &nbsp;&nbsp;&nbsp;`x-sampa`| One of the phonetic alphabets. | Yes | No |
+| `<lexicon>` | Defines how multiple entities are read in SSML. | Yes | Yes (only support alias) |
+| `<say-as>` | Indicates the content type, such as number or date, of the element's text. | Yes | Yes |
+| `<sub>` | Indicates that the alias attribute's text value should be pronounced instead of the element's enclosed text. | Yes | Yes |
+| `<math>` | Uses the MathML as input text to properly pronounce mathematical notations in the output audio. | Yes | No |
+| `<bookmark>` | Gets the offset of each marker in the audio stream. | Yes | No |
+| `<break>` | Overrides the default behavior of breaks or pauses between words. | Yes | Yes |
+| `<mstts:silence>` | Inserts pauses before or after text, or between two adjacent sentences. | Yes | No |
+| `<mstts:viseme>` | Defines the position of the face and mouth while a person is speaking. | Yes | No |
+| `<p>` | Denotes paragraphs in SSML documents. | Yes | Yes |
+| `<s>` | Denotes sentences in SSML documents. | Yes | Yes |
+
 ## Reference documentation
 
 > [!div class="nextstepaction"]
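
The support table added in this commit can also be read as a lookup. As a rough sketch (not part of the commit or of the Speech SDK), the Python below copies a subset of the table rows into a dictionary and lists the elements in an SSML string that the table marks as unsupported for a given model. The helper names `qualified_name` and `unsupported`, the voice name `ExampleVoice`, and the all-zero `speakerProfileId` are hypothetical placeholders. The `<phoneme>` and `<lexicon>` rows are left out for brevity because their support depends on attributes.

```python
import xml.etree.ElementTree as ET

MSTTS_NS = "http://www.w3.org/2001/mstts"
MODELS = ("phoenix", "dragon")

# Subset of the table in this diff: element -> (Phoenix, Dragon).
SUPPORT = {
    "voice": (True, True),
    "mstts:express-as": (False, False),
    "mstts:ttsembedding": (True, False),
    "lang": (True, True),
    "emphasis": (False, False),
    "audio": (True, False),
    "mstts:audioduration": (False, False),
    "mstts:backgroundaudio": (True, False),
    "say-as": (True, True),
    "sub": (True, True),
    "math": (True, False),
    "bookmark": (True, False),
    "break": (True, True),
    "mstts:silence": (True, False),
    "mstts:viseme": (True, False),
    "p": (True, True),
    "s": (True, True),
}

# The <prosody> row is split by attribute in the table.
PROSODY_ATTRS = {
    "pitch": (False, False),
    "contour": (False, False),
    "range": (False, False),
    "rate": (True, True),
    "volume": (False, False),
}


def qualified_name(tag):
    """Map an ElementTree tag such as '{ns}name' to the table's spelling."""
    if tag.startswith("{"):
        ns, local = tag[1:].split("}", 1)
        return f"mstts:{local}" if ns == MSTTS_NS else local
    return tag


def unsupported(ssml, model):
    """Return the elements/attributes the table marks 'No' for this model."""
    col = MODELS.index(model)
    problems = []
    for el in ET.fromstring(ssml).iter():
        name = qualified_name(el.tag)
        if name in SUPPORT and not SUPPORT[name][col]:
            problems.append(f"<{name}>")
        if name == "prosody":
            for attr in el.attrib:
                if attr in PROSODY_ATTRS and not PROSODY_ATTRS[attr][col]:
                    problems.append(f"<prosody {attr}>")
    return problems


# Hypothetical SSML; the voice name and profile ID are placeholders.
SSML = """<speak version='1.0' xmlns='http://www.w3.org/2001/10/synthesis'
       xmlns:mstts='http://www.w3.org/2001/mstts' xml:lang='en-US'>
  <voice name='ExampleVoice'>
    <mstts:ttsembedding speakerProfileId='00000000-0000-0000-0000-000000000000'>
      <prosody rate='+10%' pitch='high'>Hello <emphasis>world</emphasis></prosody>
    </mstts:ttsembedding>
  </voice>
</speak>"""

for model in MODELS:
    print(model, unsupported(SSML, model))
```

Elements that are not in the dictionary, such as `<speak>`, are skipped rather than flagged, so the check only reflects what the table states explicitly.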
