Commit 9d51a99

Update embedded-speech.md
1 parent 5983486 commit 9d51a99

1 file changed (+44, -0 lines changed)

articles/ai-services/speech-service/embedded-speech.md

Lines changed: 44 additions & 0 deletions
@@ -281,6 +281,50 @@ With hybrid speech configuration for [text to speech](text-to-speech.md) (voices

For cloud speech, you use the `SpeechConfig` object, as shown in the [speech to text quickstart](get-started-speech-to-text.md) and [text to speech quickstart](get-started-text-to-speech.md). To run the quickstarts for embedded speech, you can replace `SpeechConfig` with `EmbeddedSpeechConfig` or `HybridSpeechConfig`. Most of the other speech recognition and synthesis code is the same, whether you use a cloud, embedded, or hybrid configuration.

## Embedded voices capabilities

Because of differences in model structure, embedded voices currently don't support some SSML tags. The following table lists the support status of each tag.

| Element | Attribute | Sub values | Supported in embedded neural TTS |
|-----------------------|--------------|----------------------------------------------------------------------|-----|
| audio | src | | No |
| bookmark | | | Yes |
| break | strength | | No |
| | time | | No |
| silence | type | Leading, Tailing, Comma-exact, etc. | No |
| | value | | No |
| emphasis | level | | No |
| lang | | | No |
| lexicon | uri | | Yes |
| math | | | No |
| mstts:audioduration | value | | No |
| mstts:backgroundaudio | src | | No |
| | volume | | No |
| | fadein | | No |
| | fadeout | | No |
| mstts:express-as | style | | No |
| | styledegree | | No |
| | role | | No |
| mstts:silence | | | No |
| mstts:viseme | type | redlips_front, FacialExpression | No |
| p | | | Yes |
| phoneme | alphabet | ipa, sapi, ups, etc. | Yes |
| | ph | | Yes |
| prosody | contour | Sentence-level support; word-level support only for en-US and zh-CN | Yes |
| | pitch | | Yes |
| | range | | Yes |
| | rate | | Yes |
| | volume | | Yes |
| s | | | Yes |
| say-as | interpret-as | characters, spell-out, number_digit, date, etc. | Yes |
| | format | | Yes |
| | detail | | Yes |
| sub | alias | | Yes |
| speak | | | Yes |
| voice | | | No |

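One way to apply the table above is to scan an SSML document for unsupported elements before sending it to an embedded voice. The following Python sketch is illustrative only and is not part of the Speech SDK. The `UNSUPPORTED_ELEMENTS` set is transcribed by hand from the rows marked "No", the helper names are hypothetical, and the `mstts` rows are matched by local name (without the prefix). It checks elements only, not individual attributes.

```python
import xml.etree.ElementTree as ET

# Assumption: transcribed by hand from the rows marked "No" in the table.
# mstts-prefixed elements are listed by local name (for example, "express-as").
UNSUPPORTED_ELEMENTS = frozenset({
    "audio", "break", "silence", "emphasis", "lang", "math",
    "audioduration", "backgroundaudio", "express-as", "viseme", "voice",
})


def local_name(tag):
    """Drop the '{namespace}' part that ElementTree puts in front of qualified tags."""
    return tag.rsplit("}", 1)[-1]


def find_unsupported_elements(ssml):
    """Return the sorted local names of elements in `ssml` that the table marks 'No'."""
    root = ET.fromstring(ssml)
    found = {local_name(element.tag) for element in root.iter()}
    return sorted(found & UNSUPPORTED_ELEMENTS)


# Hypothetical sample: prosody, p, and s are supported; break and express-as are not.
SAMPLE_SSML = """<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis"
       xmlns:mstts="https://www.w3.org/2001/mstts" xml:lang="en-US">
  <p><s><prosody rate="slow">Hello</prosody><break time="200ms"/>
  <mstts:express-as style="cheerful">world</mstts:express-as></s></p>
</speak>"""

print(find_unsupported_elements(SAMPLE_SSML))  # ['break', 'express-as']
```

An empty result means that none of the elements in the document are marked "No" in the table. It doesn't guarantee that synthesis succeeds.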
## Next steps