Skip to content

Commit d8c6334

Browse files
Merge pull request #6267 from eric-urban/eur/ai-speech
patch toc and viseme
2 parents 594db81 + c57fd28 commit d8c6334

File tree

2 files changed

+9
-0
lines changed

2 files changed

+9
-0
lines changed

articles/ai-foundry/toc.yml

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -483,6 +483,8 @@ items:
483483
items:
484484
- name: What is the Speech service?
485485
href: ../ai-services/speech-service/overview.md?context=/azure/ai-foundry/context/context
486+
- name: Voice live
487+
href: ../ai-services/speech-service/voice-live-quickstart.md?context=/azure/ai-foundry/context/context
486488
- name: Real-time speech to text
487489
href: ../ai-services/speech-service/get-started-speech-to-text.md?context=/azure/ai-foundry/context/context
488490
- name: Fast transcription
@@ -499,6 +501,8 @@ items:
499501
href: ../ai-services/speech-service/pronunciation-assessment-tool.md?context=/azure/ai-foundry/context/context
500502
- name: Speech translation
501503
href: ../ai-services/speech-service/get-started-speech-translation.md?context=/azure/ai-foundry/context/context
504+
- name: Video translation
505+
href: ../ai-services/speech-service/video-translation-get-started.md?context=/azure/ai-foundry/context/context
502506
- name: Translator
503507
items:
504508
- name: What is Azure AI Translator?

articles/ai-services/speech-service/voice-live-how-to.md

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -324,8 +324,13 @@ And a `response.audio_timestamp.done` message is sent when all timestamps are re
324324

325325
### Viseme
326326

327+
A viseme is the visual description of a phoneme in spoken language. It defines the position of the face and mouth while a person is speaking.
328+
327329
You can use Azure standard voice or Azure custom voice with `animation.outputs` set to `{"viseme_id"}`. The service returns the `response.animation_viseme.delta` in the response and `response.animation_viseme.done` when all viseme messages are returned.
328330

331+
> [!TIP]
332+
> For more information about viseme via Speech Synthesis Markup Language (SSML), see [viseme element documentation](speech-synthesis-markup-voice.md#viseme-element).
333+
329334
To configure the viseme, you can set the `animation.outputs` in the `session.update` message. The `animation.outputs` parameter is optional. It configures which animation outputs should be returned. Currently, it only supports `viseme_id`.
330335

331336
```json

0 commit comments

Comments
 (0)