Commit eefdf85

Merge pull request #5835 from yulin-li/yulin/phrase-list-doc
[voice live] update phrase list path
2 parents 242c200 + 6d85930 commit eefdf85

1 file changed: +30 -29 lines changed

articles/ai-services/speech-service/voice-live-how-to.md

Lines changed: 30 additions & 29 deletions
@@ -174,20 +174,21 @@ Here's an example of end of utterance detection in a session object:

 ### Phrase list

-Use phrase list for lightweight just-in-time customization on audio input. To configure phrase list, you can set the phrase_list in the `session.update` message.
+Use phrase list for lightweight just-in-time customization on audio input. To configure phrase list, you can set the phrase_list in the `session.update` message.

 ```json
-{
-"session": {
-"input_audio": {
-"phrase_list": ["Neo QLED TV", "TUF Gaming", "AutoQuote Explorer"]
-}
-}
-}
+{
+"session": {
+"input_audio_transcription": {
+"model": "azure-fast-transcription",
+"phrase_list": ["Neo QLED TV", "TUF Gaming", "AutoQuote Explorer"]
+}
+}
+}
 ```

 > [!NOTE]
-> Phrase list currently doesn't support gpt-4o-realtime-preview, gpt-4o-mini-realtime-preview, and phi4-mm-realtime. To learn more about phrase list, see [phrase list for speech to text](./improve-accuracy-phrase-list.md).
+> Phrase list currently doesn't support gpt-4o-realtime-preview, gpt-4o-mini-realtime-preview, and phi4-mm-realtime. To learn more about phrase list, see [phrase list for speech to text](./improve-accuracy-phrase-list.md).

 ## Audio output through Azure text to speech

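The hunk above moves `phrase_list` from `session.input_audio` to `session.input_audio_transcription` and adds a `model` field next to it. As a rough illustration of the updated shape, the following Python sketch builds such a payload. The helper name `build_phrase_list_update` is hypothetical, and the top-level `"type": "session.update"` field is an assumption about the message envelope; the diff itself only shows the `session` object.

```python
import json


def build_phrase_list_update(phrases, model="azure-fast-transcription"):
    """Build a session.update payload that carries a phrase list.

    Hypothetical helper: the layout under "session" follows the updated
    example in the diff; the "type" field is an assumption.
    """
    # Drop empty entries and duplicates while keeping the original order.
    cleaned = list(dict.fromkeys(p.strip() for p in phrases if p and p.strip()))
    if not cleaned:
        raise ValueError("phrase list must contain at least one phrase")
    return {
        "type": "session.update",  # assumed event envelope
        "session": {
            "input_audio_transcription": {
                "model": model,
                "phrase_list": cleaned,
            }
        },
    }


message = build_phrase_list_update(
    ["Neo QLED TV", "TUF Gaming", "AutoQuote Explorer"]
)
print(json.dumps(message, indent=2))
```

A client that still sends `phrase_list` under `input_audio` would be using the path this commit removes from the documentation.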
@@ -247,36 +248,36 @@ You can use a custom voice for audio output. For information about how to create
 }
 ```

-### Custom lexicon
+### Custom lexicon

 Use the `custom_lexicon_url` string property to customize pronunciation for both standard Azure text to speech voices and custom voices. To learn more about how to format the custom lexicon (the same as Speech Synthesis Markup Language (SSML)), see [custom lexicon for text to speech](./speech-synthesis-markup-pronunciation.md#custom-lexicon).

 ```json
-{
-  "voice": {
-    "name": "en-US-Ava:DragonHDLatestNeural",
-    "type": "azure-standard",
-    "temperature": 0.8, // optional
-    "custom_lexicon_url": "<custom lexicon url>"
-}
-}
+{
+  "voice": {
+    "name": "en-US-Ava:DragonHDLatestNeural",
+    "type": "azure-standard",
+    "temperature": 0.8, // optional
+    "custom_lexicon_url": "<custom lexicon url>"
+}
+}
 ```

-### Speaking rate
+### Speaking rate

-Use the `rate` string property to adjust the speaking speed for any standard Azure text to speech voices and custom voices.
+Use the `rate` string property to adjust the speaking speed for any standard Azure text to speech voices and custom voices.

-The rate value should range from 0.5 to 1.5, with higher values indicating faster speeds.
+The rate value should range from 0.5 to 1.5, with higher values indicating faster speeds.

 ```json
-{
-  "voice": {
-    "name": "en-US-Ava:DragonHDLatestNeural",
-    "type": "azure-standard",
-    "temperature": 0.8, // optional
-   "rate": "1.2"
-}
-}
+{
+  "voice": {
+    "name": "en-US-Ava:DragonHDLatestNeural",
+    "type": "azure-standard",
+    "temperature": 0.8, // optional
+   "rate": "1.2"
+}
+}
 ```

 ### Audio timestamps

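The voice examples touched by this hunk state two constraints in prose: `rate` is a string property, and its value should fall between 0.5 and 1.5. A minimal Python sketch of how a client might enforce both before sending the configuration follows. The helper name `build_voice_config` and its parameters are hypothetical; only the property names and the range come from the documented text.

```python
def build_voice_config(name, rate=None, custom_lexicon_url=None,
                       voice_type="azure-standard"):
    """Return a {"voice": {...}} object like the examples in the diff.

    Hypothetical helper, not part of any SDK.
    """
    voice = {"name": name, "type": voice_type}
    if rate is not None:
        # The documented range is 0.5 to 1.5; higher means faster speech.
        if not 0.5 <= rate <= 1.5:
            raise ValueError(f"rate must be between 0.5 and 1.5, got {rate}")
        # The property is documented as a string, so serialize the number.
        voice["rate"] = str(rate)
    if custom_lexicon_url is not None:
        voice["custom_lexicon_url"] = custom_lexicon_url
    return {"voice": voice}


config = build_voice_config("en-US-Ava:DragonHDLatestNeural", rate=1.2)
print(config)
```

Validating on the client side is a design choice for this sketch; the documented text does not say how the service reacts to an out-of-range value.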