Skip to content

Commit 0998b27

Browse files
Update batch-synthesis.md
1 parent 449db5b commit 0998b27

File tree

1 file changed

+18
-0
lines changed

1 file changed

+18
-0
lines changed

articles/ai-services/speech-service/batch-synthesis.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -365,6 +365,24 @@ Batch synthesis properties are described in the following table.
365365
|`synthesisConfig.voice`|The voice that speaks the audio output.<br/><br/>For information about the available prebuilt neural voices, see [language and voice support](language-support.md?tabs=tts). To use a custom voice, you must specify a valid custom voice and deployment ID mapping in the `customVoices` property.<br/><br/>This property is required when `textType` is set to `"PlainText"`.|
366366
|`synthesisConfig.volume`|The volume of the audio output.<br/><br/>For information about the accepted values, see the [adjust prosody](speech-synthesis-markup-voice.md#adjust-prosody) table in the Speech Synthesis Markup Language (SSML) documentation. Invalid values are ignored.<br/><br/>This optional property is only applicable when `textType` is set to `"PlainText"`.|
367367
|`textType`|Indicates whether the `inputs` text property should be plain text or SSML. The possible case-insensitive values are "PlainText" and "SSML". When the `textType` is set to `"PlainText"`, you must also set the `synthesisConfig` voice property.<br/><br/>This property is required.|
368+
369+
## Batch synthesis latency and best practices
370+
371+
When using batch synthesis for generating synthesized speech, it's important to consider the latency involved and follow best practices for achieving optimal results.
372+
373+
### Latency in batch synthesis
374+
375+
The latency in batch synthesis depends on various factors, including the complexity of the input text, the number of inputs in the batch, and the processing capabilities of the underlying hardware.
376+
377+
The latency for batch synthesis is as follows (approximately):
378+
379+
- The latency of 50% of the synthesized speech outputs is within 10-20 seconds.
380+
381+
- The latency of 95% of the synthesized speech outputs is within 120 seconds.
382+
383+
### Best practices
384+
385+
When considering batch synthesis for your application, it's recommended to assess whether the latency meets your requirements. If the latency aligns with your desired performance, batch synthesis can be a suitable choice. However, if the latency does not meet your needs, you might consider using real-time API.
368386

369387
## HTTP status codes
370388

0 commit comments

Comments
 (0)