Skip to content

Commit c675235

Browse files
Merge pull request #249766 from sally-baolian/patch-146
Update batch-synthesis.md
2 parents ce0c65b + 515e51b commit c675235

File tree

2 files changed

+19
-1
lines changed

2 files changed

+19
-1
lines changed

articles/ai-services/speech-service/batch-synthesis.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -365,6 +365,24 @@ Batch synthesis properties are described in the following table.
365365
|`synthesisConfig.voice`|The voice that speaks the audio output.<br/><br/>For information about the available prebuilt neural voices, see [language and voice support](language-support.md?tabs=tts). To use a custom voice, you must specify a valid custom voice and deployment ID mapping in the `customVoices` property.<br/><br/>This property is required when `textType` is set to `"PlainText"`.|
366366
|`synthesisConfig.volume`|The volume of the audio output.<br/><br/>For information about the accepted values, see the [adjust prosody](speech-synthesis-markup-voice.md#adjust-prosody) table in the Speech Synthesis Markup Language (SSML) documentation. Invalid values are ignored.<br/><br/>This optional property is only applicable when `textType` is set to `"PlainText"`.|
367367
|`textType`|Indicates whether the `inputs` text property should be plain text or SSML. The possible case-insensitive values are "PlainText" and "SSML". When the `textType` is set to `"PlainText"`, you must also set the `synthesisConfig` voice property.<br/><br/>This property is required.|
368+
369+
## Batch synthesis latency and best practices
370+
371+
When using batch synthesis for generating synthesized speech, it's important to consider the latency involved and follow best practices for achieving optimal results.
372+
373+
### Latency in batch synthesis
374+
375+
The latency in batch synthesis depends on various factors, including the complexity of the input text, the number of inputs in the batch, and the processing capabilities of the underlying hardware.
376+
377+
The latency for batch synthesis is as follows (approximately):
378+
379+
- The latency of 50% of the synthesized speech outputs is within 10-20 seconds.
380+
381+
- The latency of 95% of the synthesized speech outputs is within 120 seconds.
382+
383+
### Best practices
384+
385+
When considering batch synthesis for your application, it's recommended to assess whether the latency meets your requirements. If the latency aligns with your desired performance, batch synthesis can be a suitable choice. However, if the latency does not meet your needs, you might consider using real-time API.
368386

369387
## HTTP status codes
370388

articles/ai-services/speech-service/speech-services-quotas-and-limits.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -82,7 +82,7 @@ You can use real-time text to speech with the [Speech SDK](speech-sdk.md) or the
8282

8383
#### Batch synthesis
8484

85-
These limits aren't adjustable.
85+
These limits aren't adjustable. For more information on batch synthesis latency, refer to [the batch synthesis latency and best practices](batch-synthesis.md#batch-synthesis-latency-and-best-practices).
8686

8787
| Quota | Free (F0) | Standard (S0) |
8888
|--|--|--|

0 commit comments

Comments
 (0)