Skip to content

Commit 2ae671a

Browse files
committed
add note
1 parent e70adf2 commit 2ae671a

File tree

1 file changed

+6
-1
lines changed

1 file changed

+6
-1
lines changed

articles/ai-services/speech-service/speech-services-quotas-and-limits.md

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -169,7 +169,12 @@ The following quotas are adjustable for Standard (S0) resources. The Free (F0) r
169169
- Text to speech [maximum number of transactions per time period](#text-to-speech-quotas-and-limits-per-resource) for standard voices and custom voices
170170
- Speech translation [concurrent request limit](#real-time-speech-to-text-and-speech-translation)
171171

172-
Before requesting a quota increase (where applicable), check your current TPS (transactions per second) and ensure that it's necessary to increase the quota. Speech service uses autoscaling technologies to bring the required computational resources in on-demand mode. At the same time, Speech service tries to keep your costs low by not maintaining an excessive amount of hardware capacity.
172+
Before requesting a quota increase (where applicable), check your current TPS (transactions per second) and ensure that you need to increase the quota.
173+
174+
> [!NOTE]
175+
> Batch transcription is an asynchronous process, and jobs are processed one-by-one in a queue. So, increasing the quota won't improve transcription performance. For performance improvements, see [Batch transcription best practices](./batch-transcription.md#best-practices-for-improving-performance).
176+
177+
Speech service uses autoscaling technologies to bring the required computational resources in on-demand mode. At the same time, Speech service tries to keep your costs low by not maintaining an excessive amount of hardware capacity.
173178

174179
Let's look at an example. Suppose that your application receives response code 429, which indicates that there are too many requests. Your application receives this response even though your workload is within the limits defined by the [Quotas and limits reference](#quotas-and-limits-reference). The most likely explanation is that Speech service is scaling up to your demand and didn't reach the required scale yet. Therefore the service doesn't immediately have enough resources to serve the request. In such cases, increasing the quota won’t help. In most cases, the Speech service will scale up soon, and the issue causing response code 429 will be resolved.
175180

0 commit comments

Comments
 (0)