You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cognitive-services/Speech-Service/speech-services-quotas-and-limits.md
+5-3Lines changed: 5 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -22,7 +22,7 @@ For the free (F0) pricing tier, see also the monthly allowances at the [pricing
22
22
23
23
The following sections provide you with a quick guide to the quotas and limits that apply to Speech service.
24
24
25
-
For information about adjustable quotas for Standard (S0) Speech resources, see [additional explanations](#detailed-description-quota-adjustment-and-best-practices), [best practices](#general-best-practices-to-mitigate-throttling-during-autoscaling), and [adjustment instructions](#speech-to-text-increase-online-transcription-concurrent-request-limit). Request limits for Free (F0) Speech resources aren't adjustable.
25
+
For information about adjustable quotas for Standard (S0) Speech resources, see [additional explanations](#detailed-description-quota-adjustment-and-best-practices), [best practices](#general-best-practices-to-mitigate-throttling-during-autoscaling), and [adjustment instructions](#speech-to-text-increase-online-transcription-concurrent-request-limit). The quotas and limits for Free (F0) Speech resources aren't adjustable.
26
26
27
27
### Speech-to-text quotas and limits per resource
28
28
@@ -53,6 +53,8 @@ You can use online transcription with the [Speech SDK](speech-sdk.md) or the [sp
53
53
54
54
#### Model customization
55
55
56
+
The limits in this table apply per Speech resource when you create a Custom Speech model.
57
+
56
58
| Quota | Free (F0) | Standard (S0) |
57
59
|--|--|--|
58
60
| REST API limit | 300 requests per minute | 300 requests per minute |
@@ -62,7 +64,7 @@ You can use online transcription with the [Speech SDK](speech-sdk.md) or the [sp
62
64
| Max pronunciation dataset file size for data import | 1 KB | 1 MB |
63
65
| Max text size when you're using the `text` parameter in the [Models_Create](https://westcentralus.dev.cognitive.microsoft.com/docs/services/speech-to-text-api-v3-1/operations/Models_Create/) API request | 200 KB | 500 KB |
64
66
65
-
### Text-to-speech quotas and limits per Speech resource
67
+
### Text-to-speech quotas and limits per resource
66
68
67
69
This section describes text-to-speech quotas and limits per Speech resource.
68
70
@@ -117,7 +119,7 @@ Some of the Speech service quotas are adjustable. This section provides addition
117
119
The following quotas are adjustable for Standard (S0) resources. The Free (F0) request limits aren't adjustable.
118
120
119
121
- Speech-to-text [concurrent request limit](#online-transcription) for base model endpoint and custom endpoint
120
-
- Text-to-speech [maximum number of transactions per time period](#text-to-speech-quotas-and-limits-per-speech-resource) for prebuilt neural voices and custom neural voices
122
+
- Text-to-speech [maximum number of transactions per time period](#text-to-speech-quotas-and-limits-per-resource) for prebuilt neural voices and custom neural voices
Before requesting a quota increase (where applicable), ensure that it's necessary. Speech service uses autoscaling technologies to bring the required computational resources in on-demand mode. At the same time, Speech service tries to keep your costs low by not maintaining an excessive amount of hardware capacity.
0 commit comments