You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/search/search-limits-quotas-capacity.md
+23-27Lines changed: 23 additions & 27 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,7 +8,7 @@ author: HeidiSteen
8
8
ms.author: heidist
9
9
ms.service: azure-ai-search
10
10
ms.topic: conceptual
11
-
ms.date: 10/22/2024
11
+
ms.date: 10/24/2024
12
12
ms.custom:
13
13
- references_regions
14
14
- build-2024
@@ -18,21 +18,21 @@ ms.custom:
18
18
19
19
Maximum limits on storage, workloads, and quantities of indexes and other objects depend on whether you [create Azure AI Search](search-create-service-portal.md) at **Free**, **Basic**, **Standard**, or **Storage Optimized** pricing tiers.
20
20
21
-
+**Free** is a multitenant shared service that comes with your Azure subscription.
21
+
+**Free** is a multitenant shared service that comes with your Azure subscription.
22
22
23
23
+**Basic** provides dedicated computing resources for production workloads at a smaller scale, but shares some networking infrastructure with other tenants.
24
24
25
-
+**Standard** runs on dedicated machines with more storage and processing capacity at every level. Standard comes in four levels: S1, S2, S3, and S3 HD. S3 High Density (S3 HD) is engineered for [multi-tenancy](search-modeling-multitenant-saas-applications.md) and large quantities of small indexes (3,000 indexes per service). S3 HD doesn't provide the [indexer feature](search-indexer-overview.md) and data ingestion must use APIs that push data from source to index.
25
+
+**Standard** runs on dedicated machines with more storage and processing capacity at every level. Standard comes in four levels: S1, S2, S3, and S3 HD. S3 High Density (S3 HD) is engineered for [multi-tenancy](search-modeling-multitenant-saas-applications.md) and large quantities of small indexes (3,000 indexes per service). S3 HD doesn't provide the [indexer feature](search-indexer-overview.md) and data ingestion must use APIs that push data from source to index.
26
26
27
27
+**Storage Optimized** runs on dedicated machines with more total storage, storage bandwidth, and memory than **Standard**. This tier targets large, slow-changing indexes. Storage Optimized comes in two levels: L1 and L2.
<sup>1</sup> Basic services created before December 2017 have lower limits (5 instead of 15) on indexes. Basic tier is the only tier with a lower limit of 100 fields per index.
52
+
<sup>1</sup> Basic services created before December 2017 have lower limits (5 instead of 15) on indexes. Basic tier is the only tier with a lower limit of 100 fields per index.
53
53
54
54
<sup>2</sup> The upper limit on fields includes both first-level fields and nested subfields in a complex collection. For example, if an index contains 15 fields and has two complex collections with five subfields each, the field count of your index is 25. Indexes with a very large fields collection can be slow. [Limit fields and attributes](search-what-is-an-index.md#physical-structure-and-size) to just those you need, and run indexing and query test to ensure performance is acceptable.
55
55
@@ -59,9 +59,7 @@ Maximum limits on storage, workloads, and quantities of indexes and other object
59
59
60
60
You might find some variation in maximum limits if your service happens to be provisioned on a more powerful cluster. The limits here represent the common denominator. Indexes built to the above specifications are portable across equivalent service tiers in any region.
61
61
62
-
<aname="document-limits"></a>
63
-
64
-
## Document limits
62
+
## Document limits
65
63
66
64
Maximum number of documents per index are:
67
65
@@ -100,7 +98,7 @@ This table shows the progression of vector quota increases in GB over time. The
100
98
101
99
<sup>2</sup> Vector limits during the later preview period. Three regions didn't have the higher limits: Germany West Central, West India, Qatar Central.
102
100
103
-
<sup>3</sup> Higher vector quota based on the larger partitions for supported tiers and regions.
101
+
<sup>3</sup> Higher vector quota based on the larger partitions for supported tiers and regions.
104
102
105
103
<sup>4</sup> Higher vector quota for more tiers and regions based on partition size updates.
106
104
@@ -160,8 +158,6 @@ Indexers can access other Azure resources [over private endpoints](search-indexe
160
158
161
159
<sup>3</sup> The number of distinct resource types are computed as the number of unique `groupId` values used across all shared private link resources for a given search service, irrespective of the status of the resource.
162
160
163
-
164
-
165
161
## Synonym limits
166
162
167
163
Maximum number of synonym maps varies by tier. Each rule can have up to 20 expansions, where an expansion is an equivalent term. For example, given "cat", association with "kitty", "feline", and "felis" (the genus for cats) would count as 3 expansions.
@@ -188,15 +184,15 @@ An [AI enrichment pipeline](cognitive-search-concept-intro.md) that makes calls
188
184
189
185
## Throttling limits
190
186
191
-
API requests are throttled as the system approaches peak capacity. Throttling behaves differently for different APIs. Query APIs (Search/Suggest/Autocomplete) and indexing APIs throttle dynamically based on the load on the service. Index APIs and service operations API have static request rate limits.
187
+
API requests are throttled as the system approaches peak capacity. Throttling behaves differently for different APIs. Query APIs (Search/Suggest/Autocomplete) and indexing APIs throttle dynamically based on the load on the service. Index APIs and service operations API have static request rate limits.
192
188
193
189
Static rate request limits for operations related to an index:
194
190
195
191
+ List Indexes (GET /indexes): 3 per second per search unit
196
192
+ Get Index (GET /indexes/myindex): 10 per second per search unit
197
193
+ Create Index (POST /indexes): 12 per minute per search unit
198
194
+ Create or Update Index (PUT /indexes/myindex): 6 per second per search unit
199
-
+ Delete Index (DELETE /indexes/myindex): 12 per minute per search unit
195
+
+ Delete Index (DELETE /indexes/myindex): 12 per minute per search unit
200
196
201
197
Static rate request limits for operations related to a service:
202
198
@@ -208,28 +204,28 @@ L2 reranking using the semantic reranker has an expected volume:
208
204
209
205
## API request limits
210
206
211
-
* Maximum of 16 MB per request <sup>1</sup>
212
-
* Maximum 8-KB URL length
213
-
* Maximum 1,000 documents per batch of index uploads, merges, or deletes
214
-
* Maximum 32 fields in $orderby clause
215
-
* Maximum 100,000 characters in a search clause
216
-
* The maximum number of clauses in `search` (expressions separated by AND or OR) is 1024
217
-
* Maximum search term size is 32,766 bytes (32 KB minus 2 bytes) of UTF-8 encoded text
218
-
* Maximum search term size is 1,000 characters for [prefix search](query-simple-syntax.md#prefix-queries) and [regex search](query-lucene-syntax.md#bkmk_regex)
219
-
*[Wildcard search](query-lucene-syntax.md#bkmk_wildcard) and [Regular expression search](query-lucene-syntax.md#bkmk_regex) are limited to a maximum of 1,000 states when processed by [Lucene](https://lucene.apache.org/core/7_0_1/core/org/apache/lucene/util/automaton/RegExp.html).
207
+
+ Maximum of 16 MB per request <sup>1</sup>
208
+
+ Maximum 8-KB URL length
209
+
+ Maximum 1,000 documents per batch of index uploads, merges, or deletes
210
+
+ Maximum 32 fields in $orderby clause
211
+
+ Maximum 100,000 characters in a search clause
212
+
+ The maximum number of clauses in `search` (expressions separated by AND or OR) is 1024
213
+
+ Maximum search term size is 32,766 bytes (32 KB minus 2 bytes) of UTF-8 encoded text
214
+
+ Maximum search term size is 1,000 characters for [prefix search](query-simple-syntax.md#prefix-queries) and [regex search](query-lucene-syntax.md#bkmk_regex)
215
+
+[Wildcard search](query-lucene-syntax.md#bkmk_wildcard) and [Regular expression search](query-lucene-syntax.md#bkmk_regex) are limited to a maximum of 1,000 states when processed by [Lucene](https://lucene.apache.org/core/7_0_1/core/org/apache/lucene/util/automaton/RegExp.html).
220
216
221
217
<sup>1</sup> In Azure AI Search, the body of a request is subject to an upper limit of 16 MB, imposing a practical limit on the contents of individual fields or collections that aren't otherwise constrained by theoretical limits (see [Supported data types](/rest/api/searchservice/supported-data-types) for more information about field composition and restrictions).
222
218
223
219
Limits on query size and composition exist because unbounded queries can destabilize your search service. Typically, such queries are created programmatically. If your application generates search queries programmatically, we recommend designing it in such a way that it doesn't generate queries of unbounded size.
224
220
225
221
## API response limits
226
222
227
-
* Maximum 1,000 documents returned per page of search results
228
-
* Maximum 100 suggestions returned per Suggest API request
223
+
+ Maximum 1,000 documents returned per page of search results
224
+
+ Maximum 100 suggestions returned per Suggest API request
229
225
230
226
## API key limits
231
227
232
228
API keys are used for service authentication. There are two types. Admin keys are specified in the request header and grant full read-write access to the service. Query keys are read-only, specified on the URL, and typically distributed to client applications.
0 commit comments