articles/search/search-limits-quotas-capacity.md
Lines changed: 25 additions & 45 deletions
@@ -8,7 +8,7 @@ author: HeidiSteen
 ms.author: heidist
 ms.service: cognitive-search
 ms.topic: conceptual
-ms.date: 06/13/2024
+ms.date: 09/04/2024
 ms.custom:
 - references_regions
 - build-2024
@@ -89,63 +89,43 @@ Use the [GET Service Statistics](/rest/api/searchservice/get-service-statistics)

 Vector limits vary by service creation date and tier. To check the age of your search service and learn more about vector indexes, see [Vector index size and staying under limits](vector-search-index-size.md).

-### Vector limits on services created after May 17, 2024
+### Storage quota (GB)

-The highest vector limits are available on search services created after May 17, 2024 in a [supported region](#service-limits).
+This table shows the progression of storage quota increases in GB over time. Vector quota is per partition, so the increase in vector quota is bound to the increase in per-partition storage for each tier. Higher capacity partitions came online starting in April 2024.
<sup>3</sup> Higher capacity storage for Basic, S1, S2, S3 in the following regions. **Americas**: Brazil South, Canada Central, Canada East, East US, East US 2, Central US, North Central US, South Central US, West US, West US 2, West US 3, West Central US. **Europe**: France Central, Italy North, North Europe, Norway East, Poland Central, Switzerland North, Sweden Central, UK South, UK West. **Middle East**: UAE North. **Africa**: South Africa North. **Asia Pacific**: Australia East, Australia Southeast, Central India, Jio India West, East Asia, Southeast Asia, Japan East, Japan West, Korea Central, Korea South.

-Notice that L1 and L2 limits are unchanged in the April 3 rollout.
+<sup>4</sup> Higher capacity storage for more tiers and more regions. **Europe**: Germany North, Germany West Central, Switzerland West. **Azure Government**: Texas, Arizona, Virginia. **Africa**: South Africa North. **Asia Pacific**: China North 3, China East 3.

-### Vector limits on services created between July 1, 2023 and April 3, 2024
+### Vector quota per partition (GB)

-The following limits applied to new services created between July 1 and April 3, 2024, except for the following regions, which have the original limits from before July 1, 2023:
+This table shows the progression of vector quota increases in GB over time. The quota is per partition, so if you scale a new Standard (S1) service to 6 partitions, total vector quota is 35 multiplied by 6.

-+ Germany West Central
-+ West India
-+ Qatar Central
+| Service creation date |Basic | S1| S2 | S3 | L1 | L2 |
<sup>2</sup> Vector limits during the later preview period. Three regions didn't have the higher limits: Germany West Central, West India, Qatar Central.

-### Vector limits on services created before July 1, 2023
+<sup>3</sup> Higher vector quota based on the larger partitions for supported tiers and regions.
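The per-partition arithmetic described for the vector quota table can be illustrated with a minimal sketch. The helper name `total_vector_quota_gb` is hypothetical and not part of any Azure SDK; the 35 GB per-partition figure and the 6-partition count come from the S1 example in the added text above.

```python
def total_vector_quota_gb(per_partition_quota_gb, partitions):
    """Return the service-wide vector quota in GB.

    Hypothetical helper for illustration only: vector quota is enforced
    per partition, so the total is the per-partition quota multiplied
    by the number of partitions.
    """
    if partitions < 1:
        raise ValueError("a search service has at least one partition")
    return per_partition_quota_gb * partitions


# S1 example from the text: 35 GB per partition, scaled to 6 partitions.
print(total_vector_quota_gb(35, 6))  # 210
```

Adding or removing partitions changes only the second argument; the per-partition value is fixed by tier and service creation date.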
articles/search/search-reliability.md
Lines changed: 5 additions & 4 deletions
@@ -6,7 +6,7 @@ author: mattgotteiner
 ms.author: magottei
 ms.service: cognitive-search
 ms.topic: conceptual
-ms.date: 01/02/2024
+ms.date: 09/04/2024
 ms.custom:
 - subject-reliability
 - references_regions
@@ -55,10 +55,9 @@ Availability zones are used when you add two or more replicas to your search ser

 ### Supported regions

-Support for availability zones depends on infrastructure and storage. Currently, two zones that were announced in October 2023 have insufficient storage and don't provide an availability zone for Azure AI Search:
+Support for availability zones depends on infrastructure and storage. Currently, the following zone has insufficient storage and doesn't provide an availability zone for Azure AI Search:

-+ Israel Central
-+ Italy North
++ Japan West

 Otherwise, availability zones for Azure AI Search are supported in the following regions:

@@ -75,6 +74,8 @@ Otherwise, availability zones for Azure AI Search are supported in the following
articles/search/vector-search-index-size.md
Lines changed: 2 additions & 12 deletions
@@ -30,12 +30,7 @@ For each vector field, Azure AI Search constructs an internal vector index using

 + Vector indexes are also subject to disk quota, in the sense that all indexes are subject to disk quota. There's no separate disk quota for vector indexes.

-+ Vector quotas are enforced on the search service as a whole, per partition, meaning that if you add partitions, vector quota goes up. Per-partition vector quotas are higher on newer services:
-
-+[Vector quota for services created after May 17, 2024](search-limits-quotas-capacity.md#vector-limits-on-services-created-after-may-17-2024)
-+[Vector quota for services between April 3, 2024 and May 17, 2024](search-limits-quotas-capacity.md#vector-limits-on-services-created-between-april-3-2024-and-may-17-2024)
-+[Vector quota for services created between July 1, 2023 and April 3, 2024](search-limits-quotas-capacity.md#vector-limits-on-services-created-between-july-1-2023-and-april-3-2024)
-+[Vector quota for services created before July 1, 2023](search-limits-quotas-capacity.md#vector-limits-on-services-created-before-july-1-2023)
++ Vector quotas are enforced on the search service as a whole, per partition, meaning that if you add partitions, vector quota goes up. Per-partition vector quotas are higher on newer services. For more information, see [Vector index size limits](search-limits-quotas-capacity.md#vector-index-size-limits).

 ## How to check partition size and quantity

@@ -63,12 +58,7 @@ Newer services created after April 3, 2024 offer five to ten times more vector s

 :::image type="content" source="media/vector-search-index-size/deployment-details.png" alt-text="Screenshot of the deployment details showing creation date.":::

-1. Now that you know the age of your search service, review the vector quota limits based on service creation:
-
-+[After May 17, 2024](search-limits-quotas-capacity.md#vector-limits-on-services-created-after-may-17-2024)
-+[Between April 3, 2024 and May 17, 2024](search-limits-quotas-capacity.md#vector-limits-on-services-created-between-april-3-2024-and-may-17-2024)
-+[Between July 1, 2023 and April 3, 2024](search-limits-quotas-capacity.md#vector-limits-on-services-created-between-july-1-2023-and-april-3-2024)
-+[Before July 1, 2023](search-limits-quotas-capacity.md#vector-limits-on-services-created-before-july-1-2023)
+1. Now that you know the age of your search service, review the vector quota limits based on service creation: [Vector index size limits](search-limits-quotas-capacity.md#vector-index-size-limits).
|[Security update addressing information disclosure](https://msrc.microsoft.com/update-guide/vulnerability/CVE-2024-29063)| API | GET responses [no longer return connection strings or keys](search-api-migration.md#breaking-change-for-client-code-that-reads-connection-information). Applies to GET Skillset, GET Index, and GET Indexer. This change helps protect your Azure assets integrated with AI Search from unauthorized access. |
 |[More storage on Basic and Standard tiers](search-limits-quotas-capacity.md#service-limits)| Infrastructure | Basic now supports up to three partitions and three replicas. Basic and Standard (S1, S2, S3) tiers have significantly more storage per partition, at the same per-partition billing rate. Extra capacity is subject to [regional availability](search-limits-quotas-capacity.md#service-limits) and applies to new search services created after April 3, 2024. Currently, there's no in-place upgrade, so you must create a new search service to get the extra storage. |
-|[More quota for vectors](search-limits-quotas-capacity.md#vector-limits-on-services-created-between-april-3-2024-and-may-17-2024)| Infrastructure | Vector quotas are also higher on new services created after April 3, 2024 in selected regions. |
+|[More quota for vectors](search-limits-quotas-capacity.md#vector-index-size-limits)| Infrastructure | Vector quotas are also higher on new services created after April 3, 2024 in selected regions. |
 |[Vector quantization, narrow vector data types, and a new `stored` property (preview)](vector-search-how-to-configure-compression-storage.md)| Feature | Collectively, these three features add vector compression and smarter storage options. First, *scalar quantization* reduces vector index size in memory and on disk. Second, [narrow data types](/rest/api/searchservice/supported-data-types) reduce per-field storage by storing smaller values. Third, you can use `stored` to opt-out of storing the extra copy of a vector that's used only for search results. If you don't need vectors in a query response, you can set `stored` to false to save on space. |
 |[2024-03-01-preview Search REST API](/rest/api/searchservice/search-service-api-versions#2024-03-01-preview)| API | New preview version of the Search REST APIs for the new data types, vector compression properties, and vector storage options. |
 |[2024-03-01-preview Management REST API](/rest/api/searchmanagement/operation-groups?view=rest-searchmanagement-2024-03-01-preview&preserve-view=true)| API | New preview version of the Management REST APIs for control plane operations. |