Skip to content

Commit 5574218

Browse files
author
Jill Grant
authored
Merge pull request #374 from HeidiSteen/heidist-rag
[azure search] simplified and corrected vector limits
2 parents 35cb398 + 73cf6e0 commit 5574218

File tree

4 files changed

+9
-33
lines changed

4 files changed

+9
-33
lines changed
109 KB
Loading
38.8 KB
Loading

articles/search/search-limits-quotas-capacity.md

Lines changed: 8 additions & 32 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ author: HeidiSteen
88
ms.author: heidist
99
ms.service: cognitive-search
1010
ms.topic: conceptual
11-
ms.date: 09/04/2024
11+
ms.date: 09/19/2024
1212
ms.custom:
1313
- references_regions
1414
- build-2024
@@ -80,53 +80,29 @@ When estimating document size, remember to consider only those fields that add v
8080

8181
When you index documents with vector fields, Azure AI Search constructs internal vector indexes using the algorithm parameters you provide. The size of these vector indexes is restricted by the memory reserved for vector search for your service's tier (or `SKU`).
8282

83-
The service enforces a vector index size quota **for every partition** in your search service. Each extra partition increases the available vector index size quota. This quota is a hard limit to ensure your service remains healthy, which means that further indexing attempts once the limit is exceeded results in failure. You can resume indexing once you free up available quota by either deleting some vector documents or by scaling up in partitions.
84-
85-
Vector limits vary by service creation date and tier.
86-
87-
+ To check the age of your search service or learn more about vector indexes, see [Vector index size and staying under limits](vector-search-index-size.md).
88-
89-
+ To view the vector quota in effect for your search service, use [GET Service Statistics](/rest/api/searchservice/get-service-statistics/get-service-statistics), or check the **Properties** and **Usage** tabs for your search service in the Azure portal.
90-
91-
#### Vector quota per partition (GB)
83+
Vector limits vary by [service creation date](vector-search-index-size.md#how-to-check-service-creation-date) and [tier](search-sku-tier.md). For guidance on managing and maximizing vector storage, see [Vector index size and staying under limits](vector-search-index-size.md).
9284

9385
This table shows the progression of vector quota increases in GB over time. The quota is per partition, so if you scale a new Standard (S1) service to 6 partitions, total vector quota is 35 multiplied by 6.
9486

9587
| Service creation date |Basic | S1| S2 | S3/HD | L1 | L2 |
9688
|-----------------------|------|---|----|----|----|----|
9789
|**Before July 1, 2023** <sup>1</sup> | 0.5 | 1 | 6 | 12 | 12 | 36 |
9890
| **July 1, 2023 through April 3, 2024** <sup>2</sup>| 1 | 3 | 12 | 36 | 12 | 36 |
99-
|**April 3, 2024 through May 17, 2024** <sup>3</sup> | 5 | 35 | 100 | 200 | 12 | 36 |
100-
|**After May 17, 2024** <sup>4</sup> | 5 | 35 | 150 | 300 | 150 | 300 |
91+
|**April 3, 2024 through May 17, 2024** <sup>3</sup> | **5** | **35** | **150** | **300** | 12 | 36 |
92+
|**After May 17, 2024** <sup>4</sup> | 5 | 35 | 150 | 300 | **150** | **300** |
10193

10294
<sup>1</sup> Initial vector limits during early preview.
10395

10496
<sup>2</sup> Vector limits during the later preview period. Three regions didn't have the higher limits: Germany West Central, West India, Qatar Central.
10597

106-
<sup>3</sup> Higher vector quota based on the larger partitions for supported tiers and regions.
98+
<sup>3</sup> Higher vector quota based on the larger partitions for supported tiers and regions.
10799

108100
<sup>4</sup> Higher vector quota for more tiers and regions based on partition size updates.
109101

110-
#### Partition limits (GB)
111-
112-
This table repeats [partition storage limits](#service-limits) for context. The table shows the progression of storage quota increases in GB over time. Vector quota is per partition, so the more significant increases in vector quota that occurred starting in April 2024 correspond to the increases in per-partition storage occuring at the same time.
113-
114-
Higher capacity partitions were brought online starting in April 2024.
115-
116-
| Service creation date |Basic | S1| S2 | S3/HD | L1 | L2 |
117-
|-----------------------|------|---|----|----|----|----|
118-
|**Before July 1, 2023** <sup>1</sup> | 2 | 25 | 100 | 200 | 1,024 | 2,048 |
119-
|**July 1, 2023 through April 3, 2024** <sup>2</sup>| 2 | 25 | 100 | 200 | 1,024 | 2,048 |
120-
|**April 3, 2024 through May 17, 2024** <sup>3</sup> | 15 | 160 | 512 | 1,024 | 1,024 | 2,048 |
121-
|**After May 17, 2024** <sup>4</sup> | 15 | 160 | 512 | 1,024 | 2,048 | 4,096 |
122-
123-
<sup>1</sup> Partition sizes during early preview.
124-
125-
<sup>2</sup> No change during the later preview period.
126-
127-
<sup>3</sup> Higher capacity storage for Basic, S1, S2, S3 in these regions. **Americas**: Brazil South​, Canada Central​, Canada East​​, East US​, East US 2, ​Central US​, North Central US​, South Central US​, West US​, West US 2​, West US 3​, West Central US. **Europe**: France Central​. Italy North​​, North Europe​​, Norway East, Poland Central​​, Switzerland North​, Sweden Central​, UK South​, UK West​. **Middle East**: UAE North. **Africa**: South Africa North. **Asia Pacific**: Australia East​, Australia Southeast​​, Central India, Jio India West​, East Asia, Southeast Asia​, Japan East, Japan West​, Korea Central, Korea South​.
102+
The service enforces a vector index size quota **for every partition** in your search service. Each extra partition increases the available vector index size quota. This quota is a hard limit to ensure your service remains healthy, which means that further indexing attempts once the limit is exceeded results in failure. You can resume indexing once you free up available quota by either deleting some vector documents or by scaling up in partitions.
128103

129-
<sup>4</sup> Higher capacity storage for more tiers and more regions. **Europe**: Germany North​, Germany West Central, Switzerland West​. **Azure Government**: Texas, Arizona, Virginia. **Africa**: South Africa North​. **Asia Pacific**: China North 3, China East 3.
104+
> [!IMPORTANT]
105+
> Higher vector limits are tied to larger partition sizes. Regions that run on older infrastructure are subject to the July-April limits. Review the [regions list](search-region-support.md) for status on partition storage limits.
130106
131107
## Indexer limits
132108

articles/search/vector-search-index-size.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ ms.service: cognitive-search
99
ms.custom:
1010
- build-2024
1111
ms.topic: conceptual
12-
ms.date: 08/05/2024
12+
ms.date: 09/19/2024
1313
---
1414

1515
# Vector index size and staying under limits

0 commit comments

Comments
 (0)