Skip to content

Commit 7a81fc2

Browse files
committed
Throttling
1 parent 2c0db9c commit 7a81fc2

File tree

3 files changed

+27
-11
lines changed

3 files changed

+27
-11
lines changed

articles/search/search-limits-quotas-capacity.md

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ author: HeidiSteen
88
ms.author: heidist
99
ms.service: cognitive-search
1010
ms.topic: conceptual
11-
ms.date: 05/21/2024
11+
ms.date: 06/13/2024
1212
ms.custom:
1313
- references_regions
1414
- build-2024
@@ -230,6 +230,10 @@ Static rate request limits for operations related to a service:
230230

231231
+ Service Statistics (GET /servicestats): 4 per second per search unit
232232

233+
L2 reranking using the semantic reranker has an expected volume:
234+
235+
+ Up to 10 concurrent queries per replica. If you anticipate consistent throughput requirements near, at, or higher than this level, please file a support ticket so that we can provision for your workload.
236+
233237
## API request limits
234238

235239
* Maximum of 16 MB per request <sup>1</sup>

articles/search/semantic-how-to-configure.md

Lines changed: 7 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -10,24 +10,26 @@ ms.service: cognitive-search
1010
ms.custom:
1111
- ignite-2023
1212
ms.topic: how-to
13-
ms.date: 02/08/2024
13+
ms.date: 06/13/2024
1414
---
1515

1616
# Configure semantic ranking and return captions in search results
1717

18-
In this article, learn how to invoke a semantic ranking over a result set, promoting the most semantically relevant results to the top of the stack. You can also get semantic captions, with highlights over the most relevant terms and phrases, and [semantic answers](semantic-answers.md).
18+
This article explains how to configure a search index for semantic reranking.
19+
20+
Semantic ranking iterates over an initial result set, applying an L2 ranking methodology that promotes the most semantically relevant results to the top of the stack. You can also get semantic captions, with highlights over the most relevant terms and phrases, and [semantic answers](semantic-answers.md).
1921

2022
## Prerequisites
2123

22-
+ A search service on Basic, Standard tier (S1, S2, S3), or Storage Optimized tier (L1, L2), subject to [region availability](https://azure.microsoft.com/global-infrastructure/services/?products=search).
24+
+ A search service on a basic tier or higher, subject to [region availability](https://azure.microsoft.com/global-infrastructure/services/?products=search).
2325

2426
+ Semantic ranker [enabled on your search service](semantic-how-to-enable-disable.md).
2527

26-
+ An existing search index with rich text content. Semantic ranking applies to text (nonvector) fields and works best on content that is informational or descriptive.
28+
+ An existing search index with rich text content. Semantic ranking applies to strings (nonvector) fields and works best on content that is informational or descriptive.
2729

2830
## Choose a client
2931

30-
Choose a search client that supports semantic ranking. Here are some options:
32+
You can use any of the following tools and SDKs to add a semantic configuration:
3133

3234
+ [Azure portal](https://portal.azure.com), using the index designer to add a semantic configuration.
3335
+ [Visual Studio Code](https://code.visualstudio.com/download) with the [REST client](https://marketplace.visualstudio.com/items?itemName=humao.rest-client)

articles/search/semantic-how-to-query-request.md

Lines changed: 15 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -10,16 +10,18 @@ ms.service: cognitive-search
1010
ms.custom:
1111
- ignite-2023
1212
ms.topic: how-to
13-
ms.date: 02/08/2024
13+
ms.date: 06/13/2024
1414
---
1515

1616
# Create a semantic query in Azure AI Search
1717

18-
In this article, learn how to invoke a semantic ranking over a result set, promoting the most semantically relevant results to the top of the stack. You can also get semantic captions, with highlights over the most relevant terms and phrases, and [semantic answers](semantic-answers.md).
18+
This article explains how to invoke the semantic ranker on queries. You can apply semantic ranking to text queries, hybrid queries, and vector queries if your search documents contain string fields and the [vector query has a text representation](vector-search-how-to-query.md#query-with-integrated-vectorization-preview).
19+
20+
Semantic ranking iterates over an initial result set, applying an L2 ranking methodology that promotes the most semantically relevant results to the top of the stack. You can also get semantic captions, with highlights over the most relevant terms and phrases, and [semantic answers](semantic-answers.md).
1921

2022
## Prerequisites
2123

22-
+ A search service, Basic tier or higher, with [semantic ranking](semantic-how-to-enable-disable.md).
24+
+ A search service, basic tier or higher, with [semantic ranking enabled](semantic-how-to-enable-disable.md).
2325

2426
+ An existing search index with a [semantic configuration](semantic-how-to-configure.md) and rich text content.
2527

@@ -30,7 +32,7 @@ In this article, learn how to invoke a semantic ranking over a result set, promo
3032
3133
## Choose a client
3234

33-
Choose a search client that supports semantic ranking. Here are some options:
35+
You can use any of the following tools and SDKs to build a query that uses semantic ranking:
3436

3537
+ [Azure portal](https://portal.azure.com), using the index designer to add a semantic configuration.
3638
+ [Visual Studio Code](https://code.visualstudio.com/download) with a [REST client](https://marketplace.visualstudio.com/items?itemName=humao.rest-client)
@@ -86,7 +88,7 @@ In this step, add parameters to the query request. To be successful, your query
8688
"count": true
8789
}
8890
```
89-
91+
9092
### [**REST API**](#tab/rest-query)
9193

9294
Use [Search Documents](/rest/api/searchservice/documents/search-post) to formulate the request.
@@ -220,6 +222,14 @@ The response for the above example query returns the following match as the top
220222
]
221223
```
222224

225+
## Expected workloads
226+
227+
For semantic ranking, you should expect a search service to support up to 10 concurrent queries per replica.
228+
229+
The service throttles semantic ranking requests if volumes are too high. Error messages that include `Error in search query: Operation returned an invalid status 'Partial Content'` with a code of `@search.semanticPartialResponseReason` and `CapacityOverloaded` indicate the service is at capacity for semantic ranking.
230+
231+
If you anticipate consistent throughput requirements near, at, or higher than this level, please file a support ticket so that we can provision for your workload.
232+
223233
## Next steps
224234

225235
Semantic ranking can be used in hybrid queries that combine keyword search and vector search into a single request and a unified response.

0 commit comments

Comments
 (0)