articles/search/search-limits-quotas-capacity.md
Lines changed: 5 additions & 1 deletion
@@ -8,7 +8,7 @@ author: HeidiSteen
 ms.author: heidist
 ms.service: cognitive-search
 ms.topic: conceptual
-ms.date: 05/21/2024
+ms.date: 06/13/2024
 ms.custom:
 - references_regions
 - build-2024
@@ -230,6 +230,10 @@ Static rate request limits for operations related to a service:
 
 + Service Statistics (GET /servicestats): 4 per second per search unit
 
+L2 reranking using the semantic reranker has an expected volume:
+
++ Up to 10 concurrent queries per replica. If you anticipate consistent throughput requirements near, at, or higher than this level, please file a support ticket so that we can provision for your workload.
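One way to read the expected volume added in this hunk is as a client-side concurrency ceiling. The following is a minimal Python sketch, not part of the commit. It assumes the ceiling scales linearly with replica count (the diff states only the per-replica figure). `semantic_query_limit`, `SemanticQueryGate`, and `send_query` are hypothetical names introduced for illustration.

```python
import threading

# Per-replica figure quoted in the diff; everything else is illustrative.
QUERIES_PER_REPLICA = 10


def semantic_query_limit(replica_count: int) -> int:
    """Return the assumed ceiling on concurrent semantic queries."""
    if replica_count < 1:
        raise ValueError("replica_count must be at least 1")
    # Assumption: the expected volume scales linearly with replicas.
    return QUERIES_PER_REPLICA * replica_count


class SemanticQueryGate:
    """Blocks callers once the assumed concurrency ceiling is reached."""

    def __init__(self, replica_count: int) -> None:
        self.limit = semantic_query_limit(replica_count)
        self._slots = threading.BoundedSemaphore(self.limit)

    def run(self, send_query, *args, **kwargs):
        # send_query is a hypothetical callable that issues one query.
        with self._slots:
            return send_query(*args, **kwargs)
```

A caller would create one gate per search service and route every semantic query through `gate.run(...)`, so that excess requests wait locally instead of reaching the service.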
articles/search/semantic-how-to-configure.md
Lines changed: 7 additions & 5 deletions
@@ -10,24 +10,26 @@ ms.service: cognitive-search
 ms.custom:
 - ignite-2023
 ms.topic: how-to
-ms.date: 02/08/2024
+ms.date: 06/13/2024
 ---
 
 # Configure semantic ranking and return captions in search results
 
-In this article, learn how to invoke a semantic ranking over a result set, promoting the most semantically relevant results to the top of the stack. You can also get semantic captions, with highlights over the most relevant terms and phrases, and [semantic answers](semantic-answers.md).
+This article explains how to configure a search index for semantic reranking.
+
+Semantic ranking iterates over an initial result set, applying an L2 ranking methodology that promotes the most semantically relevant results to the top of the stack. You can also get semantic captions, with highlights over the most relevant terms and phrases, and [semantic answers](semantic-answers.md).
 
 ## Prerequisites
 
-+ A search service on Basic, Standard tier (S1, S2, S3), or Storage Optimized tier (L1, L2), subject to [region availability](https://azure.microsoft.com/global-infrastructure/services/?products=search).
++ A search service on a basic tier or higher, subject to [region availability](https://azure.microsoft.com/global-infrastructure/services/?products=search).
 
 + Semantic ranker [enabled on your search service](semantic-how-to-enable-disable.md).
 
-+ An existing search index with rich text content. Semantic ranking applies to text (nonvector) fields and works best on content that is informational or descriptive.
++ An existing search index with rich text content. Semantic ranking applies to strings (nonvector) fields and works best on content that is informational or descriptive.
 
 ## Choose a client
 
-Choose a search client that supports semantic ranking. Here are some options:
+You can use any of the following tools and SDKs to add a semantic configuration:
 
 + [Azure portal](https://portal.azure.com), using the index designer to add a semantic configuration.
 + [Visual Studio Code](https://code.visualstudio.com/download) with the [REST client](https://marketplace.visualstudio.com/items?itemName=humao.rest-client)
articles/search/semantic-how-to-query-request.md
Lines changed: 15 additions & 5 deletions
@@ -10,16 +10,18 @@ ms.service: cognitive-search
 ms.custom:
 - ignite-2023
 ms.topic: how-to
-ms.date: 02/08/2024
+ms.date: 06/13/2024
 ---
 
 # Create a semantic query in Azure AI Search
 
-In this article, learn how to invoke a semantic ranking over a result set, promoting the most semantically relevant results to the top of the stack. You can also get semantic captions, with highlights over the most relevant terms and phrases, and [semantic answers](semantic-answers.md).
+This article explains how to invoke the semantic ranker on queries. You can apply semantic ranking to text queries, hybrid queries, and vector queries if your search documents contain string fields and the [vector query has a text representation](vector-search-how-to-query.md#query-with-integrated-vectorization-preview).
+
+Semantic ranking iterates over an initial result set, applying an L2 ranking methodology that promotes the most semantically relevant results to the top of the stack. You can also get semantic captions, with highlights over the most relevant terms and phrases, and [semantic answers](semantic-answers.md).
 
 ## Prerequisites
 
-+ A search service, Basic tier or higher, with [semantic ranking](semantic-how-to-enable-disable.md).
++ A search service, basic tier or higher, with [semantic ranking enabled](semantic-how-to-enable-disable.md).
 
 + An existing search index with a [semantic configuration](semantic-how-to-configure.md) and rich text content.
 
@@ -30,7 +32,7 @@ In this article, learn how to invoke a semantic ranking over a result set, promo
 
 ## Choose a client
 
-Choose a search client that supports semantic ranking. Here are some options:
+You can use any of the following tools and SDKs to build a query that uses semantic ranking:
 
 + [Azure portal](https://portal.azure.com), using the index designer to add a semantic configuration.
 + [Visual Studio Code](https://code.visualstudio.com/download) with a [REST client](https://marketplace.visualstudio.com/items?itemName=humao.rest-client)
@@ -86,7 +88,7 @@ In this step, add parameters to the query request. To be successful, your query
     "count": true
 }
 ```
-
+
 ### [**REST API**](#tab/rest-query)
 
 Use [Search Documents](/rest/api/searchservice/documents/search-post) to formulate the request.
@@ -220,6 +222,14 @@ The response for the above example query returns the following match as the top
 ]
 ```
 
+## Expected workloads
+
+For semantic ranking, you should expect a search service to support up to 10 concurrent queries per replica.
+
+The service throttles semantic ranking requests if volumes are too high. Error messages that include `Error in search query: Operation returned an invalid status 'Partial Content'` with a code of `@search.semanticPartialResponseReason` and `CapacityOverloaded` indicate the service is at capacity for semantic ranking.
+
+If you anticipate consistent throughput requirements near, at, or higher than this level, please file a support ticket so that we can provision for your workload.
+
 ## Next steps
 
 Semantic ranking can be used in hybrid queries that combine keyword search and vector search into a single request and a unified response.
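
The throttling behavior described in the new "Expected workloads" section could be handled on the client roughly as follows. This Python sketch is not part of the commit. It assumes the response body is parsed JSON in which `@search.semanticPartialResponseReason` appears as a top-level key, and that a value of `CapacityOverloaded` (compared case-insensitively here) signals throttling. The helper names and backoff values are hypothetical and are not taken from the documentation.

```python
from typing import Any, List, Mapping

# Key name quoted in the diff; its position in the body is an assumption.
PARTIAL_REASON_KEY = "@search.semanticPartialResponseReason"


def is_semantic_capacity_overloaded(response_body: Mapping[str, Any]) -> bool:
    """Return True if the body reports a capacity-overloaded partial response."""
    reason = response_body.get(PARTIAL_REASON_KEY)
    # Compare case-insensitively because the exact casing is not confirmed.
    return isinstance(reason, str) and reason.lower() == "capacityoverloaded"


def backoff_delays(
    attempts: int, base_seconds: float = 0.5, cap_seconds: float = 8.0
) -> List[float]:
    """Return capped exponential delays (in seconds) for retry attempts."""
    return [min(cap_seconds, base_seconds * 2 ** i) for i in range(attempts)]
```

A caller might check each response with `is_semantic_capacity_overloaded` and, when it returns `True`, wait for the next delay from `backoff_delays` before retrying. Sustained overload is better addressed through the support ticket route that the section describes.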