You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
> Higher capacity partitions are available at the same billing rate on newer services created after April and May 2024. For more information, see [Service limits](search-limits-quotas-capacity.md#service-limits) for partition size upgrades.
33
+
31
34
## Concepts: search units, replicas, partitions
32
35
33
36
Capacity is expressed in *search units* that can be allocated in combinations of *partitions* and *replicas*.
Copy file name to clipboardExpand all lines: articles/search/search-sku-manage-costs.md
+4-1Lines changed: 4 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -10,7 +10,7 @@ ms.service: cognitive-search
10
10
ms.custom:
11
11
- ignite-2023
12
12
ms.topic: conceptual
13
-
ms.date: 04/01/2024
13
+
ms.date: 06/19/2024
14
14
---
15
15
16
16
# Plan and manage costs of an Azure AI Search service
@@ -21,6 +21,9 @@ As a first step, estimate your baseline costs by using the Azure pricing calcula
21
21
22
22
Azure provides built-in cost management that cuts across service boundaries to provide inclusive cost monitoring and the ability to set budgets and define alerts. The costs of running a search service will vary depending on capacity and which features you use. After you create your search service, optimize capacity so that you pay only for what you need.
23
23
24
+
> [!NOTE]
25
+
> Higher capacity partitions are available at the same billing rate on newer services created after April and May 2024. For more information, see [Service limits](search-limits-quotas-capacity.md#service-limits) for partition size upgrades.
Copy file name to clipboardExpand all lines: articles/search/vector-search-how-to-index-binary-data.md
+4-1Lines changed: 4 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -14,6 +14,9 @@ ms.date: 05/30/2024
14
14
15
15
# Index binary vectors for vector search
16
16
17
+
> [!IMPORTANT]
18
+
> Binary data types are in public preview under [Supplemental Terms of Use](https://azure.microsoft.com/support/legal/preview-supplemental-terms/). The [2024-05-01-preview REST API](/rest/api/searchservice/operation-groups?view=rest-searchservice-2024-03-01-preview&preserve-view=true) and later preview APIs provide the new data type.
19
+
17
20
Beginning with the 2024-05-01-preview REST API, Azure AI Search supports a packed binary type of `Collection(Edm.Byte)` for further reducing the storage and memory footprint of vector data. You can use this data type for output from models such as [Cohere's Embed v3 binary embedding models](https://cohere.com/blog/introducing-embed-v3).
18
21
19
22
There are three steps to configuring an index for binary vectors:
@@ -23,7 +26,7 @@ There are three steps to configuring an index for binary vectors:
23
26
> + Add a vector profile that points to the algorithm
24
27
> + Add the vector profile to your binary field definition
25
28
26
-
This article assumes you're familiar with [creating an index in Azure AI Search](search-how-to-create-search-index.md). It uses the REST APIs to illustrate each step. You can also add a binary field type to an index in the Azure portal.
29
+
This article assumes you're familiar with [creating an index in Azure AI Search](search-how-to-create-search-index.md). It uses the REST APIs to illustrate each step, but you could also add a binary field to an index in the Azure portal.
Copy file name to clipboardExpand all lines: articles/search/vector-search-how-to-query.md
+11-9Lines changed: 11 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -470,9 +470,11 @@ Search results would include a combination of text and images, assuming your sea
470
470
471
471
## Query with integrated vectorization (preview)
472
472
473
-
This section shows a vector query that invokes the new [integrated vectorization](vector-search-integrated-vectorization.md) preview feature that converts a text query into a vector. Use [**2023-10-01-Preview** REST API](/rest/api/searchservice/documents/search-post?view=rest-searchservice-2023-10-01-preview&preserve-view=true) or an updated beta Azure SDK package.
473
+
This section shows a vector query that invokes the new [integrated vectorization](vector-search-integrated-vectorization.md) preview feature that converts a text query into a vector. Use [**2023-10-01-Preview** REST API](/rest/api/searchservice/documents/search-post?view=rest-searchservice-2023-10-01-preview&preserve-view=true)and newer preview REST APIs or an updated beta Azure SDK package.
474
474
475
-
A prerequisite is a search index having a [vectorizer configured and assigned](vector-search-how-to-configure-vectorizer.md) to a vector field. The vectorizer provides connection information to an embedding model used at query time.
475
+
A prerequisite is a search index having a [vectorizer configured and assigned](vector-search-how-to-configure-vectorizer.md) to a vector field. The vectorizer provides connection information to an embedding model used at query time. Check the index definition for a vectorizers specification.
476
+
477
+
:::image type="content" source="media/vector-search-how-to-query/check-vectorizer.png" alt-text="Screenshot of a vectorizer setting in a search index.":::
476
478
477
479
Queries provide text strings instead of vectors:
478
480
@@ -483,7 +485,7 @@ Queries provide text strings instead of vectors:
483
485
Here's a simple example of a query that's vectorized at query time. The text string is vectorized and then used to query the descriptionVector field.
484
486
485
487
```http
486
-
POST https://{{search-service}}.search.windows.net/indexes/{{index}}/docs/search?api-version=2023-10-01-preview
488
+
POST https://{{search-service}}.search.windows.net/indexes/{{index}}/docs/search?api-version=2024-05-01-preview
487
489
{
488
490
"select": "title, genre, description",
489
491
"vectorQueries": [
@@ -502,7 +504,7 @@ Here's a [hybrid query](hybrid-search-how-to-query.md) using integrated vectoriz
502
504
In this example, the search engine makes three vectorization calls to the vectorizers assigned to `descriptionVector`, `synopsisVector`, and `authorBioVector` in the index. The resulting vectors are used to retrieve documents against their respective fields. The search engine also executes a keyword search on the `search` query, "mystery novel set in London".
503
505
504
506
```http
505
-
POST https://{{search-service}}.search.windows.net/indexes/{{index}}/docs/search?api-version=2023-10-01-preview
507
+
POST https://{{search-service}}.search.windows.net/indexes/{{index}}/docs/search?api-version=2024-05-01-preview
506
508
Content-Type: application/json
507
509
api-key: {{admin-api-key}}
508
510
{
@@ -571,9 +573,11 @@ During query execution, a vector query can only target one internal vector index
571
573
572
574
## Set thresholds to exclude low-scoring results (preview)
573
575
574
-
Because nearest neighbor search always returns the requested `k` neighbors, it's possible to get low scoring matches as part of meeting the `k` number requirement on search results.
576
+
Because nearest neighbor search always returns the requested `k` neighbors, it's possible to get multiple low scoring matches as part of meeting the `k` number requirement on search results.
577
+
578
+
Using the 2024-05-01-preview REST APIs, you can now add a `threshold` query parameter to exclude low-scoring search results based on a minimum score. Filtering occurs before [fusing results](hybrid-search-ranking.md) from different recall sets.
575
579
576
-
Using the 2024-05-01-preview REST APIs, you can now add a `threshold` query parameter to exclude low-scoring search results.
580
+
In this example, all matches that score below 0.8 are excluded from vector search results, even if the number of results fall below `k`.
577
581
578
582
```http
579
583
POST https://[service-name].search.windows.net/indexes/[index-name]/docs/search?api-version=2024-05-01-Preview
@@ -595,8 +599,6 @@ POST https://[service-name].search.windows.net/indexes/[index-name]/docs/search?
595
599
}
596
600
```
597
601
598
-
Filtering occurs before [fusing results](hybrid-search-ranking.md) from different recall sets.
599
-
600
602
<!-- Keep H2 as-is. Direct link from a blog post. Bulk of maxtextsizerecall has moved to hybrid query doc-->
601
603
## MaxTextSizeRecall for hybrid search (preview)
602
604
@@ -608,7 +610,7 @@ For more information, see [Set maxTextRecallSize - Create a hybrid query](hybrid
608
610
609
611
## Vector weighting (preview)
610
612
611
-
Add a `weight` query parameter to specify the relative weight of each vector included in search operations. This value is used when combining the results of multiple ranking lists produced by two or more vector queries in the same request, or from the vector portion of a hybrid query.
613
+
Add a `weight` query parameter to specify the relative weight of each vector query included in search operations. This value is used when combining the results of multiple ranking lists produced by two or more vector queries in the same request, or from the vector portion of a hybrid query.
612
614
613
615
The default is 1.0 and the value must be a positive number larger than zero.
0 commit comments