You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/search/vector-search-how-to-configure-vectorizer.md
+10-10Lines changed: 10 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,17 +17,17 @@ ms.date: 03/27/2024
17
17
> [!IMPORTANT]
18
18
> This feature is in public preview under [Supplemental Terms of Use](https://azure.microsoft.com/support/legal/preview-supplemental-terms/). The [2023-10-01-Preview REST API](/rest/api/searchservice/operation-groups?view=rest-searchservice-2023-10-01-preview&preserve-view=true) supports this feature.
19
19
20
-
In Azure AI Search a *vectorizer* is a vectorization agent, such as a deployed embedding model on Azure OpenAI, that converts text to vectors during query execution.
20
+
In Azure AI Search a *vectorizer* is software that performs vectorization, such as a deployed embedding model on Azure OpenAI, that converts text to vectors during query execution.
21
21
22
-
It's defined in a [search index](search-what-is-an-index.md) on searchable vector fields, and used at query time to generate an embedding for a text query input. If instead you need to vectorize text as part of the indexing process, refer to [Integrated Vectorization (Preview)](vector-search-integrated-vectorization.md). You can configure an indexer and skillset that calls Azure OpenAI embedding model during indexing to generate embeddings for your text content.
22
+
It's defined in a [search index](search-what-is-an-index.md), it applies to searchable vector fields, and it's used at query time to generate an embedding for a text query input. If instead you need to vectorize text as part of the indexing process, refer to [Integrated Vectorization (Preview)](vector-search-integrated-vectorization.md). For built-in vectorization during indexing, you can configure an indexer and skillset that calls an Azure OpenAI embedding model for your raw text content.
23
23
24
-
To add a vectorizer to search index, you can use the index designer in Azure portal, or call the [Create or Update Index 2023-10-01-preview](/rest/api/searchservice/indexes/create-or-update?view=rest-searchservice-2023-10-01-preview&preserve-view=true) REST API, or use any Azure beta SDK package that's been updated to provide this feature.
24
+
To add a vectorizer to search index, you can use the index designer in Azure portal, or call the [Create or Update Index 2023-10-01-preview](/rest/api/searchservice/indexes/create-or-update?view=rest-searchservice-2023-10-01-preview&preserve-view=true) REST API, or use any Azure beta SDK package that's updated to provide this feature.
25
25
26
26
## Prerequisites
27
27
28
28
+[An index with searchable vector fields](vector-search-how-to-create-index.md) on Azure AI Search.
29
29
30
-
+ A deployed embedding model, such as **text-embedding-ada-002** on Azure OpenAI. It's used to vectorize a query. It must be identical to the model used to generate embeddings in the index.
30
+
+ A deployed embedding model, such as **text-embedding-ada-002** on Azure OpenAI. It's used to vectorize a query. It must be identical to the model used to generate the embeddings in your index.
31
31
32
32
+ Permissions to use the embedding model. If you're using Azure OpenAI, the caller must have [Cognitive Services OpenAI User](/azure/ai-services/openai/how-to/role-based-access-control#azure-openai-roles) permissions. Or, you can provide an API key.
33
33
@@ -37,17 +37,17 @@ We recommend that you enable diagnostic logging on your search service to confir
37
37
38
38
## Try a vectorizer with sample data
39
39
40
-
The [Import and vectorize data wizard](search-get-started-portal-import-vectors.md) reads files from Azure Blob storage, creates an index with chunked and vectorized fields, and adds a vectorizer. By design, the vectorizer created by the wizard is set to the same embedding model used to index the blob content.
40
+
The [Import and vectorize data wizard](search-get-started-portal-import-vectors.md) reads files from Azure Blob storage, creates an index with chunked and vectorized fields, and adds a vectorizer. By design, the vectorizer that's created by the wizard is set to the same embedding model used to index the blob content.
41
41
42
42
1.[Upload sample data files](/azure/storage/blobs/storage-quickstart-blobs-portal) to a container on Azure Storage. We used some [small text files from NASA's earth book](https://github.com/Azure-Samples/azure-search-sample-data/tree/main/nasa-e-book/earth-txt-10) to test these instructions on a free search service.
43
43
44
44
1. Run the [Import and vectorize data wizard](search-get-started-portal-import-vectors.md), choosing the blob container for the data source.
45
45
46
-
:::image type="content" source="media/vector-search-how-to-configure-vectorizer/connect-to-data.png" alt-text="Screenshot of the connect to your data page.":::
46
+
:::image type="content" source="media/vector-search-how-to-configure-vectorizer/connect-to-data.png" lightbox="media/vector-search-how-to-configure-vectorizer/connect-to-data.png" alt-text="Screenshot of the connect to your data page.":::
47
47
48
48
1. Choose an existing deployment of **text-embedding-ada-002**. This model generates embeddings during indexing and is also used to configure the vectorizer used during queries.
49
49
50
-
:::image type="content" source="media/vector-search-how-to-configure-vectorizer/vectorize-enrich-data.png" alt-text="Screenshot of the vectorize and enrich data page.":::
50
+
:::image type="content" source="media/vector-search-how-to-configure-vectorizer/vectorize-enrich-data.png" lightbox="media/vector-search-how-to-configure-vectorizer/vectorize-enrich-data.png" alt-text="Screenshot of the vectorize and enrich data page.":::
51
51
52
52
1. After the wizard is finished and all indexer processing is complete, you should have an index with a searchable vector field. The field's JSON definition looks like this:
53
53
@@ -62,7 +62,7 @@ The [Import and vectorize data wizard](search-get-started-portal-import-vectors.
62
62
}
63
63
```
64
64
65
-
1. You should also have a vector profile and a vectorizer that looks similar to the following example:
65
+
1. You should also have a vector profile and a vectorizer, similar to the following example:
66
66
67
67
```json
68
68
"profiles": [
@@ -120,7 +120,7 @@ This section explains the modifications to an index schema for defining a vector
120
120
]
121
121
```
122
122
123
-
1. In the same index, add a vector profiles section that specifies one of your vectorizers. Vector profiles also require an algorithm used during indexing.
123
+
1. In the same index, add a vector profiles section that specifies one of your vectorizers. Vector profiles also require a [vector search algorithm](vector-search-ranking.md) used to create navigation structures.
124
124
125
125
```json
126
126
"profiles": [
@@ -210,7 +210,7 @@ Use a search client to send a query through a vectorizer. This example assumes V
210
210
211
211
1. Send the request. You should get three `k` results, where the first result is the most relevant.
212
212
213
-
Notice that there are no vectorizer properties to set at query time. The query reads the vectorizer properties, as acquired through the vector profile field assignment in the index.
213
+
Notice that there are no vectorizer properties to set at query time. The query reads the vectorizer properties, as per the vector profile field assignment in the index.
0 commit comments