Skip to content

Commit f45909d

Browse files
committed
Updated vector-search-how-to-generate-embeddings.md
1 parent 3a09be2 commit f45909d

File tree

1 file changed

+8
-6
lines changed

1 file changed

+8
-6
lines changed

articles/search/vector-search-how-to-generate-embeddings.md

Lines changed: 8 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -20,17 +20,17 @@ Azure AI Search doesn't host embedding models, so you're responsible for creatin
2020
| Approach | Description |
2121
| --- | --- |
2222
| [Integrated vectorization](vector-search-integrated-vectorization.md) | Use built-in data chunking and vectorization in Azure AI Search. This approach takes a dependency on indexers, skillsets, and built-in or custom skills that point to external embedding models, such as those in Azure AI Foundry. |
23-
| Manual vectorization | Manage data chunking and vectorization yourself. For indexing, you [push prevectorized documents](vector-search-how-to-create-index.md#load-vector-data-for-indexing) into vector fields in a search index. For querying, you provide precomputed vectors to the search engine. For demos of this approach, see the [azure-search-vector-samples](https://github.com/Azure/azure-search-vector-samples/tree/main) GitHub repository. |
23+
| Manual vectorization | Manage data chunking and vectorization yourself. For indexing, you [push prevectorized documents](vector-search-how-to-create-index.md#load-vector-data-for-indexing) into vector fields in a search index. For querying, you [provide precomputed vectors](#generate-an-embedding-for-an-improvised-query) to the search engine. For demos of this approach, see the [azure-search-vector-samples](https://github.com/Azure/azure-search-vector-samples/tree/main) GitHub repository. |
2424

25-
We recommend integrated vectorization for most scenarios. Although you can use any supported embedding model, this article uses Azure OpenAI embedding models for illustration.
25+
We recommend integrated vectorization for most scenarios. Although you can use any supported embedding model, this article uses Azure OpenAI models for illustration.
2626

2727
## How embedding models are used in vector queries
2828

2929
Embedding models generate vectors for both [query inputs](#query-inputs) and [query outputs](#query-outputs).
3030

3131
### Query inputs
3232

33-
Query inputs include the following:
33+
Query inputs include:
3434

3535
+ **Text or images that are converted to vectors during query processing**. As part of integrated vectorization, a [vectorizer](vector-search-how-to-configure-vectorizer.md) performs this task.
3636

@@ -46,11 +46,11 @@ Your search index must already contain documents with one or more vector fields
4646

4747
+ **Identify use cases**. Evaluate specific use cases where embedding model integration for vector search features adds value to your search solution. Examples include [multimodal search](multimodal-search-overview.md) or matching image content with text content, multilingual search, and similarity search.
4848

49-
+ **Design a chunking strategy**. Embedding models have limits on the number of tokens they accept, so data chunking is necessary for large files. For more information, see [Chunk large documents for vector search solutions](vector-search-how-to-chunk-documents.md).
49+
+ **Design a chunking strategy**. Embedding models have limits on the number of tokens they accept, so [data chunking](vector-search-how-to-chunk-documents.md) is necessary for large files.
5050

5151
+ **Optimize cost and performance**. Vector search is resource intensive and subject to maximum limits, so vectorize only the fields that contain semantic meaning. [Reduce vector size](vector-search-how-to-configure-compression-storage.md) to store more vectors for the same price.
5252

53-
+ **Choose the right embedding model**. Select a model for your use case, such as word embeddings for text-based searches or image embeddings for visual searches. Consider pretrained models, such as text-embedding-ada-002 from OpenAI or the Image Retrieval REST API from [Azure AI Computer Vision](/azure/ai-services/computer-vision/how-to/image-retrieval).
53+
+ **Choose the right embedding model**. Select a model for your use case, such as word embeddings for text-based searches or image embeddings for visual searches. Consider pretrained models, such as text-embedding-ada-002 from OpenAI or the Image Retrieval REST API from [Azure AI Vision](/azure/ai-services/computer-vision/how-to/image-retrieval).
5454

5555
+ **Normalize vector lengths**. To improve the accuracy and performance of similarity search, normalize vector lengths before you store them in a search index. Most pretrained models are already normalized.
5656

@@ -79,8 +79,8 @@ When you add knowledge to an agent workflow in the [Azure AI Foundry portal](htt
7979

8080
One step involves selecting an embedding model to vectorize your plain text content. The following models are supported:
8181

82-
+ text-embedding-3-large
8382
+ text-embedding-3-small
83+
+ text-embedding-3-large
8484
+ text-embedding-ada-002
8585
+ Cohere-embed-v3-english
8686
+ Cohere-embed-v3-multilingual
@@ -237,6 +237,8 @@ POST https://YOUR-OPENAI-RESOURCE.openai.azure.com/openai/deployments/text-embed
237237
}
238238
```
239239

240+
---
241+
240242
The output is a vector array of 1,536 dimensions.
241243

242244
## Related content

0 commit comments

Comments
 (0)