Commit 8902481

Merge pull request #280579 from GitHubber17/259443-d
Freshness - Azure Machine Learning
2 parents e8b1b93 + dac4e1b commit 8902481

1 file changed: 32 additions & 22 deletions
@@ -1,7 +1,7 @@
 ---
-title: Concept Vector Stores in Azure Machine Learning (preview)
+title: Concept vector stores in Azure Machine Learning (preview)
 titleSuffix: Azure Machine Learning embedding vectors in AzureML
-description: This concept article helps you use a vector index in Azure Machine Learning for performing Retrieval Augmented Generation.
+description: Explore concepts about vector indexes in Azure Machine Learning and how you can use them for retrieval-augmented generation.
 services: machine-learning
 ms.author: ssalgado
 author: ssalgadodev
@@ -10,46 +10,56 @@ ms.service: machine-learning
 ms.subservice: core
 ms.custom:
 - ignite-2023
-ms.date: 07/27/2023
-ms.topic: conceptual
+ms.date: 07/17/2024
+ms.topic: concept-article
+
+#customer intent: As a developer, I want to use vector indexes in Azure Machine Learning so that I can perform retrieval-augmented generation.
 ---
 
 # Vector stores in Azure Machine Learning (preview)
 
 [!INCLUDE [machine-learning-preview-generic-disclaimer](includes/machine-learning-preview-generic-disclaimer.md)]
 
-This concept article helps you use a vector index in Azure Machine Learning for performing Retrieval Augmented Generation (RAG). A vector index stores embeddings, which are numerical representations of concepts (data) converted to number sequences, which enable LLMs to understand the relationships between those concepts. Creating vector stores helps you to hook up your data with a large language model (LLM) like GPT-4 and retrieve the data efficiently.
+This article describes vector indexes in Azure Machine Learning that you can use to perform retrieval-augmented generation (RAG). A vector index stores embeddings that are numerical representations of _concepts_ (data) converted to number sequences. Embeddings enable large language models (LLMs) to understand the relationships between the concepts. You can create vector stores to connect your data with LLMs like GPT-4, and retrieve the data efficiently.
+
+Azure Machine Learning supports two vector stores that contain your supplemental data used in a RAG workflow:
+
+| Vector store | Description | Features and usage |
+| --- | --- | --- |
+| **Faiss** | Open source library | - Use local file-based store <br> - Incur minimal costs <br> - Support vector-only data <br> - Support development and testing |
+| **Azure AI Search** | Azure PaaS resource | - Store text data in search index <br> - Host large number of indexes with single service <br> - Support enterprise-level business requirements <br> - Access hybrid information retrieval |
 
-Azure Machine Learning supports two types of vector stores that contain your supplemental data used in a RAG workflow:
+The following sections explore considerations for working with these vector stores.
 
-+ [Faiss](https://github.com/facebookresearch/faiss) is an open source library that provides a local file-based store. The vector index is stored in the storage account of your Azure Machine Learning workspace. Since it's stored locally, the costs are minimal making it ideal for development and testing.
+## Faiss library
 
-+ [Azure AI Search](/azure/search/search-what-is-azure-search) (formerly Cognitive Search) is an Azure resource that supports information retrieval over your vector and textual data stored in search indexes. A prompt flow can create, populate, and query your vector data stored in Azure AI Search.
+[Faiss](https://github.com/facebookresearch/faiss) is an open source library that provides a local file-based store. The vector index is stored in the Azure storage account of your Azure Machine Learning workspace. To work with Faiss, you download the library and use it as a component of your solution. Because the index is stored locally, the costs are minimal.
 
-## Choose a vector store
+You can use the Faiss library as your vector store and perform the following actions:
 
-You can use either store in prompt flow, so which one should you use?
+- Store vector data locally, with no costs for creating an index (only storage cost)
 
-**Faiss** is an open source library that you download and use a component of your solution. This library might be the best place to start if you have vector-only data. Some key points about working with Faiss:
+- Build and query an index in memory
 
-+ Local storage, with no costs for creating an index (only storage cost).
+- Share copies for individual use, and configure hosting of the index for an application
 
-+ You can build and query an index in memory.
+- Scale with underlying compute loading index
 
-+ You can share copies for individual use. If you want to host the index for an application, you need to set that up.
+## Azure AI Search
 
-+ Faiss scales with underlying compute loading index.
+[Azure AI Search](/azure/search/search-what-is-azure-search) (formerly Cognitive Search) is a dedicated Azure PaaS resource that you create in an Azure subscription. The resource supports information retrieval over your vector and textual data stored in search indexes. A prompt flow can create, populate, and query your vector data stored in Azure AI Search. A single search service can host a large number of indexes, which can be queried and used in a RAG pattern.
 
-**Azure AI Search** is a dedicated PaaS resource that you create in an Azure subscription. A single search service can host a large number of indexes, which can be queried and used in a RAG pattern. Some key points about using Azure AI Search for your vector store:
+Here are some key points about using Azure AI Search for your vector store:
 
-+ Supports enterprise level business requirements for scale, security, and availability.
+- Support enterprise level business requirements for scale, security, and availability.
 
-+ Supports hybrid information retrieval. Vector data can coexist with non-vector data, which means you can use any of the [features of Azure AI Search](/azure/search/search-features-list) for indexing and queries, including [hybrid search](/azure/search/vector-search-how-to-query) and [semantic reranking](/azure/search/semantic-ranking).
+- Access hybrid information retrieval. Vector data can coexist with nonvector data, which means you can use any of the [features of Azure AI Search](/azure/search/search-features-list) for indexing and queries, including [hybrid search](/azure/search/vector-search-how-to-query) and [semantic reranking](/azure/search/semantic-search-overview).
 
-+ [Vector support is in public preview](/azure/search/vector-search-overview). Currently, vectors must be generated externally and then passed to Azure AI Search for indexing and query encoding. The prompt flow handles these transitions for you.
+- Keep in mind that [vector support is in preview](/azure/search/vector-search-overview). Currently, vectors must be generated externally and then passed to Azure AI Search for indexing and query encoding. The prompt flow handles these transitions for you.
 
-To use AI Search as a vector store for Azure Machine Learning, [you must have a search service](/azure/search/search-create-service-portal). Once the service exists and you've granted access to developers, you can choose **Azure AI Search** as a vector index in a prompt flow. The prompt flow creates the index on Azure AI Search, generates vectors from your source data, sends the vectors to the index, invokes similarity search on AI Search, and returns the response.
+To use AI Search as a vector store for Azure Machine Learning, [you must have a search service](/azure/search/search-create-service-portal). After the service exists, and you grant access to developers, you can choose **Azure AI Search** as a vector index in a prompt flow. The prompt flow creates the index on Azure AI Search, generates vectors from your source data, sends the vectors to the index, invokes similarity search on AI Search, and returns the response.
 
-## Next steps
+## Related content
 
-[How to create vector index in Azure Machine Learning prompt flow (preview)](how-to-create-vector-index.md)
+- [Create vector index in Azure Machine Learning prompt flow (preview)](how-to-create-vector-index.md)
+- [Vectors in Azure AI Search](/azure/search/vector-search-overview)
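
The retrieval flow the updated article describes (create an index, generate vectors from source data, send them to the index, run a similarity search, return the response) can be sketched in plain Python. This is only a rough illustration under stated assumptions: it uses neither Faiss nor Azure AI Search, `toy_embed` is a hypothetical stand-in for a real embedding model, and `InMemoryVectorIndex`, `DOCS`, and `VOCAB_DIM` are invented names, not part of any library or of this commit.

```python
import math
import re
from collections import Counter

VOCAB_DIM = 64  # hypothetical vector size for the toy embedding

# Hypothetical sample data, paraphrased from the article text.
DOCS = [
    "Faiss is an open source library that provides a local file-based store",
    "Azure AI Search supports hybrid search and semantic reranking",
    "Prompt flow generates vectors from your source data",
]


def toy_embed(text: str) -> list[float]:
    """Hypothetical stand-in for an embedding model: a hashed bag of words.

    Each word is mapped to a bucket by summing its character codes, and the
    resulting count vector is scaled to unit length.
    """
    vec = [0.0] * VOCAB_DIM
    words = re.findall(r"[a-z0-9]+", text.lower())
    for word, count in Counter(words).items():
        vec[sum(map(ord, word)) % VOCAB_DIM] += count
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


class InMemoryVectorIndex:
    """Brute-force vector index kept entirely in memory (illustrative only)."""

    def __init__(self) -> None:
        self._texts: list[str] = []
        self._vectors: list[list[float]] = []

    def add(self, text: str) -> None:
        # Generate a vector from the source text and store both together.
        self._texts.append(text)
        self._vectors.append(toy_embed(text))

    def search(self, query: str, k: int = 1) -> list[tuple[float, str]]:
        # Vectors are unit length, so the dot product is the cosine similarity.
        q = toy_embed(query)
        scored = [
            (sum(a * b for a, b in zip(q, vec)), text)
            for vec, text in zip(self._vectors, self._texts)
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return scored[:k]


if __name__ == "__main__":
    index = InMemoryVectorIndex()
    for doc in DOCS:
        index.add(doc)
    for score, text in index.search("hybrid search in Azure AI Search", k=1):
        print(f"{score:.3f}  {text}")
```

A real vector store replaces the linear scan in `search` with an optimized index structure, and a hosted service such as Azure AI Search adds the scale, security, and hybrid retrieval features the article lists.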
