You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This concept article helps you use a vector index in Azure Machine Learning for performing Retrieval Augmented Generation (RAG). A vector index stores embeddings, which are numerical representations of concepts (data) converted to number sequences, which enable LLMs to understand the relationships between those concepts. Creating vector stores helps you to hook up your data with a large language model (LLM) like GPT-4 and retrieve the data efficiently.
23
+
This article describes vector indexes in Azure Machine Learning that you can use to perform retrieval-augmented generation (RAG). A vector index stores embeddings that are numerical representations of _concepts_ (data) converted to number sequences. Embeddings enable large language models (LLMs) to understand the relationships between the concepts. You can create vector stores to connect your data with LLMs like GPT-4, and retrieve the data efficiently.
24
+
25
+
Azure Machine Learning supports two vector stores that contain your supplemental data used in a RAG workflow:
26
+
27
+
| Vector store | Description | Features and usage |
28
+
| --- | --- | --- |
29
+
|**Faiss**| Open source library | - Use local file-based store <br> - Incur minimal costs <br> - Support vector-only data <br> - Support development and testing |
30
+
|**Azure AI Search**| Azure PaaS resource | - Store text data in search index <br> - Host large number of indexes with single service <br> - Support enterprise-level business requirements <br> - Access hybrid information retrieval |
22
31
23
-
Azure Machine Learning supports two types of vector stores that contain your supplemental data used in a RAG workflow:
32
+
The following sections explore considerations for working with these vector stores.
24
33
25
-
+[Faiss](https://github.com/facebookresearch/faiss) is an open source library that provides a local file-based store. The vector index is stored in the storage account of your Azure Machine Learning workspace. Since it's stored locally, the costs are minimal making it ideal for development and testing.
34
+
## Faisslibrary
26
35
27
-
+[Azure AI Search](/azure/search/search-what-is-azure-search) (formerly Cognitive Search) is an Azure resource that supports information retrieval over your vector and textual data stored in search indexes. A prompt flow can create, populate, and query your vector data stored in Azure AI Search.
36
+
[Faiss](https://github.com/facebookresearch/faiss)is an open source library that provides a local file-based store. The vector index is stored in the Azure storage account of your Azure Machine Learning workspace. To work with Faiss, you download the library and use it as a component of your solution. Because the index is stored locally, the costs are minimal.
28
37
29
-
## Choose a vector store
38
+
You can use the Faiss library as your vector store and perform the following actions:
30
39
31
-
You can use either store in prompt flow, so which one should you use?
40
+
- Store vector data locally, with no costs for creating an index (only storage cost)
32
41
33
-
**Faiss** is an open source library that you download and use a component of your solution. This library might be the best place to start if you have vector-only data. Some key points about working with Faiss:
42
+
- Build and query an index in memory
34
43
35
-
+ Local storage, with no costs for creating an index (only storage cost).
44
+
- Share copies for individual use, and configure hosting of the index for an application
36
45
37
-
+ You can build and query an index in memory.
46
+
- Scale with underlying compute loading index
38
47
39
-
+ You can share copies for individual use. If you want to host the index for an application, you need to set that up.
48
+
## Azure AI Search
40
49
41
-
+ Faiss scales with underlying compute loading index.
50
+
[Azure AI Search](/azure/search/search-what-is-azure-search) (formerly Cognitive Search) is a dedicated Azure PaaS resource that you create in an Azure subscription. The resource supports information retrieval over your vector and textual data stored in search indexes. A prompt flow can create, populate, and query your vector data stored in Azure AI Search. A single search service can host a large number of indexes, which can be queried and used in a RAG pattern.
42
51
43
-
**Azure AI Search** is a dedicated PaaS resource that you create in an Azure subscription. A single search service can host a large number of indexes, which can be queried and used in a RAG pattern. Some key points about using Azure AI Search for your vector store:
52
+
Here are some key points about using Azure AI Search for your vector store:
44
53
45
-
+ Supports enterprise level business requirements for scale, security, and availability.
54
+
- Support enterprise level business requirements for scale, security, and availability.
46
55
47
-
+ Supports hybrid information retrieval. Vector data can coexist with non-vector data, which means you can use any of the [features of Azure AI Search](/azure/search/search-features-list) for indexing and queries, including [hybrid search](/azure/search/vector-search-how-to-query) and [semantic reranking](/azure/search/semantic-ranking).
56
+
- Access hybrid information retrieval. Vector data can coexist with nonvector data, which means you can use any of the [features of Azure AI Search](/azure/search/search-features-list) for indexing and queries, including [hybrid search](/azure/search/vector-search-how-to-query) and [semantic reranking](/azure/search/semantic-search-overview).
48
57
49
-
+[Vector support is in public preview](/azure/search/vector-search-overview). Currently, vectors must be generated externally and then passed to Azure AI Search for indexing and query encoding. The prompt flow handles these transitions for you.
58
+
- Keep in mind that [vector support is in preview](/azure/search/vector-search-overview). Currently, vectors must be generated externally and then passed to Azure AI Search for indexing and query encoding. The prompt flow handles these transitions for you.
50
59
51
-
To use AI Search as a vector store for Azure Machine Learning, [you must have a search service](/azure/search/search-create-service-portal). Once the service exists and you've granted access to developers, you can choose **Azure AI Search** as a vector index in a prompt flow. The prompt flow creates the index on Azure AI Search, generates vectors from your source data, sends the vectors to the index, invokes similarity search on AI Search, and returns the response.
60
+
To use AI Search as a vector store for Azure Machine Learning, [you must have a search service](/azure/search/search-create-service-portal). After the service exists, and you grant access to developers, you can choose **Azure AI Search** as a vector index in a prompt flow. The prompt flow creates the index on Azure AI Search, generates vectors from your source data, sends the vectors to the index, invokes similarity search on AI Search, and returns the response.
52
61
53
-
## Next steps
62
+
## Related content
54
63
55
-
[How to create vector index in Azure Machine Learning prompt flow (preview)](how-to-create-vector-index.md)
64
+
-[Create vector index in Azure Machine Learning prompt flow (preview)](how-to-create-vector-index.md)
65
+
-[Vectors in Azure AI Search](/azure/search/vector-search-overview)
0 commit comments