You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cosmos-db/vector-database.md
+11-10Lines changed: 11 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -16,9 +16,9 @@ ms.date: 03/29/2024
16
16
17
17
Vector databases are used in numerous domains and situations across analytical and generative AI, including natural language processing, video and image recognition, recommendation system, search, etc.
18
18
19
-
Many AI-enhanced systems that emerged in 2023 use standalone vector databases that are distinct from "traditional" databases in their tech stacks. Instead of adding a separate vector database, you might want to consider using vector databases integrated into NoSQL or relational databases when working with multi-modal data. By doing so, you avoid the extra cost of moving data to a separate database. Moreover, this architecture keeps your vector embeddings and original data together, and you can better achieve data consistency, scale, and performance.
19
+
Many AI-enhanced systems that emerged in 2023 use standalone vector databases that are distinct from NoSQL or relational databases in their tech stacks. However, besides adding a separate vector database, you might want to consider using an integrated vector database in the NoSQL or relational database you already use, especially when working with multi-modal data. By doing so, you can reduce cost while achieving greater data consistency, scale, and performance. Read on to find out why and how.
20
20
21
-
> [!NOTE]
21
+
> [!TIP]
22
22
> Data consistency, scale, and performance guarantees are why OpenAI built its ChatGPT service on top of Azure Cosmos DB. You, too, can take advantage of its integrated vector database, as well as its single-digit millisecond response times, automatic and instant scalability, and guaranteed speed at any scale by referring to the [How to implement integrated vector database functionalities](#how-to-implement-integrated-vector-database-functionalities) section of this article.
23
23
24
24
## What is a vector database?
@@ -27,7 +27,7 @@ A vector database is a database designed to store and manage [vector embeddings]
27
27
28
28
In a vector database, embeddings are indexed and queried through [vector search](#vector-search) algorithms based on their vector distance or similarity. A robust mechanism is necessary to identify the most relevant data. Some well-known vector search algorithms include Hierarchical Navigable Small World (HNSW), Inverted File (IVF), DiskANN, etc.
29
29
30
-
Besides the above functionalities of a vector database that can only embeddings, an integrated vector database in a NoSQL or relational database converts the existing raw data in your account into embeddings and stores them alongside your original data. This way, you can better achieve data consistency, scale, and performance while avoiding the extra cost of replicating your data in a separate vector database.
30
+
Besides the typical vector database functionalities above, an integrated vector database in a NoSQL or relational database converts the existing raw data in your account into embeddings and stores them alongside your original data. This way, you can avoid the extra cost of replicating your data in a separate vector database. Moreover, this architecture keeps your vector embeddings and original data together when you need to work with multi-modal data, and you can better achieve data consistency, scale, and performance.
31
31
32
32
## What are some vector database use cases?
33
33
@@ -41,8 +41,8 @@ Vector databases are used in numerous domains and situations across analytical a
41
41
- identify data anomalies or fraudulent activities that are dissimilar from predominant or normal patterns
42
42
- implement persistent memory for AI agents
43
43
44
-
> [!NOTE]
45
-
> Besides these typical use cases for vector database, our integrated vector database is also an ideal solution for production-level LLM caching thanks to its low latency, high scalability, and high availability.
44
+
> [!TIP]
45
+
> Besides these typical use cases for vector databases, our integrated vector database is also an ideal solution for production-level LLM caching thanks to its low latency, high scalability, and high availability.
46
46
47
47
It's especially popular to use vector databases to enable [retrieval-augmented generation (RAG)](#retrieval-augmented-generation) that harnesses LLMs and custom data or domain-specific information. This approach allows you to:
48
48
@@ -97,11 +97,13 @@ Here are multiple ways to implement RAG on your data by using our integrated vec
97
97
98
98
## How to implement integrated vector database functionalities
99
99
100
+
You can implement integrated vector database functionalities for the following [Azure Cosmos DB APIs](choose-api.md):
101
+
100
102
### API for MongoDB
101
103
102
104
Use the natively [integrated vector database in Azure Cosmos DB for MongoDB vCore](mongodb/vcore/vector-search.md), which offers an efficient way to store, index, and search high-dimensional vector data directly alongside other application data. This approach removes the necessity of migrating your data to costlier alternative vector databases and provides a seamless integration of your AI-driven applications.
@@ -115,15 +117,15 @@ Use the natively [integrated vector database in Azure Cosmos DB for MongoDB vCor
115
117
116
118
Use the natively integrated vector database in [Azure Cosmos DB for PostgreSQL](postgresql/howto-use-pgvector.md), which offers an efficient way to store, index, and search high-dimensional vector data directly alongside other application data. This approach removes the necessity of migrating your data to costlier alternative vector databases and provides a seamless integration of your AI-driven applications.
The natively integrated vector database in our NoSQL API will become available in mid-2024. In the meantime, you may implement RAG patterns with Azure Cosmos DB for NoSQL and [Azure AI Search](../search/vector-search-overview.md). This approach enables powerful integration of your data residing in the NoSQL API into your AI-oriented applications.
125
127
126
-
### Code samples
128
+
####Code samples
127
129
128
130
-[.NET tutorial - Build and Modernize AI Applications](https://github.com/Azure/Build-Modern-AI-Apps-Hackathon)
129
131
-[.NET tutorial - Bring Your Data to ChatGPT](https://github.com/Azure/Vector-Search-AI-Assistant/tree/cognitive-search-vector)
@@ -140,9 +142,8 @@ The natively integrated vector database in our NoSQL API will become available i
140
142
> [!div class="nextstepaction"]
141
143
> [Use the Azure Cosmos DB lifetime free tier](free-tier.md)
142
144
143
-
## Related content
145
+
## Other Vector Databases
144
146
145
-
-[Azure Cosmos DB for MongoDB vCore Integrated Vector Database](mongodb/vcore/vector-search.md)
146
147
-[Azure PostgreSQL Server pgvector Extension](../postgresql/flexible-server/how-to-use-pgvector.md)
147
148
-[Azure AI Search](../search/search-what-is-azure-search.md)
0 commit comments