Skip to content

Commit 7967aa1

Browse files
Merge pull request #270719 from wmwxwa/patch-8
Update vector-database.md
2 parents 6cafa1a + 2b92f21 commit 7967aa1

File tree

1 file changed

+11
-10
lines changed

1 file changed

+11
-10
lines changed

articles/cosmos-db/vector-database.md

Lines changed: 11 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -16,9 +16,9 @@ ms.date: 03/29/2024
1616

1717
Vector databases are used in numerous domains and situations across analytical and generative AI, including natural language processing, video and image recognition, recommendation system, search, etc.
1818

19-
Many AI-enhanced systems that emerged in 2023 use standalone vector databases that are distinct from "traditional" databases in their tech stacks. Instead of adding a separate vector database, you might want to consider using vector databases integrated into NoSQL or relational databases when working with multi-modal data. By doing so, you avoid the extra cost of moving data to a separate database. Moreover, this architecture keeps your vector embeddings and original data together, and you can better achieve data consistency, scale, and performance.
19+
Many AI-enhanced systems that emerged in 2023 use standalone vector databases that are distinct from NoSQL or relational databases in their tech stacks. However, besides adding a separate vector database, you might want to consider using an integrated vector database in the NoSQL or relational database you already use, especially when working with multi-modal data. By doing so, you can reduce cost while achieving greater data consistency, scale, and performance. Read on to find out why and how.
2020

21-
> [!NOTE]
21+
> [!TIP]
2222
> Data consistency, scale, and performance guarantees are why OpenAI built its ChatGPT service on top of Azure Cosmos DB. You, too, can take advantage of its integrated vector database, as well as its single-digit millisecond response times, automatic and instant scalability, and guaranteed speed at any scale by referring to the [How to implement integrated vector database functionalities](#how-to-implement-integrated-vector-database-functionalities) section of this article.
2323
2424
## What is a vector database?
@@ -27,7 +27,7 @@ A vector database is a database designed to store and manage [vector embeddings]
2727

2828
In a vector database, embeddings are indexed and queried through [vector search](#vector-search) algorithms based on their vector distance or similarity. A robust mechanism is necessary to identify the most relevant data. Some well-known vector search algorithms include Hierarchical Navigable Small World (HNSW), Inverted File (IVF), DiskANN, etc.
2929

30-
Besides the above functionalities of a vector database that can only embeddings, an integrated vector database in a NoSQL or relational database converts the existing raw data in your account into embeddings and stores them alongside your original data. This way, you can better achieve data consistency, scale, and performance while avoiding the extra cost of replicating your data in a separate vector database.
30+
Besides the typical vector database functionalities above, an integrated vector database in a NoSQL or relational database converts the existing raw data in your account into embeddings and stores them alongside your original data. This way, you can avoid the extra cost of replicating your data in a separate vector database. Moreover, this architecture keeps your vector embeddings and original data together when you need to work with multi-modal data, and you can better achieve data consistency, scale, and performance.
3131

3232
## What are some vector database use cases?
3333

@@ -41,8 +41,8 @@ Vector databases are used in numerous domains and situations across analytical a
4141
- identify data anomalies or fraudulent activities that are dissimilar from predominant or normal patterns
4242
- implement persistent memory for AI agents
4343

44-
> [!NOTE]
45-
> Besides these typical use cases for vector database, our integrated vector database is also an ideal solution for production-level LLM caching thanks to its low latency, high scalability, and high availability.
44+
> [!TIP]
45+
> Besides these typical use cases for vector databases, our integrated vector database is also an ideal solution for production-level LLM caching thanks to its low latency, high scalability, and high availability.
4646
4747
It's especially popular to use vector databases to enable [retrieval-augmented generation (RAG)](#retrieval-augmented-generation) that harnesses LLMs and custom data or domain-specific information. This approach allows you to:
4848

@@ -97,11 +97,13 @@ Here are multiple ways to implement RAG on your data by using our integrated vec
9797

9898
## How to implement integrated vector database functionalities
9999

100+
You can implement integrated vector database functionalities for the following [Azure Cosmos DB APIs](choose-api.md):
101+
100102
### API for MongoDB
101103

102104
Use the natively [integrated vector database in Azure Cosmos DB for MongoDB vCore](mongodb/vcore/vector-search.md), which offers an efficient way to store, index, and search high-dimensional vector data directly alongside other application data. This approach removes the necessity of migrating your data to costlier alternative vector databases and provides a seamless integration of your AI-driven applications.
103105

104-
### Code samples
106+
#### Code samples
105107

106108
- [.NET RAG Pattern retail reference solution](https://github.com/Azure/Vector-Search-AI-Assistant-MongoDBvCore)
107109
- [.NET tutorial - recipe chatbot](https://github.com/microsoft/AzureDataRetrievalAugmentedGenerationSamples/tree/main/C%23/CosmosDB-MongoDBvCore)
@@ -115,15 +117,15 @@ Use the natively [integrated vector database in Azure Cosmos DB for MongoDB vCor
115117

116118
Use the natively integrated vector database in [Azure Cosmos DB for PostgreSQL](postgresql/howto-use-pgvector.md), which offers an efficient way to store, index, and search high-dimensional vector data directly alongside other application data. This approach removes the necessity of migrating your data to costlier alternative vector databases and provides a seamless integration of your AI-driven applications.
117119

118-
### Code samples
120+
#### Code samples
119121

120122
- Python: [Python notebook tutorial - food review chatbot](https://github.com/microsoft/AzureDataRetrievalAugmentedGenerationSamples/tree/main/Python/CosmosDB-PostgreSQL_CognitiveSearch)
121123

122124
### NoSQL API
123125

124126
The natively integrated vector database in our NoSQL API will become available in mid-2024. In the meantime, you may implement RAG patterns with Azure Cosmos DB for NoSQL and [Azure AI Search](../search/vector-search-overview.md). This approach enables powerful integration of your data residing in the NoSQL API into your AI-oriented applications.
125127

126-
### Code samples
128+
#### Code samples
127129

128130
- [.NET tutorial - Build and Modernize AI Applications](https://github.com/Azure/Build-Modern-AI-Apps-Hackathon)
129131
- [.NET tutorial - Bring Your Data to ChatGPT](https://github.com/Azure/Vector-Search-AI-Assistant/tree/cognitive-search-vector)
@@ -140,9 +142,8 @@ The natively integrated vector database in our NoSQL API will become available i
140142
> [!div class="nextstepaction"]
141143
> [Use the Azure Cosmos DB lifetime free tier](free-tier.md)
142144
143-
## Related content
145+
## Other Vector Databases
144146

145-
- [Azure Cosmos DB for MongoDB vCore Integrated Vector Database](mongodb/vcore/vector-search.md)
146147
- [Azure PostgreSQL Server pgvector Extension](../postgresql/flexible-server/how-to-use-pgvector.md)
147148
- [Azure AI Search](../search/search-what-is-azure-search.md)
148149
- [Open Source Vector Database List](/semantic-kernel/memories/vector-db#available-connectors-to-vector-databases)

0 commit comments

Comments
 (0)