Skip to content

Commit 5d57505

Browse files
Merge pull request #270721 from wmwxwa/patch-8
Update vector-database.md
2 parents 0506824 + 78999d6 commit 5d57505

File tree

1 file changed

+5
-5
lines changed

1 file changed

+5
-5
lines changed

articles/cosmos-db/vector-database.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.author: wangwilliam
77
ms.reviewer: sidandrews
88
ms.service: cosmos-db
99
ms.topic: conceptual
10-
ms.date: 03/29/2024
10+
ms.date: 03/30/2024
1111
---
1212

1313
# Vector database
@@ -16,18 +16,18 @@ ms.date: 03/29/2024
1616

1717
Vector databases are used in numerous domains and situations across analytical and generative AI, including natural language processing, video and image recognition, recommendation system, search, etc.
1818

19-
Many AI-enhanced systems that emerged in 2023 use standalone vector databases that are distinct from NoSQL or relational databases in their tech stacks. However, besides adding a separate vector database, you might want to consider using an integrated vector database in the NoSQL or relational database you already use, especially when working with multi-modal data. By doing so, you can reduce cost while achieving greater data consistency, scale, and performance. Read on to find out why and how.
19+
In 2023, a notable trend in software was the integration of AI enhancements, often achieved by incorporating specialized standalone vector databases into existing tech stacks. This article explains what vector databases are, as well as presents an alternative architecture that you might want to consider: using an integrated vector database in the NoSQL or relational database you already use, especially when working with multi-modal data. This approach not only allows you to reduce cost but also achieve greater data consistency, scale, and performance.
2020

2121
> [!TIP]
22-
> Data consistency, scale, and performance guarantees are why OpenAI built its ChatGPT service on top of Azure Cosmos DB. You, too, can take advantage of its integrated vector database, as well as its single-digit millisecond response times, automatic and instant scalability, and guaranteed speed at any scale by referring to the [How to implement integrated vector database functionalities](#how-to-implement-integrated-vector-database-functionalities) section of this article.
22+
> Data consistency, scale, and performance guarantees are why OpenAI built its ChatGPT service on top of Azure Cosmos DB. You, too, can take advantage of its integrated vector database, as well as its single-digit millisecond response times, automatic and instant scalability, and guaranteed speed at any scale. Please consult the [implementation samples](#how-to-implement-integrated-vector-database-functionalities) section of this article and [try](#next-step) the lifetime free tier or one of the free trial options.
2323
2424
## What is a vector database?
2525

2626
A vector database is a database designed to store and manage [vector embeddings](#embeddings), which are mathematical representations of data in a high-dimensional space. In this space, each dimension corresponds to a feature of the data, and tens of thousands of dimensions might be used to represent sophisticated data. A vector's position in this space represents its characteristics. Words, phrases, or entire documents, and images, audio, and other types of data can all be vectorized. These vector embeddings are used in similarity search, multi-modal search, recommendations engines, large languages models (LLMs), etc.
2727

2828
In a vector database, embeddings are indexed and queried through [vector search](#vector-search) algorithms based on their vector distance or similarity. A robust mechanism is necessary to identify the most relevant data. Some well-known vector search algorithms include Hierarchical Navigable Small World (HNSW), Inverted File (IVF), DiskANN, etc.
2929

30-
Besides the typical vector database functionalities above, an integrated vector database in a NoSQL or relational database converts the existing raw data in your account into embeddings and stores them alongside your original data. This way, you can avoid the extra cost of replicating your data in a separate vector database. Moreover, this architecture keeps your vector embeddings and original data together when you need to work with multi-modal data, and you can better achieve data consistency, scale, and performance.
30+
Besides the typical vector database functionalities above, an integrated vector database in a highly performant NoSQL or relational database converts the existing raw data in your account into embeddings and stores them alongside your original data. This way, you can avoid the extra cost of replicating your data in a separate vector database. Moreover, this architecture keeps your vector embeddings and original data together, which better facilitates multi-modal data operations, and you can achieve greater data consistency, scale, and performance.
3131

3232
## What are some vector database use cases?
3333

@@ -142,7 +142,7 @@ The natively integrated vector database in our NoSQL API will become available i
142142
> [!div class="nextstepaction"]
143143
> [Use the Azure Cosmos DB lifetime free tier](free-tier.md)
144144
145-
## Other Vector Databases
145+
## More Vector Databases
146146

147147
- [Azure PostgreSQL Server pgvector Extension](../postgresql/flexible-server/how-to-use-pgvector.md)
148148
- [Azure AI Search](../search/search-what-is-azure-search.md)

0 commit comments

Comments
 (0)