You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cosmos-db/vector-database.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -7,7 +7,7 @@ ms.author: wangwilliam
7
7
ms.reviewer: sidandrews
8
8
ms.service: cosmos-db
9
9
ms.topic: conceptual
10
-
ms.date: 03/29/2024
10
+
ms.date: 03/30/2024
11
11
---
12
12
13
13
# Vector database
@@ -16,18 +16,18 @@ ms.date: 03/29/2024
16
16
17
17
Vector databases are used in numerous domains and situations across analytical and generative AI, including natural language processing, video and image recognition, recommendation system, search, etc.
18
18
19
-
Many AI-enhanced systems that emerged in 2023 use standalone vector databases that are distinct from NoSQL or relational databases in their tech stacks. However, besides adding a separate vector database, you might want to consider using an integrated vector database in the NoSQL or relational database you already use, especially when working with multi-modal data. By doing so, you can reduce cost while achieving greater data consistency, scale, and performance. Read on to find out why and how.
19
+
In 2023, a notable trend in software was the integration of AI enhancements, often achieved by incorporating specialized standalone vector databases into existing tech stacks. This article explains what vector databases are, as well as presents an alternative architecture that you might want to consider: using an integrated vector database in the NoSQL or relational database you already use, especially when working with multi-modal data. This approach not only allows you to reduce cost but also achieve greater data consistency, scale, and performance.
20
20
21
21
> [!TIP]
22
-
> Data consistency, scale, and performance guarantees are why OpenAI built its ChatGPT service on top of Azure Cosmos DB. You, too, can take advantage of its integrated vector database, as well as its single-digit millisecond response times, automatic and instant scalability, and guaranteed speed at any scale by referring to the [How to implement integrated vector database functionalities](#how-to-implement-integrated-vector-database-functionalities) section of this article.
22
+
> Data consistency, scale, and performance guarantees are why OpenAI built its ChatGPT service on top of Azure Cosmos DB. You, too, can take advantage of its integrated vector database, as well as its single-digit millisecond response times, automatic and instant scalability, and guaranteed speed at any scale. Please consult the [implementation samples](#how-to-implement-integrated-vector-database-functionalities) section of this article and [try](#next-step) the lifetime free tier or one of the free trial options.
23
23
24
24
## What is a vector database?
25
25
26
26
A vector database is a database designed to store and manage [vector embeddings](#embeddings), which are mathematical representations of data in a high-dimensional space. In this space, each dimension corresponds to a feature of the data, and tens of thousands of dimensions might be used to represent sophisticated data. A vector's position in this space represents its characteristics. Words, phrases, or entire documents, and images, audio, and other types of data can all be vectorized. These vector embeddings are used in similarity search, multi-modal search, recommendations engines, large languages models (LLMs), etc.
27
27
28
28
In a vector database, embeddings are indexed and queried through [vector search](#vector-search) algorithms based on their vector distance or similarity. A robust mechanism is necessary to identify the most relevant data. Some well-known vector search algorithms include Hierarchical Navigable Small World (HNSW), Inverted File (IVF), DiskANN, etc.
29
29
30
-
Besides the typical vector database functionalities above, an integrated vector database in a NoSQL or relational database converts the existing raw data in your account into embeddings and stores them alongside your original data. This way, you can avoid the extra cost of replicating your data in a separate vector database. Moreover, this architecture keeps your vector embeddings and original data together when you need to work with multi-modal data, and you can better achieve data consistency, scale, and performance.
30
+
Besides the typical vector database functionalities above, an integrated vector database in a highly performant NoSQL or relational database converts the existing raw data in your account into embeddings and stores them alongside your original data. This way, you can avoid the extra cost of replicating your data in a separate vector database. Moreover, this architecture keeps your vector embeddings and original data together, which better facilitates multi-modal data operations, and you can achieve greater data consistency, scale, and performance.
31
31
32
32
## What are some vector database use cases?
33
33
@@ -142,7 +142,7 @@ The natively integrated vector database in our NoSQL API will become available i
142
142
> [!div class="nextstepaction"]
143
143
> [Use the Azure Cosmos DB lifetime free tier](free-tier.md)
144
144
145
-
## Other Vector Databases
145
+
## More Vector Databases
146
146
147
147
-[Azure PostgreSQL Server pgvector Extension](../postgresql/flexible-server/how-to-use-pgvector.md)
148
148
-[Azure AI Search](../search/search-what-is-azure-search.md)
0 commit comments