Skip to content

Commit 27f6d03

Browse files
authored
Update vector-database.md
1 parent 7c303b4 commit 27f6d03

File tree

1 file changed

+6
-4
lines changed

1 file changed

+6
-4
lines changed

articles/cosmos-db/vector-database.md

Lines changed: 6 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -23,15 +23,17 @@ In 2023, a notable trend in software was the integration of AI enhancements, oft
2323
2424
## What is a vector database?
2525

26-
### Pure vector database basic functionalities
27-
2826
A vector database is a database designed to store and manage [vector embeddings](#embeddings), which are mathematical representations of data in a high-dimensional space. In this space, each dimension corresponds to a feature of the data, and tens of thousands of dimensions might be used to represent sophisticated data. A vector's position in this space represents its characteristics. Words, phrases, or entire documents, and images, audio, and other types of data can all be vectorized. These vector embeddings are used in similarity search, multi-modal search, recommendations engines, large languages models (LLMs), etc.
2927

3028
In a vector database, embeddings are indexed and queried through [vector search](#vector-search) algorithms based on their vector distance or similarity. A robust mechanism is necessary to identify the most relevant data. Some well-known vector search algorithms include Hierarchical Navigable Small World (HNSW), Inverted File (IVF), DiskANN, etc.
3129

32-
### Integrated vector database
30+
### Integrated vector database vs pure vector database
31+
32+
There are two common types of vector database implementations - pure vector database and integrated vector database in a NoSQL or relational database.
33+
34+
A pure vector database is designed to efficiently store and manage vector embeddings, along with a small amount of metadata; it is separate from the data source from which the embeddings are derived.
3335

34-
An integrated vector database in a highly performant NoSQL or relational database provides additional capabilities beyond the basic functionalies of a vector database. A pure vector database is designed to effeciently store and manage vector embeddings, along with a small amoung of metadata; it is separate from the data source from which the embeddings are derived. Whereas, an integrated vector database converts the existing data in a nonrelational or relational database into embeddings and stores them alongside the original data. This approach eliminates the extra cost of replicating data in a separate pure vector database. Moreover, this architecture keeps the vector embeddings and original data together, which better facilitates multi-modal data operations, and enable greater data consistency, scale, and performance.
36+
A vector database that is integrated in a highly performant NoSQL or relational database provides additional capabilities. The integrated vector database converts the existing data in a NoSQL or relational database into embeddings and stores them alongside the original data. This approach eliminates the extra cost of replicating data in a separate pure vector database. Moreover, this architecture keeps the vector embeddings and original data together, which better facilitates multi-modal data operations, and enables greater data consistency, scale, and performance.
3537

3638
## What are some vector database use cases?
3739

0 commit comments

Comments
 (0)