You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/cosmos-db/mongodb/vcore/vector-search-ai.md
+11-12Lines changed: 11 additions & 12 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -19,9 +19,9 @@ When developers select vector databases, the open-source options provide numerou
19
19
20
20
Another advantage of open-source vector databases is the strong community support they enjoy. Active user communities often contribute to the development of these databases, provide support, and share best practices, promoting innovation.
21
21
22
-
Some individuals also opt for open-source vector databases because they are "free," meaning there is no cost to acquire or use the software. However, for this purpose users can also use the free benefits through the lifetime free tiers offered by managed vector database services. These managed services provide not only cost-free access up to a certain usage limit but also simplify the operational burden by handling maintenance, updates, and scalability. Therefore, users who seek the economic advantages of open-source solutions can achieve similar cost savings using the free tier of managed vector database services, which provide the added benefit of reduced management overhead. This approach offers a balance between cost efficiency and operational convenience, allowing users to focus more on their core activities rather than on database administration.
22
+
Some individuals opt for open-source vector databases because they are "free," meaning there's no cost to acquire or use the software. An alternative is using the free tiers offered by managed vector database services. These managed services provide not only cost-free access up to a certain usage limit but also simplify the operational burden by handling maintenance, updates, and scalability. Therefore, by using the free tier of managed vector database services, users can achieve cost savings while reducing management overhead. This approach allows users to focus more on their core activities rather than on database administration.
23
23
24
-
## Working Mechanism of Open-source Vector Databases
24
+
## Working mechanism of open-source vector databases
25
25
26
26
Open-source vector databases are designed to store and manage vector embeddings, which are mathematical representations of data in a high-dimensional space. In this space, each dimension corresponds to a feature of the data, and tens of thousands of dimensions might be used to represent sophisticated data. A vector's position in this space represents its characteristics. Words, phrases, or entire documents, and images, audio, and other types of data can all be vectorized. These vector embeddings are used in similarity search, multi-modal search, recommendations engines, large languages models (LLMs), etc.
27
27
@@ -40,33 +40,32 @@ Vector databases are used in numerous domains and situations across analytical a
40
40
- implement persistent memory for AI agents
41
41
- enable retrieval-augmented generation (RAG)
42
42
43
-
## Selecting the Appropriate Open-source Vector Database
43
+
## Selecting the best open-source vector database
44
44
45
-
Choosing the best open-source vector database requires considering several factors. Performance and scalability of the database are crucial, as they impact whether the database can handle your specific workload requirements. Databases with efficient indexing and querying capabilities usually offer optimal performance.
46
-
47
-
Another factor is the community support and documentation available for the database. A robust community and ample documentation can provide valuable assistance. Comparing different open-source vector database options based on features, supported data types, and compatibility with existing tools and frameworks is critical to finding the best fit for your needs. Ease of installation, configuration, and maintenance should also be considered to ensure smooth integration into your workflow.
48
-
49
-
Here are some popular open-source vector databases:
45
+
Choosing the best open-source vector database requires considering several factors. Performance and scalability of the database are crucial, as they impact whether the database can handle your specific workload requirements. Databases with efficient indexing and querying capabilities usually offer optimal performance. Another factor is the community support and documentation available for the database. A robust community and ample documentation can provide valuable assistance. Here are some popular open-source vector databases:
50
46
51
47
- Chroma
52
48
- Milvus
53
49
- Qdrant
54
50
- Weaviate
55
51
56
-
## Challenges with Open-source Vector Databases
52
+
>[!NOTE]
53
+
>The most popular option may not be the best option for you. To find the best fit for your needs, you should compare different options based on features, supported data types, compatibility with existing tools and frameworks you use. Ease of installation, configuration, and maintenance should also be considered to ensure smooth integration into your workflow.
54
+
55
+
## Challenges with open-source vector databases
57
56
58
57
Open-source vector databases pose challenges that are typical of open-source software:
59
58
60
59
- Setup: Users need in-depth knowledge to install, configure, and operate, especially for complex deployments. Optimizing resources and configuration while scaling up operation requires close monitoring and adjustments.
61
-
- Maintenance: Users must manage their own updates, patches, and maintenance. Thus, ML expertise would not suffice; users must also have extensive experience in database administration.
60
+
- Maintenance: Users must manage their own updates, patches, and maintenance. Thus, ML expertise wouldn't suffice; users must also have extensive experience in database administration.
62
61
- Support: Official support can be limited compared to managed services, relying more on community assistance.
63
62
64
63
Therefore, while free initially, open-source vector databases incur significant costs when scaling up. Expanding operations necessitates more hardware, skilled IT staff, and advanced infrastructure management, leading to higher expenses in hardware, personnel, and operational costs. Scaling open-source vector databases can be financially demanding despite the lack of licensing fees.
65
64
66
-
## Addressing the Challenges with Open-Source Vector Databases
65
+
## Addressing the challenges
67
66
68
67
A fully managed database service helps developers avoid the hassles from setting up, maintaining, and relying on community assistance for an open-source vector database. The Integrated Vector Database in Azure Cosmos DB for MongoDB vCore offers a life-time free tier. It allows developers to enjoy the same financial benefit associated with open-source vector databases, while the service provider handles maintenance, updates, and scalability. When it’s time to scale up operations, upgrading is quick and easy while keeping a low [total cost of ownership (TCO)](introduction.md#low-total-cost-of-ownership-tco).
69
68
70
69
## Next steps
71
70
> [!div class="nextstepaction"]
72
-
> [Create a lifetime free-tier vCore cluster for Azure Cosmos DB for MongoDB]
71
+
> [Create a lifetime free-tier vCore cluster for Azure Cosmos DB for MongoDB](free-tier.md)
0 commit comments