Skip to content

Commit e9007c4

Browse files
Merge pull request #272551 from wmwxwa/patch-20
Update vector-database.md
2 parents 2ba8048 + bd10c90 commit e9007c4

File tree

1 file changed

+12
-9
lines changed

1 file changed

+12
-9
lines changed

articles/cosmos-db/vector-database.md

Lines changed: 12 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -35,7 +35,7 @@ A pure vector database is designed to efficiently store and manage vector embedd
3535

3636
A vector database that is integrated in a highly performant NoSQL or relational database provides additional capabilities. The integrated vector database in a NoSQL or relational database can store, index, and query embeddings alongside the corresponding original data. This approach eliminates the extra cost of replicating data in a separate pure vector database. Moreover, keeping the vector embeddings and original data together better facilitates multi-modal data operations, and enables greater data consistency, scale, and performance.
3737

38-
## What are some vector database use cases?
38+
### Vector database use cases
3939

4040
Vector databases are used in numerous domains and situations across analytical and generative AI, including natural language processing, video and image recognition, recommendation system, search, etc. For example, you can use a vector database to:
4141

@@ -78,11 +78,11 @@ A prompt refers to a specific text or information that can serve as an instructi
7878
- Cues: direct the LLM's output in the right direction
7979
- Supporting content: represents supplemental information the LLM can use to generate output
8080

81-
The process of creating good prompts for a scenario is called prompt engineering. For more information about prompts and best practices for prompt engineering, see Azure OpenAI Service [prompt engineering techniques](../ai-services/openai/concepts/advanced-prompt-engineering.md). [[Go back](#what-are-some-vector-database-use-cases)]
81+
The process of creating good prompts for a scenario is called prompt engineering. For more information about prompts and best practices for prompt engineering, see Azure OpenAI Service [prompt engineering techniques](../ai-services/openai/concepts/advanced-prompt-engineering.md). [[Go back](#vector-database-use-cases)]
8282

8383
### Tokens
8484

85-
Tokens are small chunks of text generated by splitting the input text into smaller segments. These segments can either be words or groups of characters, varying in length from a single character to an entire word. For instance, the word hamburger would be divided into tokens such as ham, bur, and ger while a short and common word like pear would be considered a single token. LLMs like ChatGPT, GPT-3.5, or GPT-4 break words into tokens for processing. [[Go back](#what-are-some-vector-database-use-cases)]
85+
Tokens are small chunks of text generated by splitting the input text into smaller segments. These segments can either be words or groups of characters, varying in length from a single character to an entire word. For instance, the word hamburger would be divided into tokens such as ham, bur, and ger while a short and common word like pear would be considered a single token. LLMs like ChatGPT, GPT-3.5, or GPT-4 break words into tokens for processing. [[Go back](#vector-database-use-cases)]
8686

8787
### Retrieval-augmented generation
8888

@@ -97,20 +97,17 @@ A simple RAG pattern using Azure Cosmos DB for NoSQL could be:
9797
5. Create a function to perform vector similarity search based on a user prompt
9898
6. Perform question answering over the data using an Azure OpenAI Completions model
9999

100-
The RAG pattern, with prompt engineering, serves the purpose of enhancing response quality by offering more contextual information to the model. RAG enables the model to apply a broader knowledge base by incorporating relevant external sources into the generation process, resulting in more comprehensive and informed responses. For more information on "grounding" LLMs, see [grounding LLMs](https://techcommunity.microsoft.com/t5/fasttrack-for-azure/grounding-llms/ba-p/3843857). [[Go back](#what-are-some-vector-database-use-cases)]
100+
The RAG pattern, with prompt engineering, serves the purpose of enhancing response quality by offering more contextual information to the model. RAG enables the model to apply a broader knowledge base by incorporating relevant external sources into the generation process, resulting in more comprehensive and informed responses. For more information on "grounding" LLMs, see [grounding LLMs](https://techcommunity.microsoft.com/t5/fasttrack-for-azure/grounding-llms/ba-p/3843857). [[Go back](#vector-database-use-cases)]
101101

102102
Here are multiple ways to implement RAG on your data by using our integrated vector database functionalities:
103103

104104
## How to implement integrated vector database functionalities
105105

106106
You can implement integrated vector database functionalities for the following [Azure Cosmos DB APIs](choose-api.md):
107107

108-
> [!NOTE]
109-
> For our NoSQL API, the native integration of a state-of-the-art vector indexing algorithm will be announced during Build in May 2024. Please stay tuned.
110-
111108
### API for MongoDB
112109

113-
Use the natively [integrated vector database in Azure Cosmos DB for MongoDB](mongodb/vcore/vector-search.md), which offers an efficient way to store, index, and search high-dimensional vector data directly alongside other application data. This approach removes the necessity of migrating your data to costlier alternative vector databases and provides a seamless integration of your AI-driven applications.
110+
Use the natively [integrated vector database in Azure Cosmos DB for MongoDB](mongodb/vcore/vector-search.md) (vCore architecture), which offers an efficient way to store, index, and search high-dimensional vector data directly alongside other application data. This approach removes the necessity of migrating your data to costlier alternative vector databases and provides a seamless integration of your AI-driven applications.
114111

115112
#### Code samples
116113

@@ -122,6 +119,9 @@ Use the natively [integrated vector database in Azure Cosmos DB for MongoDB](mon
122119
- [Python notebook tutorial - LLM Caching integration through LangChain](https://python.langchain.com/docs/integrations/llms/llm_caching#azure-cosmos-db-semantic-cache)
123120
- [Python - LlamaIndex integration](https://docs.llamaindex.ai/en/stable/examples/vector_stores/AzureCosmosDBMongoDBvCoreDemo.html)
124121
- [Python - Semantic Kernel memory integration](https://github.com/microsoft/semantic-kernel/tree/main/python/semantic_kernel/connectors/memory/azure_cosmosdb)
122+
123+
> [!div class="nextstepaction"]
124+
> [Use Azure Cosmos DB for MongoDB lifetime free tier](mongodb/vcore/free-tier.md)
125125
126126
### API for PostgreSQL
127127

@@ -133,6 +133,9 @@ Use the natively integrated vector database in [Azure Cosmos DB for PostgreSQL](
133133

134134
### NoSQL API
135135

136+
> [!NOTE]
137+
> For our NoSQL API, the native integration of a state-of-the-art vector indexing algorithm will be announced during Build in May 2024. Please stay tuned.
138+
136139
The natively integrated vector databaseg in the NoSQL API is under development. In the meantime, you may implement RAG patterns with Azure Cosmos DB for NoSQL and [Azure AI Search](../search/vector-search-overview.md). This approach enables powerful integration of your data residing in the NoSQL API into your AI-oriented applications.
137140

138141
#### Code samples
@@ -143,7 +146,7 @@ The natively integrated vector databaseg in the NoSQL API is under development.
143146
- [.NET tutorial - recipe chatbot w/ Semantic Kernel](https://github.com/microsoft/AzureDataRetrievalAugmentedGenerationSamples/tree/main/C%23/CosmosDB-NoSQL_CognitiveSearch_SemanticKernel)
144147
- [Python notebook tutorial - Azure product chatbot](https://github.com/microsoft/AzureDataRetrievalAugmentedGenerationSamples/tree/main/Python/CosmosDB-NoSQL_CognitiveSearch)
145148

146-
## Next step
149+
### Next step
147150

148151
[30-day Free Trial without Azure subscription](https://azure.microsoft.com/try/cosmosdb/)
149152

0 commit comments

Comments
 (0)