Merge pull request #295553 from flang-msft/fxl---freshness-for-AI-articles

prmerger-automator[bot] · web-flow · commit caabd60036e2 · 2025-03-21T04:22:43.000Z
Fxl---freshness for ai articles
diff --git a/articles/azure-cache-for-redis/cache-overview-vector-similarity.md b/articles/azure-cache-for-redis/cache-overview-vector-similarity.md
@@ -7,36 +7,31 @@ ms.collection: ce-skilling-ai-copilot
 ms.topic: overview
 ms.custom:
   - ignite-2024
-ms.date: 04/24/2024
+ms.date: 02/27/2025
 ---
 
 # What are Vector Embeddings and Vector Search in Azure Cache for Redis?
 
 Vector similarity search (VSS) has become a popular technology for AI-powered intelligent applications. Azure Cache for Redis can be used as a vector database when combined with models like [Azure OpenAI](/azure/ai-services/openai/overview) for Retrieval-Augmented Generative AI and other analysis scenarios. This article is a high-level introduction to the concept of vector embeddings, vector similarity search, and how Redis can be used as a vector database powering intelligent applications.
 
-For tutorials and sample applications on how to use Azure Cache for Redis and Azure OpenAI to perform vector similarity search, see the following:
+For tutorials and sample applications on how to use the Enterprise tier or Azure Managed Redis with Azure OpenAI, see the following:
 
-- [Tutorial: Conduct vector similarity search on Azure OpenAI embeddings using Azure Cache for Redis with LangChain](./cache-tutorial-vector-similarity.md)
-- [Sample: Using Redis as vector database in a Chatbot application with .NET Semantic Kernel](https://github.com/CawaMS/chatappredis)
-- [Sample: Using Redis as semantic cache in a Dall-E powered image gallery with Redis OM for .NET](https://github.com/CawaMS/OutputCacheOpenAI)
+- [Tutorial: Conduct vector similarity search on Azure OpenAI embeddings using Enterprise tier or Azure Managed Redis with LangChain](./cache-tutorial-vector-similarity.md)
+- [Sample: Using Redis as semantic cache in a Dall-E powered image gallery with Redis OM for .NET](https://github.com/Azure-Samples/azure-redis-dalle-semantic-caching)
 
 ## Scope of Availability
 
-Vector search capabilities in Redis require [Redis Stack](https://redis.io/docs/latest/operate/oss_and_stack/stack-with-enterprise/), specifically the [RediSearch](https://redis.io/docs/interact/search-and-query/) module. This capability is only available in the [Enterprise tiers of Azure Cache for Redis](./cache-redis-modules.md).
+Vector search capabilities in Redis require [Redis Stack](https://redis.io/docs/latest/operate/oss_and_stack/stack-with-enterprise/), specifically the [RediSearch](https://redis.io/docs/interact/search-and-query/) module. This capability is only available in the [Enterprise tiers of Azure Cache for Redis](./cache-redis-modules.md) and Azure Managed Redis.
 
 This table contains the information for vector search availability in different tiers.
 
-|Tier      | Basic / Standard  | Premium  |Enterprise | Enterprise Flash  | Azure Managed Redis (preview)
-|--------- |:------------------:|:----------:|:---------:|:---------:|:---------:|
-|Available | No          | No       |  Yes  | Yes (preview) |Yes
+| Tier      | Basic / Standard | Premium | Enterprise | Enterprise Flash | Azure Managed Redis (preview) |
+|-----------|:----------------:|:-------:|:----------:|:----------------:|:-----------------------------:|
+| Available | No               | No      | Yes        | Yes (preview)    | Yes                           |
 
 ## What are vector embeddings?
 
-### Concept
-
-Vector embeddings are a fundamental concept in machine learning and natural language processing that enable the representation of data, such as words, documents, or images as numerical vectors in a high-dimension vector space. The primary idea behind vector embeddings is to capture the underlying relationships and semantics of the data by mapping them to points in this vector space. That means converting your text or images into a sequence of numbers that represents the data, and then comparing the different number sequences. This allows complex data to be manipulated and analyzed mathematically, making it easier to perform tasks like similarity comparison, recommendation, and classification.
-
-<!-- TODO - Add image example -->
+Vector embeddings are a fundamental concept in machine learning and natural language processing that enable the representation of data, such as words, documents, or images, as numerical vectors in a high-dimensional vector space. The primary idea behind vector embeddings is to capture the underlying relationships and semantics of the data by mapping them to points in this vector space. That means converting your text or images into a sequence of numbers that represents the data, and then comparing the different number sequences. This allows complex data to be manipulated and analyzed mathematically, making it easier to perform tasks like similarity comparison, recommendation, and classification.
 
 Each machine learning model classifies data and produces the vector in a different manner. Furthermore, it's typically not possible to determine exactly what semantic meaning each vector dimension represents. But because the model is consistent between each block of input data, similar words, documents, or images have vectors that are also similar. For example, the words `basketball` and `baseball` have embeddings vectors much closer to each other than a word like `rainforest`.
 
@@ -81,9 +76,9 @@ Vector similarity search can be used in multiple applications. Some common use-c
 - **Semantic Caching**. Reduce the cost and latency of LLMs by caching LLM completions. LLM queries are compared using vector similarity. If a new query is similar enough to a previously cached query, the cached query is returned. [Semantic Caching example using LangChain](https://python.langchain.com/docs/integrations/llm_caching/#redis-cache)
 - **LLM Conversation Memory**. Persist conversation history with an LLM as embeddings in a vector database. Your application can use vector search to pull relevant history or "memories" into the response from the LLM. [LLM Conversation Memory example](https://github.com/continuum-llms/chatgpt-memory)
 
-## Why choose Azure Cache for Redis for storing and searching vectors?
+## Why choose Azure Redis for storing and searching vectors?
 
-Azure Cache for Redis can be used effectively as a vector database to store embeddings vectors and to perform vector similarity searches. Support for vector storage and search has been available in many key machine learning frameworks like:
+Azure Redis caches can be used effectively as a vector database to store embedding vectors and to perform vector similarity searches. Support for vector storage and search is available in many key machine learning frameworks like:
 
 - [Semantic Kernel](https://github.com/microsoft/semantic-kernel)
 - [LangChain](https://python.langchain.com/docs/integrations/vectorstores/redis)
@@ -117,7 +112,4 @@ There are multiple other solutions on Azure for vector storage and search. Other
 
 ## Related content
 
-The best way to get started with embeddings and vector search is to try it yourself!
-
-> [!div class="nextstepaction"]
-> [Tutorial: Conduct vector similarity search on Azure OpenAI embeddings using Azure Cache for Redis](./cache-tutorial-vector-similarity.md)
+- [Tutorial: Conduct vector similarity search on Azure OpenAI embeddings using Azure Cache for Redis](./cache-tutorial-vector-similarity.md)
diff --git a/articles/azure-cache-for-redis/cache-tutorial-vector-similarity.md b/articles/azure-cache-for-redis/cache-tutorial-vector-similarity.md
@@ -7,57 +7,59 @@ ms.collection: ce-skilling-ai-copilot
 ms.topic: tutorial
 ms.custom:
   - ignite-2024
-ms.date: 09/15/2023
+ms.date: 02/27/2025
 
 #CustomerIntent: As a developer, I want to develop some code using a sample so that I see an example of a vector similarity with an AI-based large language model.
 ---
 
-# Tutorial: Conduct vector similarity search on Azure OpenAI embeddings using Azure Cache for Redis
+# Tutorial: Conduct vector similarity search on Azure OpenAI embeddings using Azure Redis
 
 <!-- cawa - need to mention AMR in this tutorial -->
-In this tutorial, you'll walk through a basic vector similarity search use-case. You'll use embeddings generated by Azure OpenAI Service and the built-in vector search capabilities of the Enterprise tier of Azure Cache for Redis to query a dataset of movies to find the most relevant match.
+In this tutorial, you walk through a basic vector similarity search use case. You use embeddings generated by Azure OpenAI Service and the built-in vector search capabilities of the Enterprise tier of Azure Cache for Redis to query a dataset of movies to find the most relevant match.
 
-The tutorial uses the [Wikipedia Movie Plots dataset](https://www.kaggle.com/datasets/jrobischon/wikipedia-movie-plots) that features plot descriptions of over 35,000 movies from Wikipedia covering the years 1901 to 2017.
-The dataset includes a plot summary for each movie, plus metadata such as the year the film was released, the director(s), main cast, and genre. You'll follow the steps of the tutorial to generate embeddings based on the plot summary and use the other metadata to run hybrid queries.
+The tutorial uses the [Wikipedia Movie Plots dataset](https://www.kaggle.com/datasets/jrobischon/wikipedia-movie-plots) that features plot descriptions of over 35,000 movies from Wikipedia covering the years 1901 to 2017. The dataset includes a plot summary for each movie, plus metadata such as the year the film was released, the director(s), main cast, and genre. You follow the steps of the tutorial to generate embeddings based on the plot summary and use the other metadata to run hybrid queries.
 
 In this tutorial, you learn how to:
 
 > [!div class="checklist"]
-> * Create an Azure Cache for Redis instance configured for vector search
-> * Install Azure OpenAI and other required Python libraries.
-> * Download the movie dataset and prepare it for analysis.
-> * Use the **text-embedding-ada-002 (Version 2)** model to generate embeddings.
-> * Create a vector index in Azure Cache for Redis
-> * Use cosine similarity to rank search results.
-> * Use hybrid query functionality through [RediSearch](https://redis.io/docs/interact/search-and-query/) to prefilter the data and make the vector search even more powerful.
+> - Create an Azure Cache for Redis instance configured for vector search.
+> - Install Azure OpenAI and other required Python libraries.
+> - Download the movie dataset and prepare it for analysis.
+> - Use the **text-embedding-ada-002 (Version 2)** model to generate embeddings.
+> - Create a vector index in Azure Cache for Redis.
+> - Use cosine similarity to rank search results.
+> - Use hybrid query functionality through [RediSearch](https://redis.io/docs/interact/search-and-query/) to prefilter the data and make the vector search even more powerful.
 
 >[!IMPORTANT]
->This tutorial will walk you through building a Jupyter Notebook. You can follow this tutorial with a Python code file (.py) and get *similar* results, but you will need to add all of the code blocks in this tutorial into the `.py` file and execute once to see results. In other words, Jupyter Notebooks provides intermediate results as you execute cells, but this is not behavior you should expect when working in a Python code file.
+>This tutorial walks you through building a Jupyter Notebook. You can follow this tutorial with a Python code file (.py) and get _similar_ results, but you need to add all of the code blocks in this tutorial into the `.py` file and execute it once to see results. In other words, a Jupyter Notebook provides intermediate results as you execute cells, but you shouldn't expect this behavior when working in a Python code file.
 
 >[!IMPORTANT]
->If you would like to follow along in a completed Jupyter notebook instead, [download the Jupyter notebook file named *tutorial.ipynb*](https://github.com/Azure-Samples/azure-cache-redis-samples/tree/main/tutorial/vector-similarity-search-open-ai) and save it into the new *redis-vector* folder.
+>If you would like to follow along in a completed Jupyter notebook instead, [download the Jupyter notebook file named _tutorial.ipynb_](https://github.com/Azure-Samples/azure-cache-redis-samples/tree/main/tutorial/vector-similarity-search-open-ai) and save it into the new _redis-vector_ folder.
 
 ## Prerequisites
+<!-- Continue here. -->
 
-* An Azure subscription - [Create one for free](https://azure.microsoft.com/free/cognitive-services?azure-portal=true)
-* Access granted to Azure OpenAI in the desired Azure subscription. Currently, you must apply for access to Azure OpenAI. You can apply for access to Azure OpenAI by completing the form at <a href="https://aka.ms/oai/access" target="_blank">https://aka.ms/oai/access</a>.
-* <a href="https://www.python.org/" target="_blank">Python 3.8 or later version</a>
-* [Jupyter Notebooks](https://jupyter.org/) (optional)
-* An Azure OpenAI resource with the **text-embedding-ada-002 (Version 2)** model deployed. This model is currently only available in [certain regions](/azure/ai-services/openai/concepts/models#model-summary-table-and-region-availability). See the [resource deployment guide](/azure/ai-services/openai/how-to/create-resource) for instructions on how to deploy the model.
+- An Azure subscription - [Create one for free](https://azure.microsoft.com/free/cognitive-services?azure-portal=true)
+- Access granted to Azure OpenAI in the desired Azure subscription. Currently, you must apply for access to Azure OpenAI. You can apply for access to Azure OpenAI by completing the form at [https://aka.ms/oai/access](https://aka.ms/oai/access). <!-- I don't know if this is still true -->
+- [Python 3.8 or later](https://www.python.org/)
+- [Jupyter Notebooks](https://jupyter.org/) (optional)
+- An Azure OpenAI resource with the **text-embedding-ada-002 (Version 2)** model deployed. This model is currently only available in [certain regions](/azure/ai-services/openai/concepts/models#model-summary-table-and-region-availability). See the [resource deployment guide](/azure/ai-services/openai/how-to/create-resource) for instructions on how to deploy the model.
 
 ## Create an Azure Cache for Redis Instance
 
-1. Follow the [Quickstart: Create a Redis Enterprise cache](quickstart-create-redis-enterprise.md) guide. On the **Advanced** page, make sure that you've added the **RediSearch** module and have chosen the **Enterprise** Cluster Policy. All other settings can match the default described in the quickstart.
+1. Follow the [Quickstart: Create a Redis Enterprise cache](quickstart-create-redis-enterprise.md) guide, but make sure you add the RediSearch module when you create the cache.
+
+1. On the **Advanced** page, make sure that you've added the **RediSearch** module and have chosen the **Enterprise** Cluster Policy. All other settings can match the default described in the quickstart.
 
    It takes a few minutes for the cache to create. You can move on to the next step in the meantime.
 
-:::image type="content" source="media/cache-create/enterprise-tier-basics.png" alt-text="Screenshot showing the Enterprise tier Basics tab filled out.":::
+    :::image type="content" source="media/cache-create/enterprise-tier-basics.png" alt-text="Screenshot showing the Enterprise tier Basics tab filled out.":::
 
 ## Set up your development environment
 
-1. Create a folder on your local computer named *redis-vector* in the location where you typically save your projects.
+1. Create a folder on your local computer named _redis-vector_ in the location where you typically save your projects.
 
-1. Create a new python file (*tutorial.py*) or Jupyter notebook (*tutorial.ipynb*) in the folder.
+1. Create a new Python file (_tutorial.py_) or Jupyter notebook (_tutorial.ipynb_) in the folder.
 
 1. Install the required Python packages:
 
@@ -71,9 +73,9 @@ In this tutorial, you learn how to:
 
 1. Sign in or register with Kaggle. Registration is required to download the file.
 
-1. Select the **Download** link on Kaggle to download the *archive.zip* file.
+1. Select the **Download** link on Kaggle to download the _archive.zip_ file.
 
-1. Extract the *archive.zip* file and move the *wiki_movie_plots_deduped.csv* into the *redis-vector* folder.
+1. Extract the _archive.zip_ file and move the _wiki_movie_plots_deduped.csv_ into the _redis-vector_ folder.
 
 ## Import libraries and set up connection information
 
@@ -170,7 +172,6 @@ Next, you'll read the csv file into a pandas DataFrame.
    def normalize_text(s, sep_token = " \n "):
        s = re.sub(r'\s+',  ' ', s).strip()
        s = re.sub(r". ,","",s)
-       # remove all instances of multiple spaces
        s = s.replace("..",".")
        s = s.replace(". .",".")
        s = s.replace("\n", "")
@@ -207,7 +208,7 @@ Next, you'll read the csv file into a pandas DataFrame.
 
 ## Load DataFrame into LangChain
 
-Load the DataFrame into LangChain using the `DataFrameLoader` class. Once the data is in LangChain documents, it's far easier to use LangChain libraries to generate embeddings and conduct similarity searches. Set *Plot* as the `page_content_column` so that embeddings are generated on this column.
+Load the DataFrame into LangChain using the `DataFrameLoader` class. Once the data is in LangChain documents, it's far easier to use LangChain libraries to generate embeddings and conduct similarity searches. Set _Plot_ as the `page_content_column` so that embeddings are generated on this column.
 
 1. Add the following code to a new code cell and execute it:
 
@@ -338,9 +339,9 @@ With Azure Cache for Redis and Azure OpenAI Service, you can use embeddings and
 
 ## Related Content
 
-* [Learn more about Azure Cache for Redis](cache-overview.md)
-* Learn more about Azure Cache for Redis [vector search capabilities](./cache-overview-vector-similarity.md)
-* Learn more about [embeddings generated by Azure OpenAI Service](/azure/ai-services/openai/concepts/understand-embeddings)
-* Learn more about [cosine similarity](https://en.wikipedia.org/wiki/Cosine_similarity)
-* [Read how to build an AI-powered app with OpenAI and Redis](https://techcommunity.microsoft.com/blog/azuredevcommunityblog/vector-similarity-search-with-azure-cache-for-redis-enterprise/3822059)
-* [Build a Q&A app with semantic answers](https://github.com/ruoccofabrizio/azure-open-ai-embeddings-qna)
+- [Learn more about Azure Cache for Redis](cache-overview.md)
+- Learn more about Azure Cache for Redis [vector search capabilities](./cache-overview-vector-similarity.md)
+- Learn more about [embeddings generated by Azure OpenAI Service](/azure/ai-services/openai/concepts/understand-embeddings)
+- Learn more about [cosine similarity](https://en.wikipedia.org/wiki/Cosine_similarity)
+- [Read how to build an AI-powered app with OpenAI and Redis](https://techcommunity.microsoft.com/blog/azuredevcommunityblog/vector-similarity-search-with-azure-cache-for-redis-enterprise/3822059)
+- [Build a Q&A app with semantic answers](https://github.com/ruoccofabrizio/azure-open-ai-embeddings-qna)
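
Both articles in this diff rely on cosine similarity to rank search results. The following minimal sketch shows that ranking step, using made-up three-dimensional vectors in place of real embeddings. The names `toy_embeddings`, `cosine_similarity`, and `rank_by_similarity` are hypothetical and chosen for illustration; they aren't part of the tutorial code or of any library. Real **text-embedding-ada-002** vectors have far more dimensions, and in the tutorial the ranking is performed by the vector index in Redis, not in application code.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Hypothetical toy "embeddings", invented for this example only.
toy_embeddings = {
    "basketball": [0.9, 0.8, 0.1],
    "baseball": [0.8, 0.9, 0.2],
    "rainforest": [0.1, 0.2, 0.9],
}


def rank_by_similarity(query, embeddings):
    """Rank every other word by cosine similarity to the query word."""
    query_vec = embeddings[query]
    scores = [
        (word, cosine_similarity(query_vec, vec))
        for word, vec in embeddings.items()
        if word != query
    ]
    # Highest similarity first.
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


ranking = rank_by_similarity("basketball", toy_embeddings)
for word, score in ranking:
    print(f"{word}: {score:.3f}")
```

With these toy vectors, `baseball` ranks above `rainforest` for the query `basketball`, which mirrors the example given in the overview article.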