Merge pull request #124 from HeidiSteen/main

HeidiSteen · web-flow · commit eaa4e4205225 · 2024-09-30T21:50:57.000-07:00
Revised script for RAG tutorial
diff --git a/Tutorial-RAG/Tutorial-rag.ipynb b/Tutorial-RAG/Tutorial-rag.ipynb
@@ -6,16 +6,20 @@
    "source": [
     "# Build a RAG solution in Azure AI Search\n",
     "\n",
-    "This notebook builds an indexing pipeline in Azure AI Search for RAG scenarios. \n",
+    "This notebook provides the sample script for the indexing pipeline in [Build a RAG solution in Azure AI Search](https://learn.microsoft.com/azure/search/tutorial-rag-build-solution). If you need more information than the readme provides, refer to that article.\n",
     "\n",
-    "The scripts rely on API keys for most of the connections, with the exception of Microsoft Entra ID authentication and role authorization for Azure AI Search connections to Azure Blob Storage. For the data connection to blob storage:\n",
+    "Steps in this notebook include:\n",
     "\n",
-    "+ Azure AI Search must have a system-assigned managed identity.\n",
-    "+ The system identity must have Storage Blob Data Reader rights on Azure Storage.\n",
+    "- Set up the environment\n",
+    "- Set up the Azure resources used in the pipeline\n",
+    "- Create an index, data source, skillset, and indexer on Azure AI Search\n",
+    "- Send a query to the search engine to check results\n",
+    "- Send a query to an LLM to chat with your data\n",
+    "- Revisit the index schema and query logic to improve relevance\n",
     "\n",
-    "In a future update, this notebook will use Microsoft Entra ID and role assignments for all connections.\n",
+    "Sample data is a collection of PDF pages from NASA's Earth Book that you load into Azure Storage and retrieve during indexing.\n",
     "\n",
-    "The scripts and steps in this notebook are fully documented in [Tutorial: Build a RAG solution in Azure AI Search](https://learn.microsoft.com/azure/search/tutorial-rag-build-solution). If you need more guidance than the readme provides, please refer to the article."
+    "This tutorial assumes embedding and chat models on Azure OpenAI so that you can use the integrated vectorization capabilities of Azure AI Search. You can use a different provider, but you might need custom skills or a different approach for indexing and embedding your content."
    ]
   },
   {
@@ -24,59 +28,129 @@
    "source": [
     "## Prerequisites\n",
     "\n",
-    "- [Azure Storage](https://learn.microsoft.com/azure/storage/common/storage-account-create), with a blob container named \"nasa-ebook-pdfs-all\", containing PDFs that originate from the NASA Earth Book. Upload the [PDFs from this folder](https://github.com/Azure-Samples/azure-search-sample-data/tree/main/nasa-e-book/earth_book_2019_text_pages) into a container on Azure Storage.\n",
+    "You need the following Azure resources to run all of the scripts in this notebook.\n",
     "\n",
-    "- [Azure OpenAI](https://learn.microsoft.com/azure/ai-services/openai/how-to/create-resource)\n",
+    "- [Azure Storage](https://learn.microsoft.com/azure/storage/common/storage-account-create), general purpose account, used for providing the PDFs.\n",
     "\n",
-    "  - Deploy a chat model (GPT-3.5-Turbo, GPT-4, or equivalent LLM).\n",
-    "  - Deploy an embedding model (text-embedding-ada-002, text-embedding-3-large, text-embedding-3-small)\n",
+    "- [Azure OpenAI](https://learn.microsoft.com/azure/ai-services/openai/how-to/create-resource) provides the embedding and chat models.\n",
     "\n",
-    "- [Azure AI Services multiservice account](https://learn.microsoft.com/azure/ai-services/multi-service-resource), in the same region as Azure AI Search. This resource is used for the Entity Recognition skill that detects locations in your content.\n",
+    "- [Azure AI Services multiservice account](https://learn.microsoft.com/azure/ai-services/multi-service-resource), in the same region as Azure AI Search, used for recognizing location entities in the Earth Book.\n",
     "\n",
-    "- [Azure AI Search](https://learn.microsoft.com/azure/search/search-create-service-portal)\n",
-    "\n",
-    "  - Basic tier or higher is recommended.\n",
-    "  - Choose the same region as Azure OpenAI.\n",
-    "  - Enable semantic ranking.\n",
-    "  - Enable role-based access control.\n",
-    "  - Enable a system identity for Azure AI Search.\n",
-    "  \n",
-    "Make sure you know the name of the deployed models, and have the endpoints for all Azure resources at hand. You will provide this information in the steps that follow."
+    "- [Azure AI Search](https://learn.microsoft.com/azure/search/search-create-service-portal). Basic tier or higher is recommended. Choose the same region as Azure OpenAI and Azure AI multiservice.\n"
    ]
   },
   {
-   "cell_type": "code",
-   "execution_count": 77,
+   "cell_type": "markdown",
    "metadata": {},
-   "outputs": [],
    "source": [
-    "! pip install -r tutorial-rag-requirements.txt --quiet"
+    "## Sign in to Azure\n",
+    "\n",
+    "You might not need this step, but if downstream connections fail with a 401 during indexer pipeline execution, it could be because you're using the wrong tenant or subscription. You can avoid this issue by signing in from the command line, explicitly setting the tenant ID and choosing the right subscription.\n",
+    "\n",
+    "This section assumes you have the [Azure CLI](https://learn.microsoft.com/cli/azure/authenticate-azure-cli-interactively).\n",
+    "\n",
+    "1. Open a command line prompt.\n",
+    "\n",
+    "1. Run this command to get a list of Azure tenants: `az account tenant list`\n",
+    "\n",
+    "1. If you have multiple tenants, set the tenant: `az login --tenant <YOUR-TENANT-ID>`\n",
+    "\n",
+    "If you have multiple subscriptions, a list is provided so that you can select one."
    ]
   },
   {
-   "cell_type": "code",
-   "execution_count": 96,
+   "cell_type": "markdown",
    "metadata": {},
-   "outputs": [],
    "source": [
-    "# Set endpoints and API keys for Azure services\n",
-    "AZURE_SEARCH_SERVICE: str = \"PUT YOUR SEARCH SERVICE URL HERE\"\n",
-    "AZURE_SEARCH_KEY: str = \"PUT YOUR SEARCH SERVICE ADMIN KEY HERE\"\n",
-    "AZURE_OPENAI_ACCOUNT: str = \"PUT YOUR AZURE OPENAI ACCOUNT URL HERE\"\n",
-    "AZURE_OPENAI_KEY: str = \"PUT YOUR AZURE OPENAI KEY HERE\"\n",
-    "AZURE_AI_MULTISERVICE_ACCOUNT: str = \"PUT YOUR AZURE AI MULTISERVICE ACCOUNT URL HERE\"\n",
-    "AZURE_AI_MULTISERVICE_KEY: str = \"PUT YOUR AZURE AI MULTISERVICE KEY HERE\"\n",
-    "AZURE_STORAGE_CONNECTION: str = \"PUT YOUR AZURE STORAGE CONNECTION STRING HERE\"\n",
+    "## Set up Azure resources using the Azure portal\n",
     "\n",
-    "# Example connection string for a search service managed identity connection:\n",
-    "# \"ResourceId=/subscriptions/FAKE-SUBCRIPTION=ID/resourceGroups/FAKE-RESOURCE-GROUP/providers/Microsoft.Storage/storageAccounts/FAKE-ACCOUNT;\""
+    "We recommend using the Azure portal for setting up resources.\n",
+    "\n",
+    "You must be a subscription **Owner** or **User Access Administrator** to create role assignments. If you don't have permission to assign roles, you can use API keys instead. If you're using keys, you can skip the steps that enable system-assigned managed identities.\n",
+    "\n",
+    "1. Download the sample PDF files from [nasa-e-book/earth_book_2019_text_pages](https://github.com/Azure-Samples/azure-search-sample-data/tree/main/nasa-e-book/earth_book_2019_text_pages).\n",
+    "\n",
+    "1. Sign in to the [Azure portal](https://portal.azure.com).\n",
+    "\n",
+    "1. Make sure Azure AI Search, Azure OpenAI, and Azure AI multiservice resources are in the same region.\n",
+    "\n",
+    "### Configure Azure Storage\n",
+    "\n",
+    "1. On the Azure Storage left menu, select **Storage browser** > **Blob containers**, and then **Add container**.\n",
+    "\n",
+    "1. Name the container *nasa-ebooks-pdfs-all*.\n",
+    "\n",
+    "1. Upload the PDFs to the container.\n",
+    "\n",
+    "1. On the left menu, select **Settings** > **Identity** and turn on system assigned managed identity.\n",
+    "\n",
+    "### Configure Azure AI Search\n",
+    "\n",
+    "1. On the Azure AI Search left menu, select **Settings** > **Semantic ranker** and enable the free plan that authorizes 1,000 requests at no charge.\n",
+    "\n",
+    "1. On the left menu, select **Settings** > **Keys** and turn on role-based access control or \"both\".\n",
+    "\n",
+    "1. On the left menu, select **Settings** > **Identity** and turn on system assigned managed identity.\n",
+    "\n",
+    "### Configure Azure OpenAI\n",
+    "\n",
+    "Deploy the following models on Azure OpenAI:\n",
+    "\n",
+    "- text-embedding-ada-002 for embeddings\n",
+    "- gpt-35-turbo for chat completion\n",
+    "\n",
+    "You must have [**Cognitive Services OpenAI Contributor**](https://learn.microsoft.com/azure/ai-services/openai/how-to/role-based-access-control#cognitive-services-openai-contributor) or higher to deploy models in Azure OpenAI.\n",
+    "\n",
+    "1. Go to [Azure OpenAI Studio](https://oai.azure.com/).\n",
+    "\n",
+    "1. Select **Deployments** on the left menu.\n",
+    "\n",
+    "1. Select **Deploy model** > **Deploy base model**.\n",
+    "\n",
+    "1. Select **text-embedding-ada-002** from the dropdown list and confirm the selection.\n",
+    "\n",
+    "1. Specify a deployment name. We recommend \"text-embedding-ada-002\".\n",
+    "\n",
+    "1. Accept the defaults.\n",
+    "\n",
+    "1. Select **Deploy**.\n",
+    "\n",
+    "1. Repeat the previous steps for **gpt-35-turbo**.\n",
+    "\n",
+    "Make a note of the model names and endpoint. Embedding skills and vectorizers assemble the full endpoint internally, so you only need the resource URI. For example, given `https://MY-FAKE-ACCOUNT.openai.azure.com/openai/deployments/text-embedding-ada-002/embeddings?api-version=2023-05-15`, the endpoint you should provide in skill and vectorizer definitions is `https://MY-FAKE-ACCOUNT.openai.azure.com`.\n",
+    "\n",
+    "### Configure search engine role-based access to Azure Storage\n",
+    "\n",
+    "1. Sign in to the [Azure portal](https://portal.azure.com) and find your storage account.\n",
+    "\n",
+    "1. On the left menu, select **Access control (IAM)**.\n",
+    "\n",
+    "1. Add a role for **Storage Blob Data Reader**, assigned to the search service system-managed identity.\n",
+    "\n",
+    "### Configure search engine role-based access to Azure models\n",
+    "\n",
+    "Assign yourself *and* the search service identity permissions on Azure OpenAI. The code for this tutorial runs locally, so requests to Azure OpenAI originate from your system. Also, embedding requests and query responses from the search engine are passed to Azure OpenAI. For these reasons, both you and the search service need permissions on Azure OpenAI.\n",
+    "\n",
+    "1. Sign in to the [Azure portal](https://portal.azure.com) and find your Azure OpenAI resource.\n",
+    "\n",
+    "1. On the left menu, select **Access control (IAM)**.\n",
+    "\n",
+    "1. Add a role for [**Cognitive Services OpenAI User**](https://learn.microsoft.com/azure/ai-services/openai/how-to/role-based-access-control#cognitive-services-openai-userpermissions).\n",
+    "\n",
+    "1. Select **Managed identity** and then select **Members**. Find the system-managed identity for your search service in the dropdown list.\n",
+    "\n",
+    "1. Next, select **User, group, or service principal** and then select **Members**. Search for your user account and then select it from the dropdown list.\n",
+    "\n",
+    "1. Select **Review and Assign** to create the role assignments.\n",
+    "\n",
+    "This step concludes service provisioning in the Azure portal. In the next section, you switch to Visual Studio Code and a local environment."
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## Create a virtual environment\n",
+    "## Create a virtual environment in Visual Studio Code\n",
     "\n",
     "Create a virtual environment so that you can install the dependencies in isolation.\n",
     "\n",
@@ -93,7 +167,25 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## Create an index"
+    "## Install packages"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "! pip install -r tutorial-rag-requirements.txt --quiet"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Set endpoints\n",
+    "\n",
+    "Provide the endpoints you collected in a previous step. You can leave the API keys empty if you enabled role-based authentication. Otherwise, if you can't use roles, provide API keys for each resource."
    ]
   },
   {
@@ -102,7 +194,36 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "from azure.core.credentials import AzureKeyCredential\n",
+    "# Set endpoints and API keys for Azure services\n",
+    "AZURE_SEARCH_SERVICE: str = \"PUT YOUR SEARCH SERVICE URL HERE\"\n",
+    "AZURE_SEARCH_KEY: str = \"DELETE IF USING ROLES, OTHERWISE PUT YOUR SEARCH SERVICE ADMIN KEY HERE\"\n",
+    "AZURE_OPENAI_ACCOUNT: str = \"PUT YOUR AZURE OPENAI ACCOUNT URL HERE\"\n",
+    "AZURE_OPENAI_KEY: str = \"DELETE IF USING ROLES, OTHERWISE PUT YOUR AZURE OPENAI KEY HERE\"\n",
+    "AZURE_AI_MULTISERVICE_ACCOUNT: str = \"PUT YOUR AZURE AI MULTISERVICE ACCOUNT URL HERE\"\n",
+    "AZURE_AI_MULTISERVICE_KEY: str = \"PUT YOUR AZURE AI MULTISERVICE KEY HERE. ROLES ARE USED TO CONNECT. KEY IS USED FOR BILLING.\"\n",
+    "AZURE_STORAGE_CONNECTION: str = \"PUT YOUR AZURE STORAGE CONNECTION STRING HERE (see example below for syntax)\"\n",
+    "\n",
+    "# Example connection string for a search service managed identity connection:\n",
+    "# \"ResourceId=/subscriptions/FAKE-SUBSCRIPTION-ID/resourceGroups/FAKE-RESOURCE-GROUP/providers/Microsoft.Storage/storageAccounts/FAKE-ACCOUNT;\""
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Create an index\n",
+    "\n",
+    "This is the index schema used for [Build a RAG solution in Azure AI Search](https://learn.microsoft.com/azure/search/tutorial-rag-build-solution)."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from azure.identity import DefaultAzureCredential\n",
+    "from azure.identity import get_bearer_token_provider\n",
     "from azure.search.documents.indexes import SearchIndexClient\n",
     "from azure.search.documents.indexes.models import (\n",
     "    SearchField,\n",
@@ -115,11 +236,11 @@
     "    SearchIndex\n",
     ")\n",
     "\n",
-    "AZURE_SEARCH_CREDENTIAL = AzureKeyCredential(AZURE_SEARCH_KEY)\n",
+    "credential = DefaultAzureCredential()\n",
     "\n",
     "# Create a search index  \n",
     "index_name = \"py-rag-tutorial-idx\"\n",
-    "index_client = SearchIndexClient(endpoint=AZURE_SEARCH_SERVICE, credential=AZURE_SEARCH_CREDENTIAL)  \n",
+    "index_client = SearchIndexClient(endpoint=AZURE_SEARCH_SERVICE, credential=credential)  \n",
     "fields = [\n",
     "    SearchField(name=\"parent_id\", type=SearchFieldDataType.String),  \n",
     "    SearchField(name=\"title\", type=SearchFieldDataType.String),\n",
@@ -148,8 +269,7 @@
     "            parameters=AzureOpenAIVectorizerParameters(  \n",
     "                resource_url=AZURE_OPENAI_ACCOUNT,  \n",
     "                deployment_name=\"text-embedding-ada-002\",\n",
-    "                model_name=\"text-embedding-ada-002\",\n",
-    "                api_key=AZURE_OPENAI_KEY\n",
+    "                model_name=\"text-embedding-ada-002\"\n",
     "            ),\n",
     "        ),  \n",
     "    ], \n",
@@ -183,7 +303,7 @@
     ")\n",
     "\n",
     "# Create a data source \n",
-    "indexer_client = SearchIndexerClient(endpoint=AZURE_SEARCH_SERVICE, credential=AZURE_SEARCH_CREDENTIAL)\n",
+    "indexer_client = SearchIndexerClient(endpoint=AZURE_SEARCH_SERVICE, credential=credential)\n",
     "container = SearchIndexerDataContainer(name=\"nasa-ebooks-pdfs-all\")\n",
     "data_source_connection = SearchIndexerDataSourceConnection(\n",
     "    name=\"py-rag-tutorial-ds\",\n",
@@ -200,7 +320,9 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## Create a skillset"
+    "## Create a skillset\n",
+    "\n",
+    "This skillset chunks and embeds data. It also uses entity recognition to detect location entities."
    ]
   },
   {
@@ -299,7 +421,7 @@
     "    cognitive_services_account=cognitive_services_account\n",
     ")\n",
     "  \n",
-    "client = SearchIndexerClient(endpoint=AZURE_SEARCH_SERVICE, credential=AZURE_SEARCH_CREDENTIAL)  \n",
+    "client = SearchIndexerClient(endpoint=AZURE_SEARCH_SERVICE, credential=credential)  \n",
     "client.create_or_update_skillset(skillset)  \n",
     "print(f\"{skillset.name} created\")  "
    ]
@@ -308,7 +430,9 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## Create an indexer"
+    "## Create an indexer\n",
+    "\n",
+    "The indexer drives the pipeline. You can create an indexer in a disabled state, but the default is for the indexer to run as soon as you send the request."
    ]
   },
   {
@@ -339,7 +463,7 @@
     ")  \n",
     "\n",
     "# Create and run the indexer  \n",
-    "indexer_client = SearchIndexerClient(endpoint=AZURE_SEARCH_SERVICE, credential=AZURE_SEARCH_CREDENTIAL)  \n",
+    "indexer_client = SearchIndexerClient(endpoint=AZURE_SEARCH_SERVICE, credential=credential)  \n",
     "indexer_result = indexer_client.create_or_update_indexer(indexer)  \n",
     "\n",
     "print(f' {indexer_name} is created and running. Give the indexer a few minutes before running a query.')  "
@@ -349,7 +473,9 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## Check results"
+    "## Check results\n",
+    "\n",
+    "After waiting several minutes, send a request to the search engine. There is no chat or generative AI at this point. The results are verbatim content from your search index."
    ]
   },
   {
@@ -364,7 +490,7 @@
     "# Vector Search using text-to-vector conversion of the querystring\n",
     "query = \"where are NASA's headquarters located?\"  \n",
     "\n",
-    "search_client = SearchClient(endpoint=AZURE_SEARCH_SERVICE, credential=AZURE_SEARCH_CREDENTIAL, index_name=index_name)\n",
+    "search_client = SearchClient(endpoint=AZURE_SEARCH_SERVICE, credential=credential, index_name=index_name)\n",
     "vector_query = VectorizableTextQuery(text=query, k_nearest_neighbors=1, fields=\"text_vector\", exhaustive=True)\n",
     "  \n",
     "results = search_client.search(  \n",
@@ -385,7 +511,9 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "## Search using a chat model"
+    "## Search using a chat model\n",
+    "\n",
+    "This script sends a query, the query response, and a prompt to an LLM for chat completion. This time, the response is created using generative AI."
    ]
   },
   {
@@ -396,22 +524,21 @@
    "source": [
     "# Import libraries\n",
     "from azure.search.documents import SearchClient\n",
-    "from azure.core.credentials import AzureKeyCredential\n",
     "from openai import AzureOpenAI\n",
     "\n",
-    "# Set up clients and specify the chat model\n",
+    "token_provider = get_bearer_token_provider(credential, \"https://cognitiveservices.azure.com/.default\")\n",
     "openai_client = AzureOpenAI(\n",
     "     api_version=\"2024-06-01\",\n",
     "     azure_endpoint=AZURE_OPENAI_ACCOUNT,\n",
-    "     api_key=AZURE_OPENAI_KEY\n",
+    "     azure_ad_token_provider=token_provider\n",
     " )\n",
     "\n",
     "deployment_name = \"gpt-35-turbo\"\n",
     "\n",
     "search_client = SearchClient(\n",
     "     endpoint=AZURE_SEARCH_SERVICE,\n",
     "     index_name=index_name,\n",
-    "     credential=AZURE_SEARCH_CREDENTIAL\n",
+    "     credential=credential\n",
     " )\n",
     "\n",
     "# Provide instructions to the model\n",