Revisions per Matt's sample update

HeidiSteen · HeidiSteen · commit 0b3b5bcf6e0e · 2024-10-04T12:36:14.000-07:00
diff --git a/articles/search/tutorial-rag-build-solution-maximize-relevance.md b/articles/search/tutorial-rag-build-solution-maximize-relevance.md
@@ -41,9 +41,78 @@ This tutorial updates the search index created by the [indexing pipeline](tutori
 
 The [sample notebook](https://github.com/Azure-Samples/azure-search-python-samples/blob/main/Tutorial-RAG/Tutorial-rag.ipynb) includes an updated index and query request.
 
+## Run a baseline query for comparison
+
+Let's start with a new query, "Are there any cloud formations specific to oceans and large bodies of water?".
+
+To compare outcomes after adding relevance features, run the query against the existing index schema, before you add semantic ranking or a scoring profile.
+
+```python
+from azure.search.documents import SearchClient
+from openai import AzureOpenAI
+
+token_provider = get_bearer_token_provider(credential, "https://cognitiveservices.azure.com/.default")
+openai_client = AzureOpenAI(
+     api_version="2024-06-01",
+     azure_endpoint=AZURE_OPENAI_ACCOUNT,
+     azure_ad_token_provider=token_provider
+ )
+
+deployment_name = "gpt-4o"
+
+search_client = SearchClient(
+     endpoint=AZURE_SEARCH_SERVICE,
+     index_name=index_name,
+     credential=credential
+ )
+
+GROUNDED_PROMPT="""
+You are an AI assistant that helps users learn from the information found in the source material.
+Answer the query using only the sources provided below.
+Use bullets if the answer has multiple points.
+If the answer is longer than 3 sentences, provide a summary.
+Answer ONLY with the facts listed in the list of sources below. Cite your source when you answer the question
+If there isn't enough information below, say you don't know.
+Do not generate answers that don't use the sources below.
+Query: {query}
+Sources:\n{sources}
+"""
+
+# Focused query on cloud formations and bodies of water
+query="Are there any cloud formations specific to oceans and large bodies of water?"
+vector_query = VectorizableTextQuery(text=query, k_nearest_neighbors=50, fields="text_vector")
+
+search_results = search_client.search(
+    search_text=query,
+    vector_queries= [vector_query],
+    select=["title", "chunk", "locations"],
+    top=5,
+)
+
+sources_formatted = "=================\n".join([f'TITLE: {document["title"]}, CONTENT: {document["chunk"]}, LOCATIONS: {document["locations"]}' for document in search_results])
+
+response = openai_client.chat.completions.create(
+    messages=[
+        {
+            "role": "user",
+            "content": GROUNDED_PROMPT.format(query=query, sources=sources_formatted)
+        }
+    ],
+    model=deployment_name
+)
+
+print(response.choices[0].message.content)
+```
+
+Output from this request might look like the following example.
+
+```
+Yes, there are cloud formations specific to oceans and large bodies of water. A notable example is "cloud streets," which are parallel rows of clouds that form over the Bering Strait in the Arctic Ocean. These cloud streets occur when wind blows from a cold surface like sea ice over warmer, moister air near the open ocean, leading to the formation of spinning air cylinders. Clouds form along the upward cycle of these cylinders, while skies remain clear along the downward cycle (Source: page-21.pdf).
+```
+
 ## Update the index for semantic ranking and scoring profiles
 
-In a previous tutorial, you [designed an index schema](tutorial-rag-build-solution-index-schema.md) for RAG workloads. We purposely omitted relevance enhancements from that schema so that you could focus on the fundamentals. Deferring relevance to a separate exercise also gives you a before-and-after comparison of the quality of search results after the updates are made.
+In a previous tutorial, you [designed an index schema](tutorial-rag-build-solution-index-schema.md) for RAG workloads. We purposely omitted relevance enhancements from that schema so that you could focus on the fundamentals. Deferring relevance to a separate exercise gives you a before-and-after comparison of the quality of search results after the updates are made.
 
 1. Update the import statements to include classes for semantic ranking and scoring profiles.
 
@@ -138,7 +207,7 @@ openai_client = AzureOpenAI(
      azure_ad_token_provider=token_provider
  )
 
-deployment_name = "gpt-35-turbo"
+deployment_name = "gpt-4o"
 
 search_client = SearchClient(
      endpoint=AZURE_SEARCH_SERVICE,
@@ -160,8 +229,8 @@ Sources:\n{sources}
 """
 
 # Queries are unchanged in this update
-query="how much of earth is covered by water"
-vector_query = VectorizableTextQuery(text=query, k_nearest_neighbors=1, fields="text_vector", exhaustive=True)
+query="Are there any cloud formations specific to oceans and large bodies of water?"
+vector_query = VectorizableTextQuery(text=query, k_nearest_neighbors=50, fields="text_vector")
 
 # Add query_type semantic and semantic_configuration_name
 # Add scoring_profile and scoring_parameters
@@ -175,7 +244,7 @@ search_results = search_client.search(
     select="title, chunk, locations",
     top=5,
 )
-sources_formatted = "\n".join([f'{document["title"]}:{document["chunk"]}:{document["locations"]}' for document in search_results])
+sources_formatted = "=================\n".join([f'TITLE: {document["title"]}, CONTENT: {document["chunk"]}, LOCATIONS: {document["locations"]}' for document in search_results])
 
 response = openai_client.chat.completions.create(
     messages=[
@@ -190,6 +259,24 @@ response = openai_client.chat.completions.create(
 print(response.choices[0].message.content)
 ```
 
+Output from a semantically ranked and boosted query might look like the following example.
+
+```
+Yes, there are specific cloud formations influenced by oceans and large bodies of water:
+
+- **Stratus Clouds Over Icebergs**: Low stratus clouds can frame holes over icebergs, such as Iceberg A-56 in the South Atlantic Ocean, likely due to thermal instability caused by the iceberg (source: page-39.pdf).
+
+- **Undular Bores**: These are wave structures in the atmosphere created by the collision of cool, dry air from a continent with warm, moist air over the ocean, as seen off the coast of Mauritania (source: page-23.pdf).
+
+- **Ship Tracks**: These are narrow clouds formed by water vapor condensing around tiny particles from ship exhaust. They are observed over the oceans, such as in the Pacific Ocean off the coast of California (source: page-31.pdf).
+
+These specific formations are influenced by unique interactions between atmospheric conditions and the presence of large water bodies or objects within them.
+```
+
+Adding semantic ranking and scoring profiles positively affects the response from the LLM by promoting results that meet scoring criteria and are semantically relevant. 
+
+Now that you have a better understanding of index and query design, let's move on to optimizing for speed and concision. We revisit the schema definition to implement quantization and storage reduction, but the rest of the pipeline and models remain intact.
+
 <!-- ## Update queries for minimum thresholds ** NOT AVAILABLE IN PYTHON SDK
 
 Keyword search only returns results if there's match found in the index, up to a maximum of 50 results by default. In contrast, vector search returns `k`-results every time, even if the matching vectors aren't a close match.
diff --git a/articles/search/tutorial-rag-build-solution-query.md b/articles/search/tutorial-rag-build-solution-query.md
@@ -234,6 +234,9 @@ The NASA Earth book appears to showcase various locations on Earth captured thro
 (Source: page-43.pdf, page-147.pdf, page-153.pdf, page-39.pdf)
 ```
 
+> [!TIP]
+> If you're continuing on with the tutorial, remember to restore the prompt to its previous value (`You are an AI assistant that helps users learn from the information found in the source material`).
+
 Changing parameters and prompts affects the response from the LLM. As you explore on your own, keep the following tips in mind:
 
 - Raising the `top` value can exhaust available quota on the model. If there's no quota, an error message is returned or the model might return "I don't know".