Skip to content

Commit d54969f

Browse files
committed
checkpoint
1 parent a61af64 commit d54969f

File tree

3 files changed

+18
-3
lines changed

3 files changed

+18
-3
lines changed

articles/search/tutorial-rag-build-solution-index-schema.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -45,7 +45,7 @@ Chunks are the focus of the schema, and each chunk is the defining element of a
4545

4646
In addition to structural considerations, like chunked content, you also want to consider the substance of your content because it also informs what fields are indexed.
4747

48-
In this tutorial, sample data consists of PDFs and content from the NASA Earth Book. This content is descriptive and informative, with numerous references to geographies, countries, and areas across the world. To capture this information in our index and potentially use it in queries, we include skills in our indexing pipeline that recognize and extract this information, loading it into a searchable and filterable `locations` field.
48+
In this tutorial, sample data consists of PDFs and content from the NASA Earth Book. This content is descriptive and informative, with numerous references to geographies, countries, and areas across the world. To capture this information in our index and potentially use it in queries, we include skills in our indexing pipeline that recognize and extract this information, loading it into a searchable and filterable `locations` field. Adding structured content to your index gives you more options for filtering, relevance tuning, and richer answers.
4949

5050
The original ebook is large, over 100 pages and 35 MB in size. We broke it up into smaller PDFs, one per page of text, to stay under the REST API payload limit of 16 MB per API call.
5151

@@ -57,7 +57,7 @@ Chunked content typically derives from a larger document. And although the schem
5757

5858
An inflection point in schema design is whether to have two indexes for parent and child/chunked content, or a single index that repeats parent elements for each chunk.
5959

60-
In this tutorial, because all of the chunks of text originate from a single parent (NASA Earth Book), you don't need a separate index dedicated to up level parent fields. However, if you index from multiple parent PDFs, you might want a parent-child index pair to capture level-specific fields and then send lookup queries to the parent index to retrieve those fields relevant to each chunk.
60+
In this tutorial, because all of the chunks of text originate from a single parent (NASA Earth Book), you don't need a separate index dedicated to up level the parent fields. However, if you're indexing from multiple parent PDFs, you might want a parent-child index pair to capture level-specific fields and then send [lookup queries](https://learn.microsoft.com/rest/api/searchservice/documents/get) to the parent index to retrieve those fields relevant to each chunk.
6161

6262
### Checklist of schema considerations
6363

articles/search/tutorial-rag-build-solution-models.md

Lines changed: 15 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -102,13 +102,21 @@ This tutorial series uses the following models and model providers:
102102
You must have [**Cognitive Services OpenAI Contributor**]( /azure/ai-services/openai/how-to/role-based-access-control#cognitive-services-openai-contributor) or higher to deploy models in Azure OpenAI.
103103

104104
1. Go to [Azure OpenAI Studio](https://oai.azure.com/).
105+
105106
1. Select **Deployments** on the left menu.
107+
106108
1. Select **Deploy model** > **Deploy base model**.
109+
107110
1. Select **text-embedding-ada-02** from the dropdown list and confirm the selection.
111+
108112
1. Specify a deployment name. We recommend "text-embedding-ada-002".
113+
109114
1. Accept the defaults.
115+
110116
1. Select **Deploy**.
117+
111118
1. Repeat the previous steps for **gpt-35-turbo**.
119+
112120
1. Make a note of the model names and endpoint. Embedding skills and vectorizers assemble the full endpoint internally, so you only need the resource URI. For example, given `https://MY-FAKE-ACCOUNT.openai.azure.com/openai/deployments/text-embedding-ada-002/embeddings?api-version=2023-05-15`, the endpoint you should provide in skill and vectorizer definitions is `https://MY-FAKE-ACCOUNT.openai.azure.com`.
113121

114122
## Configure search engine access to Azure models
@@ -118,13 +126,20 @@ For pipeline and query execution, this tutorial uses Microsoft Entra ID for auth
118126
Assign yourself and the search service identity permissions on Azure OpenAI. The code for this tutorial runs locally. Requests to Azure OpenAI originate from your system. Also, search results from the search engine are passed to Azure OpenAI. For these reasons, both you and the search service need permissions on Azure OpenAI.
119127

120128
1. Sign in to the [Azure portal](https://portal.azure.com) and [find your search service](https://portal.azure.com/#blade/HubsExtension/BrowseResourceBlade/resourceType/Microsoft.Search%2FsearchServices).
129+
121130
1. Configure Azure AI Search to [use a system-managed identity](search-howto-managed-identities-data-sources.md).
131+
122132
1. Find your Azure OpenAI resource.
133+
123134
1. Select **Access control (IAM)** on the left menu.
135+
124136
1. Select **Add role assignment**.
137+
125138
1. Select [**Cognitive Services OpenAI User**](/azure/ai-services/openai/how-to/role-based-access-control#cognitive-services-openai-userpermissions).
126139
1. Select **Managed identity** and then select **Members**. Find the system-managed identity for your search service in the dropdown list.
140+
127141
1. Next, select **User, group, or service principal** and then select **Members**. Search for your user account and then select it from the dropdown list.
142+
128143
1. Select **Review and Assign** to create the role assignments.
129144

130145
For access to models on Azure AI Vision, assign **Cognitive Services OpenAI User**. For Azure AI Studio, assign **Azure AI Developer**.

articles/search/tutorial-rag-build-solution.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ manager: nitinme
77
author: HeidiSteen
88
ms.author: heidist
99
ms.service: cognitive-search
10-
ms.topic: conceptual
10+
ms.topic: overview
1111
ms.date: 09/12/2024
1212

1313
---

0 commit comments

Comments
 (0)