You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/search/search-how-to-semantic-chunking.md
+10-6Lines changed: 10 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -29,14 +29,18 @@ In this article, learn how to:
29
29
> + Generate embeddings for each chunk
30
30
> + Use index projections to map embeddings to fields in a search index
31
31
32
-
This article uses the [sample health plan PDFs](https://github.com/Azure-Samples/azure-search-sample-data/tree/main/health-plan) uploaded to Azure Blob Storage and then indexed using the **Import and vectorize data** wizard.
32
+
For illustration purposes, this article uses the [sample health plan PDFs](https://github.com/Azure-Samples/azure-search-sample-data/tree/main/health-plan) uploaded to Azure Blob Storage and then indexed using the **Import and vectorize data wizard**.
33
33
34
34
## Prerequisites
35
35
36
36
+[An indexer-based indexing pipeline](search-indexer-overview.md) with an index that accepts the output. The index must have fields for receiving headings and content.
37
+
37
38
+[A supported data source](search-indexer-overview.md#supported-data-sources) having text content that you want to chunk.
39
+
38
40
+[A skillset with Document Layout skill](cognitive-search-skill-document-intelligence-layout.md) that splits documents based on paragraph boundaries.
41
+
39
42
+[An Azure OpenAI Embedding skill](cognitive-search-skill-azure-openai-embedding.md) that generates vector embeddings.
43
+
40
44
+[An index projection](search-how-to-define-index-projections.md) for one-to-many indexing.
41
45
42
46
## Prepare data files
@@ -380,13 +384,13 @@ POST /indexes/[index name]/docs/search?api-version=[api-version]
380
384
}
381
385
```
382
386
383
-
If you used the health plan PDFs to test this skill, search results for the example query should look similar to this example.
387
+
If you used the health plan PDFs to test this skill, Search Explorer results for the example query should look similar to the results in the following screenshot.
384
388
385
-
+It uses semantic ranking, so you see `captions` (it also has answers, but those aren't shown in the screenshot). The first result is also semantically relevant to the query string.
386
-
+It's a [hybrid query](hybrid-search-how-to-query.md) over text and vectors, so you see a `@search.rerankerScore` and results are ranked by that score.
387
-
+ The `select` statement specifies the header fields that the Document Layout skill detected and populated.
389
+
+The query uses semantic ranking, so you see `captions` (it also has `answers`, but those aren't shown in the screenshot). The results are the most semantically relevant to the query input, as determined by the [semantic ranker](semantic-search-overview.md).
390
+
+The query is a [hybrid query](hybrid-search-how-to-query.md) over text and vectors, so you see a `@search.rerankerScore` and results are ranked by that score.
391
+
+ The `select` statement specifies the header fields that the Document Layout skill detected and populated. You can add more fields to inspect the content of chunks, title, or any other human readable field.
388
392
389
-
:::image type="content" source="media/search-how-to-semantic-chunking/query-results-doc-layout.png" alt-text="Screenshot of query results that include doc layout skill output fields.":::
393
+
:::image type="content" source="media/search-how-to-semantic-chunking/query-results-doc-layout.png" lightbox="media/search-how-to-semantic-chunking/query-results-doc-layout.png" alt-text="Screenshot of hybrid query results that include doc layout skill output fields.":::
0 commit comments