Skip to content

Commit 26e016f

Browse files
committed
revised the example query and screenshot
1 parent 6cfcaea commit 26e016f

File tree

2 files changed

+10
-6
lines changed

2 files changed

+10
-6
lines changed
-20.8 KB
Loading

articles/search/search-how-to-semantic-chunking.md

Lines changed: 10 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -29,14 +29,18 @@ In this article, learn how to:
2929
> + Generate embeddings for each chunk
3030
> + Use index projections to map embeddings to fields in a search index
3131
32-
This article uses the [sample health plan PDFs](https://github.com/Azure-Samples/azure-search-sample-data/tree/main/health-plan) uploaded to Azure Blob Storage and then indexed using the **Import and vectorize data** wizard.
32+
For illustration purposes, this article uses the [sample health plan PDFs](https://github.com/Azure-Samples/azure-search-sample-data/tree/main/health-plan) uploaded to Azure Blob Storage and then indexed using the **Import and vectorize data wizard**.
3333

3434
## Prerequisites
3535

3636
+ [An indexer-based indexing pipeline](search-indexer-overview.md) with an index that accepts the output. The index must have fields for receiving headings and content.
37+
3738
+ [A supported data source](search-indexer-overview.md#supported-data-sources) having text content that you want to chunk.
39+
3840
+ [A skillset with Document Layout skill](cognitive-search-skill-document-intelligence-layout.md) that splits documents based on paragraph boundaries.
41+
3942
+ [An Azure OpenAI Embedding skill](cognitive-search-skill-azure-openai-embedding.md) that generates vector embeddings.
43+
4044
+ [An index projection](search-how-to-define-index-projections.md) for one-to-many indexing.
4145

4246
## Prepare data files
@@ -380,13 +384,13 @@ POST /indexes/[index name]/docs/search?api-version=[api-version]
380384
}
381385
```
382386

383-
If you used the health plan PDFs to test this skill, search results for the example query should look similar to this example.
387+
If you used the health plan PDFs to test this skill, Search Explorer results for the example query should look similar to the results in the following screenshot.
384388

385-
+ It uses semantic ranking, so you see `captions` (it also has answers, but those aren't shown in the screenshot). The first result is also semantically relevant to the query string.
386-
+ It's a [hybrid query](hybrid-search-how-to-query.md) over text and vectors, so you see a `@search.rerankerScore` and results are ranked by that score.
387-
+ The `select` statement specifies the header fields that the Document Layout skill detected and populated.
389+
+ The query uses semantic ranking, so you see `captions` (it also has `answers`, but those aren't shown in the screenshot). The results are the most semantically relevant to the query input, as determined by the [semantic ranker](semantic-search-overview.md).
390+
+ The query is a [hybrid query](hybrid-search-how-to-query.md) over text and vectors, so you see a `@search.rerankerScore` and results are ranked by that score.
391+
+ The `select` statement specifies the header fields that the Document Layout skill detected and populated. You can add more fields to inspect the content of chunks, title, or any other human readable field.
388392

389-
:::image type="content" source="media/search-how-to-semantic-chunking/query-results-doc-layout.png" alt-text="Screenshot of query results that include doc layout skill output fields.":::
393+
:::image type="content" source="media/search-how-to-semantic-chunking/query-results-doc-layout.png" lightbox="media/search-how-to-semantic-chunking/query-results-doc-layout.png" alt-text="Screenshot of hybrid query results that include doc layout skill output fields.":::
390394

391395
## See also
392396

0 commit comments

Comments
 (0)