Skip to content

Commit 6b2f1b6

Browse files
Merge pull request #252666 from HeidiSteen/heidist-freshness
misc edits
2 parents 9606b14 + b35cf43 commit 6b2f1b6

File tree

2 files changed

+5
-5
lines changed

2 files changed

+5
-5
lines changed

articles/search/semantic-how-to-query-request.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -34,7 +34,7 @@ To use semantic ranking:
3434
+ Review [semantic ranking](semantic-search-overview.md) if you need an introduction to the feature.
3535

3636
> [!NOTE]
37-
> Captions and answers are extracted verbatim from text in the search document. The semantic subsystem uses language understanding to recognize what part of your content has the characteristics of a caption or answer, but it doesn't compose new sentences or phrases. For this reason, content that includes explanations or definitions work best for semantic ranking.
37+
> Captions and answers are extracted verbatim from text in the search document. The semantic subsystem uses language understanding to recognize content having the characteristics of a caption or answer, but doesn't compose new sentences or phrases. For this reason, content that includes explanations or definitions work best for semantic ranking. If you want chat-style interaction with generated responses, see [Retrieval Augmented Generation (RAG)](retrieval-augmented-generation-overview.md).
3838
3939
## 1 - Choose a client
4040

@@ -50,13 +50,13 @@ Choose a search client that supports preview APIs on the query request. Here are
5050

5151
## 2 - Create a semantic configuration
5252

53-
A *semantic configuration* is a section in your index that establishes field inputs for semantic ranking. You can add or update a semantic configuration at any time, no rebuild necessary. At query time, specify one on a query request. A semantic configuration has a name and the following properties:
53+
A *semantic configuration* is a section in your index that establishes field inputs for semantic ranking. You can add or update a semantic configuration at any time, no rebuild necessary. At query time, specify one on a [query request](#4---set-up-the-query). A semantic configuration has a name and the following properties:
5454

5555
| Property | Characteristics |
5656
|----------|-----------------|
5757
| Title field | A short string, ideally under 25 words. This field could be the title of a document, name of a product, or a unique identifier. If you don't have suitable field, leave it blank. |
58-
| Content fields | Should contain text in natural language form. Common examples include the body of a document, description of a product, or other free-form text. |
59-
| Keyword fields | Should be a list of keywords, such as the tags on a document, or a descriptive term, such as the category of an item. |
58+
| Content fields | Longer chunks of text in natural language form, subject to [maximum token input limits](semantic-search-overview.md#how-inputs-are-prepared) on the machine learning models. Common examples include the body of a document, description of a product, or other free-form text. |
59+
| Keyword fields | A list of keywords, such as the tags on a document, or a descriptive term, such as the category of an item. |
6060

6161
You can only specify one title field, but you can specify as many content and keyword fields as you like. For content and keyword fields, list the fields in priority order because lower priority fields may get truncated.
6262

articles/search/semantic-search-overview.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -69,7 +69,7 @@ In semantic ranking, the query subsystem passes search results as an input to th
6969

7070
Each document is now represented by a single long string.
7171

72-
The string is composed of tokens, not characters or words. The maximum token count is 256 unique tokens. For estimation purposes, you can assume that 256 tokens are roughly equivalent to a string that is 256 words in length.
72+
**Maximum token counts (256)**. The string is composed of tokens, not characters or words. The maximum token count is 256 unique tokens. For estimation purposes, you can assume that 256 tokens are roughly equivalent to a string that is 256 words in length.
7373

7474
> [!NOTE]
7575
> Tokenization is determined in part by the [analyzer assignment](search-analyzers.md) on searchable fields. If you are using specialized analyzer, such as nGram or EdgeNGram, you might want to exclude that field from semantic ranking. For insights into how strings are tokenized, you can review the token output of an analyzer using the [Test Analyzer REST API](/rest/api/searchservice/test-analyzer).

0 commit comments

Comments
 (0)