You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/search/semantic-how-to-query-request.md
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -34,7 +34,7 @@ To use semantic ranking:
34
34
+ Review [semantic ranking](semantic-search-overview.md) if you need an introduction to the feature.
35
35
36
36
> [!NOTE]
37
-
> Captions and answers are extracted verbatim from text in the search document. The semantic subsystem uses language understanding to recognize what part of your content has the characteristics of a caption or answer, but it doesn't compose new sentences or phrases. For this reason, content that includes explanations or definitions work best for semantic ranking.
37
+
> Captions and answers are extracted verbatim from text in the search document. The semantic subsystem uses language understanding to recognize content having the characteristics of a caption or answer, but doesn't compose new sentences or phrases. For this reason, content that includes explanations or definitions work best for semantic ranking. If you want chat-style interaction with generated responses, see [Retrieval Augmented Generation (RAG)](retrieval-augmented-generation-overview.md).
38
38
39
39
## 1 - Choose a client
40
40
@@ -50,13 +50,13 @@ Choose a search client that supports preview APIs on the query request. Here are
50
50
51
51
## 2 - Create a semantic configuration
52
52
53
-
A *semantic configuration* is a section in your index that establishes field inputs for semantic ranking. You can add or update a semantic configuration at any time, no rebuild necessary. At query time, specify one on a query request. A semantic configuration has a name and the following properties:
53
+
A *semantic configuration* is a section in your index that establishes field inputs for semantic ranking. You can add or update a semantic configuration at any time, no rebuild necessary. At query time, specify one on a [query request](#4---set-up-the-query). A semantic configuration has a name and the following properties:
54
54
55
55
| Property | Characteristics |
56
56
|----------|-----------------|
57
57
| Title field | A short string, ideally under 25 words. This field could be the title of a document, name of a product, or a unique identifier. If you don't have suitable field, leave it blank. |
58
-
| Content fields |Should contain text in natural language form. Common examples include the body of a document, description of a product, or other free-form text. |
59
-
| Keyword fields |Should be a list of keywords, such as the tags on a document, or a descriptive term, such as the category of an item.|
58
+
| Content fields |Longer chunks of text in natural language form, subject to [maximum token input limits](semantic-search-overview.md#how-inputs-are-prepared) on the machine learning models. Common examples include the body of a document, description of a product, or other free-form text. |
59
+
| Keyword fields |A list of keywords, such as the tags on a document, or a descriptive term, such as the category of an item. |
60
60
61
61
You can only specify one title field, but you can specify as many content and keyword fields as you like. For content and keyword fields, list the fields in priority order because lower priority fields may get truncated.
Copy file name to clipboardExpand all lines: articles/search/semantic-search-overview.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -69,7 +69,7 @@ In semantic ranking, the query subsystem passes search results as an input to th
69
69
70
70
Each document is now represented by a single long string.
71
71
72
-
The string is composed of tokens, not characters or words. The maximum token count is 256 unique tokens. For estimation purposes, you can assume that 256 tokens are roughly equivalent to a string that is 256 words in length.
72
+
**Maximum token counts (256)**. The string is composed of tokens, not characters or words. The maximum token count is 256 unique tokens. For estimation purposes, you can assume that 256 tokens are roughly equivalent to a string that is 256 words in length.
73
73
74
74
> [!NOTE]
75
75
> Tokenization is determined in part by the [analyzer assignment](search-analyzers.md) on searchable fields. If you are using specialized analyzer, such as nGram or EdgeNGram, you might want to exclude that field from semantic ranking. For insights into how strings are tokenized, you can review the token output of an analyzer using the [Test Analyzer REST API](/rest/api/searchservice/test-analyzer).
0 commit comments