Merge pull request #214595 from HeidiSteen/heidist-fresh

JamesJBarnett · web-flow · commit 0a69b20a6ec2 · 2022-10-14T12:32:33.000-07:00
[azure search] relevance doc refresh
diff --git a/articles/search/TOC.yml b/articles/search/TOC.yml
@@ -143,6 +143,8 @@
       href: search-query-overview.md
     - name: Relevance scoring
       href: index-similarity-and-scoring.md
+    - name: Semantic ranking
+      href: semantic-ranking.md 
   - name: Indexing
     items:
     - name: Search indexes
@@ -456,11 +458,9 @@
       href: search-normalizers.md
   - name: Relevance
     items:
-    - name: Similarity ranking
+    - name: Configure scoring
       href: index-ranking-similarity.md
-    - name: Semantic ranking
-      href: semantic-ranking.md
-    - name: Scoring profiles 
+    - name: Add a scoring profile
       href: index-add-scoring-profiles.md
   - name: Performance and monitoring
     items:
diff --git a/articles/search/index-add-scoring-profiles.md b/articles/search/index-add-scoring-profiles.md
@@ -1,29 +1,33 @@
 ---
-title: Add scoring profiles to boost search scores
+title: Add scoring profiles
 titleSuffix: Azure Cognitive Search
 description: Boost search relevance scores for Azure Cognitive Search results by adding scoring profiles to a search index.
 
 manager: nitinme
 author: shmed
 ms.author: ramero
 ms.service: cognitive-search
-ms.topic: conceptual
-ms.date: 06/24/2022
+ms.topic: how-to
+ms.date: 10/14/2022
 ---
 
-# Add scoring profiles to a search index
+# Add scoring profiles to boost search scores
 
-For full text search queries, the search engine computes a search score for each matching document, which allows results to be ranked from high to low. Azure Cognitive Search uses a default scoring algorithm to compute an initial score, but you can customize the calculation through a *scoring profile*.
+In this article, you'll learn how to define a scoring profile for boosting search scores based on criteria.
 
-Scoring profiles are embedded in index definitions and include properties for boosting the score of matches, where additional criteria found in the profile provides the boosting logic. For example, you might want to boost matches based on their revenue potential, promote newer items, or perhaps boost items that have been in inventory too long.  
+Criteria can be a weighted field, such as when a match found in a "tags" field is more relevant than a match found in "descriptions". Criteria can also be a function, such as the `distance` function that favors results that are within a specified distance of the current location.
 
-Unfamiliar with relevance concepts? The following video segment fast-forwards to how scoring profiles work in Azure Cognitive Search, but the video also covers basic concepts. You might also want to review [Relevance and scoring in Azure Cognitive Search](index-similarity-and-scoring.md) for more background.
+Scoring profiles are defined in a search index and invoked on query requests. You can create multiple profiles and then modify query logic to choose which one is used.
 
-> [!VIDEO https://www.youtube.com/embed/Y_X6USgvB1g?version=3&start=463&end=970]
+> [!NOTE]
+> Unfamiliar with relevance concepts? The following video segment fast-forwards to how scoring profiles work in Azure Cognitive Search. You can also visit [Relevance and scoring in Azure Cognitive Search](index-similarity-and-scoring.md) for more background.
+>
+> > [!VIDEO https://www.youtube.com/embed/Y_X6USgvB1g?version=3&start=463&end=970]
+>
 
-## What is a scoring profile?
+## Scoring profile definition
 
-A scoring profile is part of the index definition and is composed of weighted fields, functions, and parameters. The purpose of a scoring profile is to boost or amplify matching documents based on criteria you provide. 
+A scoring profile is part of the index definition and is composed of weighted fields, functions, and parameters.
 
 The following definition shows a simple profile named 'geo'. This example boosts results that have the search term in the hotelName field. It also uses the `distance` function to favor results that are within 10 kilometers of the current location. If someone searches on the term 'inn', and 'inn' happens to be part of the hotel name, documents that include hotels with 'inn' within a 10 KM radius of the current location will appear higher in the search results.  
 
@@ -52,7 +56,7 @@ The following definition shows a simple profile named 'geo'. This example boosts
 ]
 ```  
 
-To use this scoring profile, your query is formulated to specify scoringProfile parameter in the request.
+Parameters are specified on invocation. To use this scoring profile, your query is formulated to specify scoringProfile parameter in the request.
 
 ```http
 POST /indexes/hotels/docs&api-version=2020-06-30
@@ -73,88 +77,14 @@ See the [Extended example](#bkmk_ex) to review a more detailed example of a scor
 
 Scores are computed for full text search queries for the purpose of ranking the most relevant matches and returning them at the top of the response. The overall score for each document is an aggregation of the individual scores for each field, where the individual score of each field is computed based on the term frequency and document frequency of the searched terms within that field (known as [TF-IDF](https://en.wikipedia.org/wiki/Tf%E2%80%93idf) or term frequency-inverse document frequency). 
 
-> [!TIP]
-> You can use the [featuresMode](index-similarity-and-scoring.md#featuresmode-parameter-preview) parameter to request additional scoring details with the search results (including the field level scores).
+You can use the [featuresMode (preview)](index-similarity-and-scoring.md#featuresmode-parameter-preview) parameter to request additional scoring details with the search results (including the field level scores).
 
 ## When to add scoring logic
 
 You should create one or more scoring profiles when the default ranking behavior doesn’t go far enough in meeting your business objectives. For example, you might decide that search relevance should favor newly added items. Likewise, you might have a field that contains profit margin, or some other field indicating revenue potential. Boosting results that are more meaningful to your users or the business is often the deciding factor in adoption of scoring profiles.
 
 Relevancy-based ordering in a search page is also implemented through scoring profiles. Consider search results pages you’ve used in the past that let you sort by price, date, rating, or relevance. In Azure Cognitive Search, scoring profiles can be used to drive the ‘relevance’ option. The definition of relevance is user-defined, predicated on business objectives and the type of search experience you want to deliver.  
 
-<a name="bkmk_ex"></a>
-
-## Extended example
-
-The following example shows the schema of an index with two scoring profiles (`boostGenre`, `newAndHighlyRated`). Any query against this index that includes either profile as a query parameter will use the profile to score the result set. 
-
-The `boostGenre` profile uses weighted text fields, boosting matches found in albumTitle, genre, and artistName fields. The fields are boosted 1.5, 5, and 2 respectively. Why is genre boosted so much higher than the others? If search is conducted over data that is somewhat homogenous (as is the case with 'genre' in the musicstoreindex), you might need a larger variance in the relative weights. For example, in the musicstoreindex, 'rock' appears as both a genre and in identically phrased genre descriptions. If you want genre to outweigh genre description, the genre field will need a much higher relative weight.
-
-```json
-{  
-  "name": "musicstoreindex",  
-  "fields": [  
-    { "name": "key", "type": "Edm.String", "key": true },  
-    { "name": "albumTitle", "type": "Edm.String" },  
-    { "name": "albumUrl", "type": "Edm.String", "filterable": false },  
-    { "name": "genre", "type": "Edm.String" },  
-    { "name": "genreDescription", "type": "Edm.String", "filterable": false },  
-    { "name": "artistName", "type": "Edm.String" },  
-    { "name": "orderableOnline", "type": "Edm.Boolean" },  
-    { "name": "rating", "type": "Edm.Int32" },  
-    { "name": "tags", "type": "Collection(Edm.String)" },  
-    { "name": "price", "type": "Edm.Double", "filterable": false },  
-    { "name": "margin", "type": "Edm.Int32", "retrievable": false },  
-    { "name": "inventory", "type": "Edm.Int32" },  
-    { "name": "lastUpdated", "type": "Edm.DateTimeOffset" }  
-  ],  
-  "scoringProfiles": [  
-    {  
-      "name": "boostGenre",  
-      "text": {  
-        "weights": {  
-          "albumTitle": 1.5,  
-          "genre": 5,  
-          "artistName": 2  
-        }  
-      }  
-    },  
-    {  
-      "name": "newAndHighlyRated",  
-      "functions": [  
-        {  
-          "type": "freshness",  
-          "fieldName": "lastUpdated",  
-          "boost": 10,  
-          "interpolation": "quadratic",  
-          "freshness": {  
-            "boostingDuration": "P365D"  
-          }  
-        },  
-        {
-          "type": "magnitude",  
-          "fieldName": "rating",  
-          "boost": 10,  
-          "interpolation": "linear",  
-          "magnitude": {  
-            "boostingRangeStart": 1,  
-            "boostingRangeEnd": 5,  
-            "constantBoostBeyondRange": false  
-          }  
-        }  
-      ]  
-    }  
-  ],  
-  "suggesters": [  
-    {  
-      "name": "sg",  
-      "searchMode": "analyzingInfixMatching",  
-      "sourceFields": [ "albumTitle", "artistName" ]  
-    }  
-  ]   
-}  
-```  
-
 ## Steps for adding a scoring profile
 
 To implement custom scoring behavior, add a scoring profile to the schema that defines the index. You can have up to 100 scoring profiles within an index (see [Service Limits](search-limits-quotas-capacity.md)), but you can only specify one profile at time in any given query.
@@ -330,6 +260,79 @@ The following table provides several examples.
 
 For more examples, see [XML Schema: Datatypes (W3.org web site)](https://www.w3.org/TR/xmlschema11-2/#dayTimeDuration).  
 
+<a name="bkmk_ex"></a>
+
+## Extended example
+
+The following example shows the schema of an index with two scoring profiles (`boostGenre`, `newAndHighlyRated`). Any query against this index that includes either profile as a query parameter will use the profile to score the result set. 
+
+The `boostGenre` profile uses weighted text fields, boosting matches found in albumTitle, genre, and artistName fields. The fields are boosted 1.5, 5, and 2 respectively. Why is genre boosted so much higher than the others? If search is conducted over data that is somewhat homogenous (as is the case with 'genre' in the musicstoreindex), you might need a larger variance in the relative weights. For example, in the musicstoreindex, 'rock' appears as both a genre and in identically phrased genre descriptions. If you want genre to outweigh genre description, the genre field will need a much higher relative weight.
+
+```json
+{  
+  "name": "musicstoreindex",  
+  "fields": [  
+    { "name": "key", "type": "Edm.String", "key": true },  
+    { "name": "albumTitle", "type": "Edm.String" },  
+    { "name": "albumUrl", "type": "Edm.String", "filterable": false },  
+    { "name": "genre", "type": "Edm.String" },  
+    { "name": "genreDescription", "type": "Edm.String", "filterable": false },  
+    { "name": "artistName", "type": "Edm.String" },  
+    { "name": "orderableOnline", "type": "Edm.Boolean" },  
+    { "name": "rating", "type": "Edm.Int32" },  
+    { "name": "tags", "type": "Collection(Edm.String)" },  
+    { "name": "price", "type": "Edm.Double", "filterable": false },  
+    { "name": "margin", "type": "Edm.Int32", "retrievable": false },  
+    { "name": "inventory", "type": "Edm.Int32" },  
+    { "name": "lastUpdated", "type": "Edm.DateTimeOffset" }  
+  ],  
+  "scoringProfiles": [  
+    {  
+      "name": "boostGenre",  
+      "text": {  
+        "weights": {  
+          "albumTitle": 1.5,  
+          "genre": 5,  
+          "artistName": 2  
+        }  
+      }  
+    },  
+    {  
+      "name": "newAndHighlyRated",  
+      "functions": [  
+        {  
+          "type": "freshness",  
+          "fieldName": "lastUpdated",  
+          "boost": 10,  
+          "interpolation": "quadratic",  
+          "freshness": {  
+            "boostingDuration": "P365D"  
+          }  
+        },  
+        {
+          "type": "magnitude",  
+          "fieldName": "rating",  
+          "boost": 10,  
+          "interpolation": "linear",  
+          "magnitude": {  
+            "boostingRangeStart": 1,  
+            "boostingRangeEnd": 5,  
+            "constantBoostBeyondRange": false  
+          }  
+        }  
+      ]  
+    }  
+  ],  
+  "suggesters": [  
+    {  
+      "name": "sg",  
+      "searchMode": "analyzingInfixMatching",  
+      "sourceFields": [ "albumTitle", "artistName" ]  
+    }  
+  ]   
+}  
+```  
+
 ## See also
 
 + [Relevance and scoring in Azure Cognitive Search](index-similarity-and-scoring.md)
diff --git a/articles/search/index-ranking-similarity.md b/articles/search/index-ranking-similarity.md
@@ -1,44 +1,58 @@
 ---
-title: Configure scoring algorithm
+title: Configure relevance scoring
 titleSuffix: Azure Cognitive Search
 description: Enable Okapi BM25 ranking to upgrade the search ranking and relevance behavior on older Azure Search services.
 
 author: HeidiSteen
 ms.author: heidist
 ms.service: cognitive-search
 ms.topic: how-to
-ms.date: 06/22/2022
+ms.date: 10/14/2022
 ---
 
-# Configure the scoring algorithm in Azure Cognitive Search
+# Configure relevance scoring
 
-Depending on the age of your search service, Azure Cognitive Search supports two [scoring algorithms](index-similarity-and-scoring.md) for assigning relevance to results in a full text search query:
+In this article, you'll learn how to configure the similarity scoring algorithm used by Azure Cognitive Search. The BM25 scoring model has defaults for weighting term frequency and document length. You can customize these properties if the defaults aren't suited to your content.
+
+Configuration changes are scoped to individual indexes, which means you can adjust relevance scoring based on the characteristics of each index.
+
+## Default scoring algorithm
+
+Depending on the age of your search service, Azure Cognitive Search supports two [similarity scoring algorithms](index-similarity-and-scoring.md) for assigning relevance to results in a full text search query:
 
 + An *Okapi BM25* algorithm, used in all search services created after July 15, 2020
 + A *classic similarity* algorithm, used by all search services created before July 15, 2020
 
-BM25 ranking is the default because it tends to produce search rankings that align better with user expectations. It includes [parameters](#set-bm25-parameters) for tuning results based on factors such as document size. For search services created after July 2020, BM25 is the sole scoring algorithm. If you try to set "similarity" to ClassicSimilarity on a new service, an HTTP 400 error will be returned because that algorithm is not supported by the service.
+BM25 ranking is the default because it tends to produce search rankings that align better with user expectations. It includes [parameters](#set-bm25-parameters) for tuning results based on factors such as document size. For search services created after July 2020, BM25 is the only scoring algorithm. If you try to set "similarity" to ClassicSimilarity on a new service, an HTTP 400 error will be returned because that algorithm is not supported by the service.
 
 For older services, classic similarity remains the default algorithm. Older services can [upgrade to BM25](#enable-bm25-scoring-on-older-services) on a per-index basis. When switching from classic to BM25, you can expect to see some differences how search results are ordered.
 
 ## Set BM25 parameters
 
-BM25 similarity adds two parameters to control the relevance score calculation. To set "similarity" parameters, issue a [Create or Update Index](/rest/api/searchservice/create-index) request as illustrated by the following example.
+BM25 similarity adds two parameters to control the relevance score calculation. 
 
-```http
-PUT [service-name].search.windows.net/indexes/[index-name]?api-version=2020-06-30&allowIndexDowntime=true
-{
-    "similarity": {
-        "@odata.type": "#Microsoft.Azure.Search.BM25Similarity",
-        "b" : 0.5,
-        "k1" : 1.3
+1. Formulate a [Create or Update Index](/rest/api/searchservice/create-index) request as illustrated by the following example.
+
+    ```http
+    PUT [service-name].search.windows.net/indexes/[index-name]?api-version=2020-06-30&allowIndexDowntime=true
+    {
+        "similarity": {
+            "@odata.type": "#Microsoft.Azure.Search.BM25Similarity",
+            "b" : 0.75,
+            "k1" : 1.2
+        }
     }
-}
-```
+    ```
+
+1. Set "b" and "k1" to custom values. See the property descriptions in the next section for details.
+
+1. If the index is live, append the "allowIndexDowntime=true" URI parameter on the request.
+
+   Because Cognitive Search won't allow updates to a live index, you'll need to take the index offline so that the parameters can be added. Indexing and query requests will fail while the index is offline. The duration of the outage is the amount of time it takes to update the index, usually no more than several seconds. When the update is complete, the index comes back automatically.
 
-Because Cognitive Search won't allow updates to a live index, you'll need to take the index offline so that the parameters can be added. Indexing and query requests will fail while the index is offline. The duration of the outage is the amount of time it takes to update the index, usually no more than several seconds. When the update is complete, the index comes back automatically. To take the index offline, append the "allowIndexDowntime=true" URI parameter on the request that sets the "similarity" property.
+1. Send the request.
 
-### BM25 property reference
+### BM25 property descriptions
 
 | Property | Type | Description |
 |----------|------|-------------|
diff --git a/articles/search/index-similarity-and-scoring.md b/articles/search/index-similarity-and-scoring.md
@@ -7,12 +7,14 @@ author: HeidiSteen
 ms.author: heidist
 ms.service: cognitive-search
 ms.topic: conceptual
-ms.date: 06/22/2022
+ms.date: 10/14/2022
 ---
 
 # Relevance and scoring in Azure Cognitive Search
 
-This article describes relevance and the scoring algorithms used to compute search scores in Azure Cognitive Search. A relevance score applies to matches returned in [full text search](search-lucene-query-architecture.md), where the most relevant matches appear first. Filter queries, autocomplete and suggested queries, wildcard search or fuzzy search queries are not scored or ranked for relevance.
+This article explains the relevance and the scoring algorithms used to compute search scores in Azure Cognitive Search. A relevance score is computed for each match found in a [full text search](search-lucene-query-architecture.md), where the strongest matches are assigned higher search scores. 
+
+Relevance applies to full text search only. Filter queries, autocomplete and suggested queries, wildcard search or fuzzy search queries are not scored or ranked for relevance.
 
 In Azure Cognitive Search, you can tune search relevance and boost search scores through these mechanisms:
 
@@ -87,7 +89,7 @@ As long as the same `sessionId` is used, a best-effort attempt will be made to t
 
 ## Scoring profiles
 
-You can customize the way different fields are ranked by defining a *scoring profile*. Scoring profiles give you greater control over the ranking of items in search results. For example, you might want to boost items based on their revenue potential, promote newer items, or perhaps boost items that have been in inventory too long. 
+You can customize the way different fields are ranked by defining a *scoring profile*. Scoring profiles provide criteria for boosting the search score of a match based on content characteristics. For example, you might want to boost matches based on their revenue potential, promote newer items, or perhaps boost items that have been in inventory too long. 
 
 A scoring profile is part of the index definition, composed of weighted fields, functions, and parameters. For more information about defining one, see [Scoring Profiles](index-add-scoring-profiles.md).
 
diff --git a/articles/search/semantic-ranking.md b/articles/search/semantic-ranking.md