Merge pull request #217765 from rmca14/11-8edits

prmerger-automator[bot] · web-flow · commit 30065c17e8f7 · 2022-11-10T16:24:12.000Z
[CogSvcs] Additional edits from PR 215826
diff --git a/articles/cognitive-services/personalizer/concepts-features.md b/articles/cognitive-services/personalizer/concepts-features.md
@@ -11,20 +11,10 @@ ms.topic: conceptual
 ms.date: 10/25/2022
 ---
 
-# Context and Actions
+# Context and actions
 
 Personalizer works by learning what your application should show to users in a given context. These are the two most important pieces of information that you pass into Personalizer. The **context** represents the information you have about the current user or the state of your system, and the **actions** are the options to be chosen from.
 
-## Table of Contents
-
-* [Context](#context) Information about the current user or state of the system 
-* [Actions](#actions) A list of options to choose from
-* [Features](#features) Attributes describing the Context and Actions
-* [Feature Engineering](#feature-engineering) Tips for constructing impactful features
-* [Namespaces](#namespaces) Grouping Features
-* [Examples](#json-examples) Examples of Context and Action features in JSON format
-
-
 ## Context
 
 Information for the _context_ depends on each application and use case, but it typically may include information such as:
@@ -34,18 +24,16 @@ Information for the _context_ depends on each application and use case, but it t
 * Information about the current time, such as day of the week, weekend or not, morning or afternoon, holiday season or not, etc.
 * Information extracted from mobile applications, such as location, movement, or battery level.
 * Historical aggregates of the behavior of users - such as what are the movie genres this user has viewed the most.
-* Information about the state of the system. 
+* Information about the state of the system.
 
 Your application is responsible for loading the information about the context from the relevant databases, sensors, and systems you may have. If your context information doesn't change, you can add logic in your application to cache this information, before sending it to the Rank API.
 
-
 ## Actions
 
 Actions represent a list of options.
 
 Don't send in more than 50 actions when Ranking actions. These may be the same 50 actions every time, or they may change. For example, if you have a product catalog of 10,000 items for an e-commerce application, you may use a recommendation or filtering engine to determine the top 40 a customer may like, and use Personalizer to find the one that will generate the most reward (for example, the user will add to the basket) for the current context.
 
-
 ### Examples of actions
 
 The actions you send to the Rank API will depend on what you are trying to personalize.
@@ -61,18 +49,15 @@ Here are some examples:
 |Choose a chat bot's response to clarify user intent or suggest an action.|Each action is an option of how to interpret the response.|
 |Choose what to show at the top of a list of search results|Each action is one of the top few search results.|
 
-
 ### Load actions from the client application
 
 Features from actions may typically come from content management systems, catalogs, and recommender systems. Your application is responsible for loading the information about the actions from the relevant databases and systems you have. If your actions don't change or getting them loaded every time has an unnecessary impact on performance, you can add logic in your application to cache this information.
 
-
 ### Prevent actions from being ranked
 
 In some cases, there are actions that you don't want to display to users. The best way to prevent an action from being ranked is by adding it to the [Excluded Actions](https://learn.microsoft.com/dotnet/api/microsoft.azure.cognitiveservices.personalizer.models.rankrequest.excludedactions) list, or not passing it to the Rank Request.
 
-In some cases, you might not want events to be trained on by default, i.e., you only want to train events when a specific condition is met. For example, The personalized part of your webpage is below the fold (users have to scroll before interacting with the personalized content). In this case you will render the entire page, but only want an event to be trained on when the user scrolls and has a chance to interact with the personalized content. For these cases, you should [Defer Event Activation](concept-active-inactive-events.md) to avoid assigning default reward (and training) events which the end user did not have a chance to interact with.
-
+In some cases, you might not want events to be trained on by default. In other words, you only want to train events when a specific condition is met. For example, The personalized part of your webpage is below the fold (users have to scroll before interacting with the personalized content). In this case you will render the entire page, but only want an event to be trained on when the user scrolls and has a chance to interact with the personalized content. For these cases, you should [Defer Event Activation](concept-active-inactive-events.md) to avoid assigning default reward (and training) events which the end user did not have a chance to interact with.
 
 ## Features
 
@@ -91,14 +76,14 @@ Personalizer does not prescribe, limit, or fix what features you can send for ac
 
 It's ok and natural for features to change over time. However, keep in mind that Personalizer's machine learning model adapts based on the features it sees. If you send a request containing all new features, Personalizer's model will not be able to leverage past events to select the best action for the current event. Having a 'stable' feature set (with recurring features) will help the performance of Personalizer's machine learning algorithms.
 
-### Context Features
+### Context features
 * Some context features may only be available part of the time. For example, if a user is logged into the online grocery store website, the context will contain features describing purchase history. These features will not be available for a guest user.
 * There must be at least one context feature. Personalizer does not support an empty context.
 * If the context features are identical for every request, Personalizer will choose the globally best action.
 
-### Action Features
+### Action features
 * Not all actions need to contain the same features. For example, in the online grocery store scenario, microwavable popcorn will have a "cooking time" feature, while a cucumber will not.
-* Features for a certain action ID may be available one day, but later on become unavailable. 
+* Features for a certain action ID may be available one day, but later on become unavailable.
 
 Examples:
 
@@ -112,17 +97,16 @@ The following are good examples for action features. These will depend a lot on
 
 Personalizer supports features of string, numeric, and boolean types. It's very likely that your application will mostly use string features, with a few exceptions.
 
-### How feature types affects the Machine Learning in Personalizer
+### How feature types affect machine learning in Personalizer
 
-* **Strings**: For string types, every key-value (feature name, feature value) combination is treated as a One-Hot feature (e.g. category:"Produce" and category:"Meat" would internally be represented as different features in the machine learning model.
+* **Strings**: For string types, every key-value (feature name, feature value) combination is treated as a One-Hot feature (for example, category:"Produce" and category:"Meat" would internally be represented as different features in the machine learning model).
 * **Numeric**: Only use numeric values when the number is a magnitude that should proportionally affect the personalization result. This is very scenario dependent. Features that are based on numeric units but where the meaning isn't linear - such as Age, Temperature, or Person Height - are best encoded as categorical strings. For example Age could be encoded as "Age":"0-5", "Age":"6-10", etc. Height could be bucketed as "Height": "<5'0", "Height": "5'0-5'4", "Height": "5'5-5'11", "Height":"6'0-6-4", "Height":">6'4".
 * **Boolean**
-* **Arrays** ONLY numeric arrays are supported.
-
+* **Arrays** Only numeric arrays are supported.
 
-## Feature Engineering
+## Feature engineering
 
-* Use categorical and string types for features that are not a magnitude. 
+* Use categorical and string types for features that are not a magnitude.
 * Make sure there are enough features to drive personalization. The more precisely targeted the content needs to be, the more features are needed.
 * There are features of diverse *densities*. A feature is *dense* if many items are grouped in a few buckets. For example, thousands of videos can be classified as "Long" (over 5 min long) and "Short" (under 5 min long). This is a *very dense* feature. On the other hand, the same thousands of items can have an attribute called "Title", which will almost never have the same value from one item to another. This is a very non-dense or *sparse* feature.  
 
@@ -134,21 +118,21 @@ Having features of high density helps Personalizer extrapolate learning from one
 * **Sending user IDs** With large numbers of users, it's unlikely that this information is relevant to Personalizer learning to maximize the average reward score. Sending user IDs (even if non-PII) will likely add more noise to the model and is not recommended.
 * **Sending unique values that will rarely occur more than a few times**. It's recommended to bucket your features to a higher level-of-detail. For example, having features such as `"Context.TimeStamp.Day":"Monday"` or `"Context.TimeStamp.Hour":13` can be useful as there are only 7 and 24 unique values, respectively. However, `"Context.TimeStamp":"1985-04-12T23:20:50.52Z"` is very precise and has an extremely large number of unique values, which makes it very difficult for Personalizer to learn from it.
 
-### Improve feature sets 
+### Improve feature sets
 
 Analyze the user behavior by running a [Feature Evaluation Job](how-to-feature-evaluation.md). This allows you to look at past data to see what features are heavily contributing to positive rewards versus those that are contributing less. You can see what features are helping, and it will be up to you and your application to find better features to send to Personalizer to improve results even further.
 
 ### Expand feature sets with artificial intelligence and cognitive services
 
-Artificial Intelligence and ready-to-run Cognitive Services can be a very powerful addition to Personalizer. 
+Artificial Intelligence and ready-to-run Cognitive Services can be a very powerful addition to Personalizer.
 
 By preprocessing your items using artificial intelligence services, you can automatically extract information that is likely to be relevant for personalization.
 
 For example:
 
-* You can run a movie file via [Video Indexer](https://azure.microsoft.com/services/media-services/video-indexer/) to extract scene elements, text, sentiment, and many other attributes. These attributes can then be made more dense to reflect characteristics that the original item metadata didn't have. 
+* You can run a movie file via [Video Indexer](https://azure.microsoft.com/services/media-services/video-indexer/) to extract scene elements, text, sentiment, and many other attributes. These attributes can then be made more dense to reflect characteristics that the original item metadata didn't have.
 * Images can be run through object detection, faces through sentiment, etc.
-* Information in text can be augmented by extracting entities, sentiment, expanding entities with Bing knowledge graph, etc.
+* Information in text can be augmented by extracting entities, sentiment, and expanding entities with Bing knowledge graph.
 
 You can use several other [Azure Cognitive Services](https://www.microsoft.com/cognitive-services), like
 
@@ -157,17 +141,16 @@ You can use several other [Azure Cognitive Services](https://www.microsoft.com/c
 * [Emotion](../face/overview.md)
 * [Computer Vision](../computer-vision/overview.md)
 
-### Use Embeddings as Features
+### Use embeddings as features
 
 Embeddings from various Machine Learning models have proven to be affective features for Personalizer
 
 * Embeddings from Large Language Models
 * Embeddings from Computer Vision Models
 
-
 ## Namespaces
 
-Optionally, features can be organized using namespaces (relevant for both context and action features). Namespaces can be used to group features by topic, by source, or any other grouping that makes sense in your application. You determine if namespaces are used and what they should be. Namespaces organize features into distinct sets, and disambiguate features with similar names. You can think of namespaces as a 'prefix' that is added to feature names. Namespaces should not be nested. 
+Optionally, features can be organized using namespaces (relevant for both context and action features). Namespaces can be used to group features by topic, by source, or any other grouping that makes sense in your application. You determine if namespaces are used and what they should be. Namespaces organize features into distinct sets, and disambiguate features with similar names. You can think of namespaces as a 'prefix' that is added to feature names. Namespaces should not be nested.
 
 The following are examples of feature namespaces used by applications:
 
@@ -191,13 +174,12 @@ The following are examples of feature namespaces used by applications:
 * The following characters cannot be used: codes < 32 (not printable), 32 (space), 58 (colon), 124 (pipe), and 126–140.
 * All namespaces starting with an underscore `_` will be ignored.
 
-
-## JSON Examples
+## JSON examples
 
 ### Actions
 When calling Rank, you will send multiple actions to choose from:
 
-JSON objects can include nested JSON objects and simple property/values. An array can be included only if the array items are numbers. 
+JSON objects can include nested JSON objects and simple property/values. An array can be included only if the array items are numbers.
 
 ```json
 {
@@ -266,7 +248,7 @@ JSON objects can include nested JSON objects and simple property/values. An arra
 
 Context is expressed as a JSON object that is sent to the Rank API:
 
-JSON objects can include nested JSON objects and simple property/values. An array can be included only if the array items are numbers. 
+JSON objects can include nested JSON objects and simple property/values. An array can be included only if the array items are numbers.
 
 ```JSON
 {
@@ -290,12 +272,11 @@ JSON objects can include nested JSON objects and simple property/values. An arra
 
 ### Namespaces
 
-In the following JSON, `user`, `environment`, `device`, and `activity` are namespaces. 
+In the following JSON, `user`, `environment`, `device`, and `activity` are namespaces.
 
 > [!Note]
 > We strongly recommend using names for feature namespaces that are UTF-8 based and start with different letters. For example, `user`, `environment`, `device`, and `activity` start with `u`, `e`, `d`, and `a`. Currently having namespaces with same first characters could result in collisions.
 
-
 ```JSON
 {
     "contextFeatures": [