You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
description: With exploration, Personalizer is able to continue delivering good results, even as user behavior changes. Choosing an exploration setting is a business decision about the proportion of user interactions to explore with, in order to improve the model.
4
+
description: With exploration, Personalizer is able to continuously deliver good results, even as user behavior changes. Choosing an exploration setting is a business decision about the proportion of user interactions to explore with, in order to improve the model.
5
5
author: jcodella
6
6
ms.author: jacodel
7
7
ms.manager: nitinme
8
8
ms.service: cognitive-services
9
9
ms.subservice: personalizer
10
10
ms.topic: conceptual
11
-
ms.date: 10/23/2019
11
+
ms.date: 08/28/2022
12
12
---
13
13
14
-
# Exploration and exploitation
14
+
# Exploration and Known
15
15
16
-
With exploration, Personalizer is able to continue delivering good results, even as user behavior changes.
16
+
With exploration, Personalizer is able to continuously deliver good results, even as user behavior changes.
17
17
18
18
When Personalizer receives a Rank call, it returns a RewardActionID that either:
19
-
* Uses exploitation to match the most probable user behavior based on the current machine learning model.
19
+
* Uses known relevance to match the most probable user behavior based on the current machine learning model.
20
20
* Uses exploration, which does not match the action that has the highest probability in the rank.
21
21
22
22
Personalizer currently uses an algorithm called *epsilon greedy* to explore.
@@ -25,7 +25,7 @@ Personalizer currently uses an algorithm called *epsilon greedy* to explore.
25
25
26
26
You configure the percentage of traffic to use for exploration in the Azure portal's **Configuration** page for Personalizer. This setting determines the percentage of Rank calls that perform exploration.
27
27
28
-
Personalizer determines whether to explore or exploit with this probability on each rank call. This is different than the behavior in some A/B frameworks that lock a treatment on specific user IDs.
28
+
Personalizer determines whether to explore or use the model's learned best action with this probability on each rank call. This is different than the behavior in some A/B frameworks that lock a treatment on specific user IDs.
29
29
30
30
## Best practices for choosing an exploration setting
31
31
@@ -35,8 +35,8 @@ A setting of zero will negate many of the benefits of Personalizer. With this se
35
35
36
36
A setting that is too high will negate the benefits of learning from user behavior. Setting it to 100% implies a constant randomization, and any learned behavior from users would not influence the outcome.
37
37
38
-
It is important not to change the application behavior based on whether you see if Personalizer is exploring or exploiting. This would lead to learning biases that ultimately would decrease the potential performance.
38
+
It is important not to change the application behavior based on whether you see if Personalizer is exploring or using the learned best action. This would lead to learning biases that ultimately would decrease the potential performance.
0 commit comments