Skip to content

Commit 2c14414

Browse files
authored
Merge pull request #209437 from jeffmend/patch-6
Update concepts-exploration.md
2 parents b70d0e0 + 89eaab0 commit 2c14414

File tree

1 file changed

+8
-8
lines changed

1 file changed

+8
-8
lines changed
Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -1,22 +1,22 @@
11
---
22
title: Exploration - Personalizer
33
titleSuffix: Azure Cognitive Services
4-
description: With exploration, Personalizer is able to continue delivering good results, even as user behavior changes. Choosing an exploration setting is a business decision about the proportion of user interactions to explore with, in order to improve the model.
4+
description: With exploration, Personalizer is able to continuously deliver good results, even as user behavior changes. Choosing an exploration setting is a business decision about the proportion of user interactions to explore with, in order to improve the model.
55
author: jcodella
66
ms.author: jacodel
77
ms.manager: nitinme
88
ms.service: cognitive-services
99
ms.subservice: personalizer
1010
ms.topic: conceptual
11-
ms.date: 10/23/2019
11+
ms.date: 08/28/2022
1212
---
1313

14-
# Exploration and exploitation
14+
# Exploration and Known
1515

16-
With exploration, Personalizer is able to continue delivering good results, even as user behavior changes.
16+
With exploration, Personalizer is able to continuously deliver good results, even as user behavior changes.
1717

1818
When Personalizer receives a Rank call, it returns a RewardActionID that either:
19-
* Uses exploitation to match the most probable user behavior based on the current machine learning model.
19+
* Uses known relevance to match the most probable user behavior based on the current machine learning model.
2020
* Uses exploration, which does not match the action that has the highest probability in the rank.
2121

2222
Personalizer currently uses an algorithm called *epsilon greedy* to explore.
@@ -25,7 +25,7 @@ Personalizer currently uses an algorithm called *epsilon greedy* to explore.
2525

2626
You configure the percentage of traffic to use for exploration in the Azure portal's **Configuration** page for Personalizer. This setting determines the percentage of Rank calls that perform exploration.
2727

28-
Personalizer determines whether to explore or exploit with this probability on each rank call. This is different than the behavior in some A/B frameworks that lock a treatment on specific user IDs.
28+
Personalizer determines whether to explore or use the model's learned best action with this probability on each rank call. This is different than the behavior in some A/B frameworks that lock a treatment on specific user IDs.
2929

3030
## Best practices for choosing an exploration setting
3131

@@ -35,8 +35,8 @@ A setting of zero will negate many of the benefits of Personalizer. With this se
3535

3636
A setting that is too high will negate the benefits of learning from user behavior. Setting it to 100% implies a constant randomization, and any learned behavior from users would not influence the outcome.
3737

38-
It is important not to change the application behavior based on whether you see if Personalizer is exploring or exploiting. This would lead to learning biases that ultimately would decrease the potential performance.
38+
It is important not to change the application behavior based on whether you see if Personalizer is exploring or using the learned best action. This would lead to learning biases that ultimately would decrease the potential performance.
3939

4040
## Next steps
4141

42-
[Reinforcement learning](concepts-reinforcement-learning.md)
42+
[Reinforcement learning](concepts-reinforcement-learning.md)

0 commit comments

Comments
 (0)