Skip to content

Commit ed2152f

Browse files
Drift Explainability Report Images
1 parent b0dda3c commit ed2152f

File tree

5 files changed

+142
-5
lines changed

5 files changed

+142
-5
lines changed

md-docs/imgs/monitoring/drift-explainability/concept-fi.svg

Lines changed: 1 addition & 0 deletions
Loading

md-docs/imgs/monitoring/drift-explainability/fi.svg

Lines changed: 53 additions & 0 deletions
Loading

md-docs/imgs/monitoring/drift-explainability/score.svg

Lines changed: 53 additions & 0 deletions
Loading

md-docs/user_guide/monitoring/drift_explainability.md

Lines changed: 31 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -10,14 +10,14 @@ The ML cube Platform supports this process by offering what we refer to as **Dri
1010
automatically generated upon the detection of a drift and containing several elements that should help you diagnose the root causes
1111
of the change occurred.
1212

13-
You can access the reports by navigating to the `Drift Explainability` tab in the sidebar of the task page.
13+
You can access the reports in the WebApp, by navigating to the `Drift Explainability` tab in the sidebar of the Task page.
1414

1515
## Structure
1616

1717
A Drift Explainability Report consists in comparing the reference data and the portion of production data where the drift was identified, hence
18-
those belonging to the new data distribution. Notice that these reports are generated after a sufficient amount of samples has been collected after the drift.
19-
This is because the elements of the report needs a significant number of samples to guarantee statistical reliability of the results.
20-
If the distribution moves back to the reference before enough samples are collected, the report might not be generated.
18+
those belonging to the new data distribution. Notice that these reports are generated after a sufficient amount of samples has been collected
19+
after the drift, in order to ensure statistical reliability of the results.
20+
If the data distribution moves back to the reference before enough samples are collected, the report might not be generated.
2121

2222
Each report is composed of several entities, each providing a different perspective on the data and the drift occurred.
2323
Most of them are specific to a certain `Data Structure`, so they might not be available for all tasks.
@@ -29,9 +29,36 @@ These are the entities currently available:
2929
- `Feature Importance`: it's a barplot that illustrates how the significance of each feature differs between the reference
3030
and the production datasets. Variations in a feature's values might suggest that its contribution to the model's predictions
3131
has changed over time. This entity is available only for tasks with tabular data.
32+
33+
<figure markdown>
34+
![Feature Importance](../../imgs/monitoring/drift-explainability/fi.svg)
35+
<figcaption>Example of a feature importance plot.</figcaption>
36+
</figure>
37+
3238
- `Variable discriminative power`: it's also a bar plot displays the influence of each feature, as well as the target,
3339
in differentiating between the reference and the production datasets.
3440
The values represent how strongly a given feature helps to distinguish the datasets, with higher values representing stronger
3541
separating power. This entity is available only for tasks with tabular data.
3642

43+
<figure markdown>
44+
![Variable discriminative power](../../imgs/monitoring/drift-explainability/concept-fi.svg)
45+
<figcaption>Example of a variable discriminative power plot.</figcaption>
46+
</figure>
47+
48+
- `Drift Score`: it's a line plot that shows the evolution of the drift score over time. The drift score is a
49+
measure of the statistical distance between a sliding window of the production data and the reference data. It also shows the threshold,
50+
which is the value that the drift score must exceed to raise a drift alarm, and all the [Detection Events] that were triggered in
51+
the time frame of the report. This plot helps in understanding how the drift evolved over time and the moments in which the difference
52+
between the two datasets was higher. Notice that some postprocessing is applied on the events to account for the functioning of the drift detection algorithms.
53+
Specifically,
54+
we shift back the drift on events by a certain offset, aiming to point at the precise time when the drift actually started. As a result,
55+
drift on events might be shown before the threshold is exceeded. This entity is available for all tasks.
56+
57+
58+
<figure markdown style="width: 100%">
59+
![Drift score](../../imgs/monitoring/drift-explainability/score.svg)
60+
<figcaption style="width: 100%; text-align: center;">Example of a drift score plot with detection events of increasing severity displayed.</figcaption>
61+
</figure>
62+
3763
[Monitoring]: index.md
64+
[Detection Events]: detection_event.md

md-docs/user_guide/monitoring/index.md

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -50,7 +50,10 @@ is automatically enabled upon the task creation. The idea underlying defining ma
5050
only the model error, is to provide a comprehensive view of the model's
5151
performance and the data distribution, easing the identification of the root causes of a drift and thus facilitating the corrective actions.
5252

53-
![Monitoring Targets and Metrics overview](../../imgs/monitoring-overview.svg)
53+
<figure markdown style="width: 100%">
54+
![Monitoring Overview](../../imgs/monitoring-overview.svg)
55+
<figcaption> Monitoring Targets and Metrics overview</figcaption>
56+
</figure>
5457

5558
#### Monitoring Targets
5659

0 commit comments

Comments
 (0)