You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: md-docs/user_guide/monitoring/drift_explainability.md
+31-4Lines changed: 31 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -10,14 +10,14 @@ The ML cube Platform supports this process by offering what we refer to as **Dri
10
10
automatically generated upon the detection of a drift and containing several elements that should help you diagnose the root causes
11
11
of the change occurred.
12
12
13
-
You can access the reports by navigating to the `Drift Explainability` tab in the sidebar of the task page.
13
+
You can access the reports in the WebApp, by navigating to the `Drift Explainability` tab in the sidebar of the Task page.
14
14
15
15
## Structure
16
16
17
17
A Drift Explainability Report consists in comparing the reference data and the portion of production data where the drift was identified, hence
18
-
those belonging to the new data distribution. Notice that these reports are generated after a sufficient amount of samples has been collected after the drift.
19
-
This is because the elements of the report needs a significant number of samples to guarantee statistical reliability of the results.
20
-
If the distribution moves back to the reference before enough samples are collected, the report might not be generated.
18
+
those belonging to the new data distribution. Notice that these reports are generated after a sufficient amount of samples has been collected
19
+
after the drift, in order to ensure statistical reliability of the results.
20
+
If the data distribution moves back to the reference before enough samples are collected, the report might not be generated.
21
21
22
22
Each report is composed of several entities, each providing a different perspective on the data and the drift occurred.
23
23
Most of them are specific to a certain `Data Structure`, so they might not be available for all tasks.
@@ -29,9 +29,36 @@ These are the entities currently available:
29
29
-`Feature Importance`: it's a barplot that illustrates how the significance of each feature differs between the reference
30
30
and the production datasets. Variations in a feature's values might suggest that its contribution to the model's predictions
31
31
has changed over time. This entity is available only for tasks with tabular data.
<figcaption>Example of a variable discriminative power plot.</figcaption>
46
+
</figure>
47
+
48
+
-`Drift Score`: it's a line plot that shows the evolution of the drift score over time. The drift score is a
49
+
measure of the statistical distance between a sliding window of the production data and the reference data. It also shows the threshold,
50
+
which is the value that the drift score must exceed to raise a drift alarm, and all the [Detection Events] that were triggered in
51
+
the time frame of the report. This plot helps in understanding how the drift evolved over time and the moments in which the difference
52
+
between the two datasets was higher. Notice that some postprocessing is applied on the events to account for the functioning of the drift detection algorithms.
53
+
Specifically,
54
+
we shift back the drift on events by a certain offset, aiming to point at the precise time when the drift actually started. As a result,
55
+
drift on events might be shown before the threshold is exceeded. This entity is available for all tasks.
0 commit comments