You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/feature-set-materialization-concepts.md
+8-8Lines changed: 8 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,7 +8,7 @@ ms.topic: how-to
8
8
author: fbsolo-ms1
9
9
ms.author: franksolomon
10
10
ms.reviewer: ynpandey
11
-
ms.date: 12/06/2023
11
+
ms.date: 01/23/2025
12
12
ms.custom: template-concept, update-code2
13
13
---
14
14
@@ -43,9 +43,9 @@ As materialization jobs run for the feature set, they create or merge data inter
43
43
- When two data intervals are continuous on the timeline, and they have the same data materialization status, they become one data interval
44
44
- In a data interval, when a portion of the feature data is materialized again, and that portion gets a different data materialization status, that data interval is split into multiple data intervals
45
45
46
-
When users select a feature window, they might see multiple data intervals in that window with different data materialization statuses. They might see multiple data intervals that are disjoint on the timeline. For example, the earlier snapshot has 16 *data intervals* for the defined *feature window* in the offline materialization store.
46
+
When a user selects a feature window, multiple data intervals, with different data materialization statuses, might appear in that window. Additionally, multiple data intervals, that are disjoint on the timeline, might also appear. For example, the earlier snapshot has 16 *data intervals* for the defined *feature window* in the offline materialization store.
47
47
48
-
At any given time, a feature set can have at most 2,000 *data intervals*. Once a feature set reaches that limit, no more materialization jobs can run. Users must then create a new feature set version with materialization enabled. For the new feature set version, materialize the features in the offline and online stores from scratch.
48
+
At any given time, a feature set can have at most 2,000 *data intervals*. Once a feature set reaches that limit, no more materialization jobs can run. Users must then create a new feature set version, with materialization enabled. For the new feature set version, materialize the features in the offline and online stores from scratch.
49
49
50
50
To avoid the limit, users should run backfill jobs in advance to [fill the gaps](#filling-the-gaps) in the data intervals. This merges the data intervals, and reduces the total count.
51
51
@@ -55,10 +55,10 @@ Before you run a data materialization job, enable the offline and/or online data
You can submit the data materialization jobs as a:
58
+
You can submit the data materialization jobs as:
59
59
60
-
- backfill job - a manually submitted batch materialization job
61
-
- recurrent materialization job - an automatic materialization job [triggered on a scheduled interval](./feature-set-materialization-concepts.md#set-proper-source_delay-and-recurrent-schedule).
60
+
-a backfill job - a manually submitted batch materialization job
61
+
-a recurrent materialization job - an automatic materialization job [triggered on a scheduled interval](./feature-set-materialization-concepts.md#set-proper-source_delay-and-recurrent-schedule).
62
62
63
63
> [!WARNING]
64
64
> Data already materialized in the offline and/or online materialization will no longer be usable if offline and/or online data materialization is disabled at the feature set level. The data materialization status in offline and/or online materialization store will be reset to `None`.
@@ -70,7 +70,7 @@ You can submit backfill jobs by:
70
70
71
71
### Data backfill by data materialization status
72
72
73
-
User can submit a backfill request with:
73
+
Users can submit a backfill request with:
74
74
75
75
- A list of data materialization status values - Incomplete, Complete, or None
76
76
- A feature window (optional)
@@ -184,7 +184,7 @@ This example defines a daily job that triggers at 4 AM, starting on 4/15/2023. D
184
184
185
185
### Update materialization store
186
186
187
-
Before you update a feature store online or offline materialization store, all feature sets in that feature store should have the corresponding offline and/or online materialization disabled. The update operation fails as `UserError`, if some feature sets have materialization enabled.
187
+
Before you update an online feature store or an offline materialization store, all feature sets in that feature store should have the corresponding offline and/or online materialization disabled. The update operation fails as `UserError` if some feature sets have materialization enabled.
188
188
189
189
The materialization status of the data in the offline and/or online materialization store resets if offline and/or online materialization is disabled on a feature set. The reset renders materialized data unusable. If offline and/or online materialization on the feature set is enabled later, users must resubmit their materialization jobs.
0 commit comments