Skip to content

Commit 03a0009

Browse files
Merge pull request #2472 from fbsolo-ms1/feature-set-specification-transformation-concepts-branch
Freshness update for feature-set-materialization-concepts.md . . .
2 parents 5acda83 + 63a77d5 commit 03a0009

File tree

1 file changed

+8
-8
lines changed

1 file changed

+8
-8
lines changed

articles/machine-learning/feature-set-materialization-concepts.md

Lines changed: 8 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.topic: how-to
88
author: fbsolo-ms1
99
ms.author: franksolomon
1010
ms.reviewer: ynpandey
11-
ms.date: 12/06/2023
11+
ms.date: 01/23/2025
1212
ms.custom: template-concept, update-code2
1313
---
1414

@@ -43,9 +43,9 @@ As materialization jobs run for the feature set, they create or merge data inter
4343
- When two data intervals are continuous on the timeline, and they have the same data materialization status, they become one data interval
4444
- In a data interval, when a portion of the feature data is materialized again, and that portion gets a different data materialization status, that data interval is split into multiple data intervals
4545

46-
When users select a feature window, they might see multiple data intervals in that window with different data materialization statuses. They might see multiple data intervals that are disjoint on the timeline. For example, the earlier snapshot has 16 *data intervals* for the defined *feature window* in the offline materialization store.
46+
When a user selects a feature window, multiple data intervals, with different data materialization statuses, might appear in that window. Additionally, multiple data intervals, that are disjoint on the timeline, might also appear. For example, the earlier snapshot has 16 *data intervals* for the defined *feature window* in the offline materialization store.
4747

48-
At any given time, a feature set can have at most 2,000 *data intervals*. Once a feature set reaches that limit, no more materialization jobs can run. Users must then create a new feature set version with materialization enabled. For the new feature set version, materialize the features in the offline and online stores from scratch.
48+
At any given time, a feature set can have at most 2,000 *data intervals*. Once a feature set reaches that limit, no more materialization jobs can run. Users must then create a new feature set version, with materialization enabled. For the new feature set version, materialize the features in the offline and online stores from scratch.
4949

5050
To avoid the limit, users should run backfill jobs in advance to [fill the gaps](#filling-the-gaps) in the data intervals. This merges the data intervals, and reduces the total count.
5151

@@ -55,10 +55,10 @@ Before you run a data materialization job, enable the offline and/or online data
5555

5656
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/4.Enable-online-store-run-inference.ipynb?name=enable-accounts-material)]
5757

58-
You can submit the data materialization jobs as a:
58+
You can submit the data materialization jobs as:
5959

60-
- backfill job - a manually submitted batch materialization job
61-
- recurrent materialization job - an automatic materialization job [triggered on a scheduled interval](./feature-set-materialization-concepts.md#set-proper-source_delay-and-recurrent-schedule).
60+
- a backfill job - a manually submitted batch materialization job
61+
- a recurrent materialization job - an automatic materialization job [triggered on a scheduled interval](./feature-set-materialization-concepts.md#set-proper-source_delay-and-recurrent-schedule).
6262

6363
> [!WARNING]
6464
> Data already materialized in the offline and/or online materialization will no longer be usable if offline and/or online data materialization is disabled at the feature set level. The data materialization status in offline and/or online materialization store will be reset to `None`.
@@ -70,7 +70,7 @@ You can submit backfill jobs by:
7070

7171
### Data backfill by data materialization status
7272

73-
User can submit a backfill request with:
73+
Users can submit a backfill request with:
7474

7575
- A list of data materialization status values - Incomplete, Complete, or None
7676
- A feature window (optional)
@@ -184,7 +184,7 @@ This example defines a daily job that triggers at 4 AM, starting on 4/15/2023. D
184184

185185
### Update materialization store
186186

187-
Before you update a feature store online or offline materialization store, all feature sets in that feature store should have the corresponding offline and/or online materialization disabled. The update operation fails as `UserError`, if some feature sets have materialization enabled.
187+
Before you update an online feature store or an offline materialization store, all feature sets in that feature store should have the corresponding offline and/or online materialization disabled. The update operation fails as `UserError` if some feature sets have materialization enabled.
188188

189189
The materialization status of the data in the offline and/or online materialization store resets if offline and/or online materialization is disabled on a feature set. The reset renders materialized data unusable. If offline and/or online materialization on the feature set is enabled later, users must resubmit their materialization jobs.
190190

0 commit comments

Comments
 (0)