You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/tutorial-enable-materialization-backfill-data.md
+22-22Lines changed: 22 additions & 22 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -135,31 +135,31 @@ The materialization store uses these values. You can optionally override the def
135
135
136
136
The first option is to create new storage and container resources.
137
137
138
-
# [Python SDK](#tab/python)
138
+
# [Python SDK](#tab/python)
139
139
140
-
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/2. Enable materialization and backfill feature data.ipynb?name=create-new-storage)]
140
+
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/2. Enable materialization and backfill feature data.ipynb?name=create-new-storage)]
141
141
142
-
# [Azure CLI](#tab/cli)
142
+
# [Azure CLI](#tab/cli)
143
143
144
-
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_and_cli/2. Enable materialization and backfill feature data.ipynb?name=create-new-storage)]
144
+
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_and_cli/2. Enable materialization and backfill feature data.ipynb?name=create-new-storage)]
145
145
146
-
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_and_cli/2. Enable materialization and backfill feature data.ipynb?name=create-new-storage-container)]
146
+
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_and_cli/2. Enable materialization and backfill feature data.ipynb?name=create-new-storage-container)]
147
147
148
-
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_and_cli/2. Enable materialization and backfill feature data.ipynb?name=set-container-arm-id-cli)]
148
+
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_and_cli/2. Enable materialization and backfill feature data.ipynb?name=set-container-arm-id-cli)]
149
149
150
-
---
150
+
---
151
151
152
152
The second option is to reuse an existing storage container.
153
153
154
-
# [Python SDK](#tab/python)
154
+
# [Python SDK](#tab/python)
155
155
156
-
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/2. Enable materialization and backfill feature data.ipynb?name=use-existing-storage)]
156
+
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/2. Enable materialization and backfill feature data.ipynb?name=use-existing-storage)]
157
157
158
-
# [Azure CLI](#tab/cli)
158
+
# [Azure CLI](#tab/cli)
159
159
160
-
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_and_cli/2. Enable materialization and backfill feature data.ipynb?name=use-existing-storage)]
160
+
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_and_cli/2. Enable materialization and backfill feature data.ipynb?name=use-existing-storage)]
161
161
162
-
---
162
+
---
163
163
164
164
1. Set up a UAI.
165
165
@@ -243,27 +243,27 @@ The following steps grant the blob data reader role access to your user account:
243
243
244
244
After you enable materialization on a feature set, you can perform a backfill, as explained in this tutorial. You can also schedule recurrent materialization jobs. For more information, see [the third tutorial in the series](./tutorial-experiment-train-models-using-features.md).
245
245
246
-
# [Python SDK](#tab/python)
246
+
# [Python SDK](#tab/python)
247
247
248
-
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/2. Enable materialization and backfill feature data.ipynb?name=enable-offline-mat-txns-fset)]
248
+
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/2. Enable materialization and backfill feature data.ipynb?name=enable-offline-mat-txns-fset)]
249
249
250
-
# [Azure CLI](#tab/cli)
250
+
# [Azure CLI](#tab/cli)
251
251
252
-
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_and_cli/2. Enable materialization and backfill feature data.ipynb?name=enable-offline-mat-txns-fset)]
252
+
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_and_cli/2. Enable materialization and backfill feature data.ipynb?name=enable-offline-mat-txns-fset)]
253
253
254
-
---
254
+
---
255
255
256
256
Optionally, you can save the feature set asset as a YAML resource.
257
257
258
-
# [Python SDK](#tab/python)
258
+
# [Python SDK](#tab/python)
259
259
260
-
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/2. Enable materialization and backfill feature data.ipynb?name=dump-txn-fset-yaml)]
260
+
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/2. Enable materialization and backfill feature data.ipynb?name=dump-txn-fset-yaml)]
261
261
262
-
# [Azure CLI](#tab/cli)
262
+
# [Azure CLI](#tab/cli)
263
263
264
-
Not applicable.
264
+
Not applicable.
265
265
266
-
---
266
+
---
267
267
268
268
3. Backfill data for the `transactions` feature set.
Copy file name to clipboardExpand all lines: articles/machine-learning/tutorial-get-started-with-feature-store.md
+16-18Lines changed: 16 additions & 18 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -18,7 +18,7 @@ ms.custom: sdkv2, build-2023
18
18
19
19
This tutorial series shows how features seamlessly integrate all phases of the machine learning (ML) lifecycle: prototyping, training, and operationalization.
20
20
21
-
You can use Azure Machine Learning managed feature store to discover, create, and operationalize features. The ML lifecycle includes a prototyping phase, where you experiment with various features. It also involves an operationalization phase, where models are deployed and inference steps look up feature data. Features serve as the connective tissue in the ML lifecycle. To learn more about basic concepts for feature store, see [What is managed feature store?](./concept-what-is-managed-feature-store.md) and [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md).
21
+
You can use Azure Machine Learning managed feature store to discover, create, and operationalize features. The ML lifecycle includes a prototyping phase, where you experiment with various features. It also involves an operationalization phase, where models are deployed and inference steps look up feature data. Features serve as the connective tissue in the ML lifecycle. To learn more about basic concepts for managed feature store, see [What is managed feature store?](./concept-what-is-managed-feature-store.md) and [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md).
22
22
23
23
This tutorial is the first part of a four-part series. Here, you learn how to:
24
24
@@ -51,9 +51,9 @@ Before you proceed with this tutorial, be sure to cover these prerequisites:
51
51
52
52
This tutorial uses an Azure Machine Learning Spark notebook for development.
53
53
54
-
1. In the Azure Machine Learning studio environment, select **Notebooks** on the left pane, and then select the **Samples** tab.
54
+
1. In the Azure Machine Learning studio environment, select **Notebooks** on the left pane, and then select the **Samples** tab.
55
55
56
-
1. Browse to the *featurestore_sample* directory (*Samples* > *SDK v2* > *sdk* > *python* > *featurestore_sample*), and then select **Clone**.
56
+
1. Browse to the *featurestore_sample* directory (select **Samples** > **SDK v2** > **sdk** > **python** > **featurestore_sample**), and then select **Clone**.
57
57
58
58
:::image type="content" source="media/tutorial-get-started-with-feature-store/clone-featurestore-example-notebooks.png" lightbox="media/tutorial-get-started-with-feature-store/clone-featurestore-example-notebooks.png" alt-text="Screenshot that shows selection of the sample directory in Azure Machine Learning studio.":::
59
59
@@ -64,16 +64,16 @@ This tutorial uses an Azure Machine Learning Spark notebook for development.
64
64
1. To configure the notebook environment, you must upload the *conda.yml* file:
65
65
66
66
1. Select **Notebooks** on the left pane, and then select the **Files** tab.
67
-
1. Browse to the *env* directory (*Users* > *testUser* > *featurestore_sample* > *project* > *env*), and then select the *conda.yml* file. In this path, *testUser* is the user directory.
67
+
1. Browse to the *env* directory (select **Users** > **testUser** > **featurestore_sample** > **project** > **env**), and then select the *conda.yml* file. In this path, *testUser* is the user directory.
68
68
1. Select **Download**.
69
69
70
70
:::image type="content" source="media/tutorial-get-started-with-feature-store/download-conda-file.png" lightbox="media/tutorial-get-started-with-feature-store/download-conda-file.png" alt-text="Screenshot that shows selection of the Conda YAML file in Azure Machine Learning studio.":::
71
71
72
-
1. In the Azure Machine Learning environment, open the notebook, and then select **Configure Session**.
72
+
1. In the Azure Machine Learning environment, open the notebook, and then select **Configure session**.
73
73
74
74
:::image type="content" source="media/tutorial-get-started-with-feature-store/open-configure-session.png" lightbox="media/tutorial-get-started-with-feature-store/open-configure-session.png" alt-text="Screenshot that shows selections for configuring a session for a notebook.":::
75
75
76
-
1. On the **Configure Session** panel, select **Python packages**.
76
+
1. On the **Configure session** panel, select **Python packages**.
77
77
78
78
1. Upload the Conda file:
79
79
1. Select **Upload Conda file**.
@@ -123,19 +123,19 @@ Not applicable.
123
123
124
124
This tutorial uses two SDKs:
125
125
126
-
* Feature store CRUD SDK
126
+
**Feature store CRUD SDK*
127
127
128
128
You use the same `MLClient` (package name `azure-ai-ml`) SDK that you use with the Azure Machine Learning workspace. A feature store is implemented as a type of workspace. As a result, this SDK is used for CRUD operations for feature stores, feature sets, and feature store entities.
129
129
130
-
* Feature store core SDK
130
+
**Feature store core SDK*
131
131
132
132
This SDK (`azureml-featurestore`) is for feature set development and consumption. Later steps in this tutorial describe these operations:
133
133
134
-
* Develop a feature set specification
135
-
* Retrieve feature data
136
-
* List or get a registered feature set
137
-
* Generate and resolve feature retrieval specifications
138
-
* Generate training and inference data by using point-in-time joins
134
+
* Develop a feature set specification.
135
+
* Retrieve feature data.
136
+
* List or get a registered feature set.
137
+
* Generate and resolve feature retrieval specifications.
138
+
* Generate training and inference data by using point-in-time joins.
139
139
140
140
This tutorial doesn't require explicit installation of those SDKs, because the earlier Conda YAML instructions cover this step.
141
141
@@ -218,7 +218,7 @@ In the following steps, you build a feature set named `transactions` that has ro
218
218
*`features`: A list of features and their datatypes. If you provide transformation code (see the "Day 2" section), the code must return a DataFrame that maps to the features and datatypes.
219
219
*`index_columns`: The join keys required to access values from the feature set.
220
220
221
-
To learn more about the specification, see [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md) and the [CLI (v2) feature set YAML schema](./reference-yaml-feature-set.md).
221
+
To learn more about the specification, see [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md) and [CLI (v2) feature set YAML schema](./reference-yaml-feature-set.md).
222
222
223
223
Persisting the feature set specification offers another benefit: the feature set specification can be source controlled.
224
224
@@ -276,7 +276,7 @@ Feature store asset creation and updates can happen only through the SDK and CLI
276
276
277
277
1. Open the [Azure Machine Learning global landing page](https://ml.azure.com/home).
278
278
1. Select **Feature stores** on the left pane.
279
-
1. From this list of accessible feature stores, select the feature store that you created earlier in this tutorial.
279
+
1. From the list of accessible feature stores, select the feature store that you created earlier in this tutorial.
280
280
281
281
## Generate a training data DataFrame by using the registered feature set
282
282
@@ -294,9 +294,7 @@ Feature store asset creation and updates can happen only through the SDK and CLI
294
294
295
295
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/1. Develop a feature set and register with managed feature store.ipynb?name=print-txn-fset-sample-values)]
296
296
297
-
1. Select features, and generate training data.
298
-
299
-
Here, you select the features that become part of the training data. Then, you use the feature store SDK to generate the training data itself.
297
+
1. Select the features that become part of the training data. Then, use the feature store SDK to generate the training data itself.
300
298
301
299
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/1. Develop a feature set and register with managed feature store.ipynb?name=select-features-and-gen-training-data)]
0 commit comments