You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
#Customer intent: As a professional data scientist, I want to know how to build and deploy a model with Azure Machine Learning by using Python in a Jupyter Notebook.
This tutorial series shows how features seamlessly integrate all phases of the machine learning lifecycle: prototyping, training, and operationalization.
20
20
21
-
You can use Azure Machine Learning managed feature store to discover, create, and operationalize features. The machine learning lifecycle includes a prototyping phase, where you experiment with various features. It also involves an operationalization phase, where models are deployed and inference steps look up feature data. Features serve as the connective tissue in the machine learning lifecycle. To learn more about basic concepts for managed feature store, see [What is managed feature store?](./concept-what-is-managed-feature-store.md) and [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md).
21
+
You can use Azure Machine Learning managed feature store to discover, create, and operationalize features. The machine learning lifecycle includes a prototyping phase, where you experiment with various features. It also involves an operationalization phase, where models are deployed and inference steps look up feature data. Features serve as the connective tissue in the machine learning lifecycle. To learn more about basic concepts for managed feature store, visit the [What is managed feature store?](./concept-what-is-managed-feature-store.md) and [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md) resources.
22
22
23
23
This tutorial describes how to create a feature set specification with custom transformations. It then uses that feature set to generate training data, enable materialization, and perform a backfill. Materialization computes the feature values for a feature window, and then stores those values in a materialization store. All feature queries can then use those values from the materialization store.
24
24
@@ -43,9 +43,9 @@ This tutorial series has two tracks:
43
43
44
44
Before you proceed with this tutorial, be sure to cover these prerequisites:
45
45
46
-
* An Azure Machine Learning workspace. For more information about workspace creation, see[Quickstart: Create workspace resources](./quickstart-create-resources.md).
46
+
* An Azure Machine Learning workspace. For more information about workspace creation, visit[Quickstart: Create workspace resources](./quickstart-create-resources.md).
47
47
48
-
* On your user account, the Owner role for the resource group where the feature store is created.
48
+
* On your user account, you need the Owner role for the resource group where the feature store is created.
49
49
50
50
If you choose to use a new resource group for this tutorial, you can easily delete all the resources by deleting the resource group.
51
51
@@ -271,17 +271,17 @@ This tutorial doesn't need explicit installation of these resources, because the
271
271
print(fs_poller.result())
272
272
```
273
273
274
-
2. Initialize a feature store core SDK client for Azure Machine Learning.
274
+
1. Initialize a feature store core SDK client for Azure Machine Learning.
275
275
276
276
As explained earlier in this tutorial, the feature store core SDK client is used to develop and consume features.
3. Grant the "Azure Machine Learning Data Scientist" role on the feature store to your user identity. Obtain your Microsoft Entra object ID value from the Azure portal, as described in [Find the user object ID](/partner-center/find-ids-and-domain-names#find-the-user-object-id).
280
+
1. Grant the "Azure Machine Learning Data Scientist" role on the feature store to your user identity. Obtain your Microsoft Entra object ID value from the Azure portal, as described in [Find the user object ID](/partner-center/find-ids-and-domain-names#find-the-user-object-id).
281
281
282
282
Assign the **AzureML Data Scientist** role to your user identity, so that it can create resources in feature store workspace. The permissions might need some time to propagate.
283
283
284
-
For more information more about access control, see [Manage access control for managed feature store](./how-to-setup-access-control-feature-store.md).
284
+
For more information more about access control, visit the [Manage access control for managed feature store](./how-to-setup-access-control-feature-store.md) resource.
@@ -300,45 +300,45 @@ In these steps, you build a feature set named `transactions` that has rolling wi
300
300
A feature set specification is a self-contained definition of a feature set that you can locally develop and test. Here, you create these rolling window aggregate features:
301
301
302
302
*`transactions three-day count`
303
-
*`transactions amount three-day sum`
304
303
*`transactions amount three-day avg`
304
+
*`transactions amount three-day sum`
305
305
*`transactions seven-day count`
306
-
*`transactions amount seven-day sum`
307
306
*`transactions amount seven-day avg`
307
+
*`transactions amount seven-day sum`
308
308
309
309
Review the feature transformation code file: *featurestore/featuresets/transactions/transformation_code/transaction_transform.py*. Note the rolling aggregation defined for the features. This is a Spark transformer.
310
310
311
-
To learn more about the feature set and transformations, see [What is managed feature store?](./concept-what-is-managed-feature-store.md).
311
+
To learn more about the feature set and transformations, visit the [What is managed feature store?](./concept-what-is-managed-feature-store.md) resource.
To register the feature set specification with the feature store, you must save that specification in a specific format.
318
318
319
-
Review the generated `transactions` feature set specification. Open this file from the file tree to see the specification: *featurestore/featuresets/accounts/spec/FeaturesetSpec.yaml*.
319
+
Review the generated `transactions` feature set specification. Open this file from the file tree to see the *featurestore/featuresets/accounts/spec/FeaturesetSpec.yaml* specification.
320
320
321
321
The specification contains these elements:
322
322
323
-
*`source`: A reference to a storage resource. In this case, it's a Parquet file in a blob storage resource.
323
+
*`source`: A reference to a storage resource. In this case, it's a parquet file in a blob storage resource.
324
324
*`features`: A list of features and their datatypes. If you provide transformation code, the code must return a DataFrame that maps to the features and datatypes.
325
325
*`index_columns`: The join keys required to access values from the feature set.
326
326
327
-
To learn more about the specification, see [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md) and [CLI (v2) feature set YAML schema](./reference-yaml-feature-set.md).
327
+
To learn more about the specification, visit the [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md) and [CLI (v2) feature set YAML schema](./reference-yaml-feature-set.md) resources.
328
328
329
-
Persisting the feature set specification offers another benefit: the feature set specification can be source controlled.
329
+
Persisting the feature set specification offers another benefit: the feature set specification supports source control.
As a best practice, entities help enforce use of the same join key definition across feature sets that use the same logical entities. Examples of entities include accounts and customers. Entities are typically created once and then reused across feature sets. To learn more, see[Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md).
335
+
As a best practice, entities help enforce use of the same join key definition, across feature sets that use the same logical entities. Examples of entities include accounts and customers. Entities are typically created once and then reused across feature sets. To learn more, visit the[Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md).
336
336
337
337
### [SDK track](#tab/SDK-track)
338
338
339
339
1. Initialize the feature store CRUD client.
340
340
341
-
As explained earlier in this tutorial, `MLClient` is used for creating, reading, updating, and deleting a feature store asset. The notebook code cell sample shown here searches for the feature store that you created in an earlier step. Here, you can't reuse the same `ml_client` value that you used earlier in this tutorial, because it's scoped at the resource group level. Proper scoping is a prerequisite for feature store creation.
341
+
As explained earlier in this tutorial, `MLClient` is used to create, read, update, and delete a feature store asset. The notebook code cell sample shown here searches for the feature store that you created in an earlier step. Here, you can't reuse the same `ml_client` value that you used earlier in this tutorial, because that value is scoped at the resource group level. Proper scoping is a prerequisite for feature store creation.
342
342
343
343
In this code sample, the client is scoped at feature store level.
344
344
@@ -392,7 +392,7 @@ The Storage Blob Data Reader role must be assigned to your user account on the o
392
392
393
393
:::image type="content" source="media/tutorial-get-started-with-feature-store/offline-store-information.png" lightbox="media/tutorial-get-started-with-feature-store/offline-store-information.png" alt-text="Screenshot that shows offline store account information on feature store Overview page.":::
394
394
395
-
For more information about access control, see [Manage access control for managed feature store](./how-to-setup-access-control-feature-store.md).
395
+
For more information about access control, visit the [Manage access control for managed feature store](./how-to-setup-access-control-feature-store.md) resource.
396
396
397
397
Execute this code cell for role assignment. The permissions might need some time to propagate.
398
398
@@ -404,7 +404,7 @@ The Storage Blob Data Reader role must be assigned to your user account on the o
404
404
405
405
:::image type="content" source="media/tutorial-get-started-with-feature-store/offline-store-information.png" lightbox="media/tutorial-get-started-with-feature-store/offline-store-information.png" alt-text="Screenshot that shows offline store account information on feature store Overview page.":::
406
406
407
-
For more information about access control, see [Manage access control for managed feature store](./how-to-setup-access-control-feature-store.md).
407
+
For more information about access control, visit the [Manage access control for managed feature store](./how-to-setup-access-control-feature-store.md) resource.
408
408
409
409
Execute this code cell for role assignment. The permissions might need some time to propagate.
410
410
@@ -436,7 +436,7 @@ The Storage Blob Data Reader role must be assigned to your user account on the o
436
436
437
437
## Enable offline materialization on the `transactions` feature set
438
438
439
-
After feature set materialization is enabled, you can perform a backfill. You can also schedule recurrent materialization jobs. For more information, see [the third tutorial in the series](./tutorial-enable-recurrent-materialization-run-batch-inference.md).
439
+
After feature set materialization is enabled, you can perform a backfill. You can also schedule recurrent materialization jobs. For more information, visit the [the third tutorial in the series](./tutorial-enable-recurrent-materialization-run-batch-inference.md) resource.
0 commit comments