Skip to content

Commit bf7e85d

Browse files
committed
Freshness update for tutorial-get-started-with-feature-store.md . . .
1 parent 0b14d2f commit bf7e85d

File tree

1 file changed

+19
-19
lines changed

1 file changed

+19
-19
lines changed

articles/machine-learning/tutorial-get-started-with-feature-store.md

Lines changed: 19 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.subservice: core
88
ms.topic: tutorial
99
author: fbsolo-ms1
1010
ms.author: franksolomon
11-
ms.date: 11/28/2023
11+
ms.date: 11/21/2024
1212
ms.reviewer: seramasu
1313
ms.custom: sdkv2, build-2023, ignite-2023, update-code2
1414
#Customer intent: As a professional data scientist, I want to know how to build and deploy a model with Azure Machine Learning by using Python in a Jupyter Notebook.
@@ -18,7 +18,7 @@ ms.custom: sdkv2, build-2023, ignite-2023, update-code2
1818

1919
This tutorial series shows how features seamlessly integrate all phases of the machine learning lifecycle: prototyping, training, and operationalization.
2020

21-
You can use Azure Machine Learning managed feature store to discover, create, and operationalize features. The machine learning lifecycle includes a prototyping phase, where you experiment with various features. It also involves an operationalization phase, where models are deployed and inference steps look up feature data. Features serve as the connective tissue in the machine learning lifecycle. To learn more about basic concepts for managed feature store, see [What is managed feature store?](./concept-what-is-managed-feature-store.md) and [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md).
21+
You can use Azure Machine Learning managed feature store to discover, create, and operationalize features. The machine learning lifecycle includes a prototyping phase, where you experiment with various features. It also involves an operationalization phase, where models are deployed and inference steps look up feature data. Features serve as the connective tissue in the machine learning lifecycle. To learn more about basic concepts for managed feature store, visit the [What is managed feature store?](./concept-what-is-managed-feature-store.md) and [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md) resources.
2222

2323
This tutorial describes how to create a feature set specification with custom transformations. It then uses that feature set to generate training data, enable materialization, and perform a backfill. Materialization computes the feature values for a feature window, and then stores those values in a materialization store. All feature queries can then use those values from the materialization store.
2424

@@ -43,9 +43,9 @@ This tutorial series has two tracks:
4343

4444
Before you proceed with this tutorial, be sure to cover these prerequisites:
4545

46-
* An Azure Machine Learning workspace. For more information about workspace creation, see [Quickstart: Create workspace resources](./quickstart-create-resources.md).
46+
* An Azure Machine Learning workspace. For more information about workspace creation, visit [Quickstart: Create workspace resources](./quickstart-create-resources.md).
4747

48-
* On your user account, the Owner role for the resource group where the feature store is created.
48+
* On your user account, you need the Owner role for the resource group where the feature store is created.
4949

5050
If you choose to use a new resource group for this tutorial, you can easily delete all the resources by deleting the resource group.
5151

@@ -271,17 +271,17 @@ This tutorial doesn't need explicit installation of these resources, because the
271271
print(fs_poller.result())
272272
```
273273

274-
2. Initialize a feature store core SDK client for Azure Machine Learning.
274+
1. Initialize a feature store core SDK client for Azure Machine Learning.
275275

276276
As explained earlier in this tutorial, the feature store core SDK client is used to develop and consume features.
277277

278278
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/1.Develop-feature-set-and-register.ipynb?name=init-fs-core-sdk)]
279279

280-
3. Grant the "Azure Machine Learning Data Scientist" role on the feature store to your user identity. Obtain your Microsoft Entra object ID value from the Azure portal, as described in [Find the user object ID](/partner-center/find-ids-and-domain-names#find-the-user-object-id).
280+
1. Grant the "Azure Machine Learning Data Scientist" role on the feature store to your user identity. Obtain your Microsoft Entra object ID value from the Azure portal, as described in [Find the user object ID](/partner-center/find-ids-and-domain-names#find-the-user-object-id).
281281

282282
Assign the **AzureML Data Scientist** role to your user identity, so that it can create resources in feature store workspace. The permissions might need some time to propagate.
283283

284-
For more information more about access control, see [Manage access control for managed feature store](./how-to-setup-access-control-feature-store.md).
284+
For more information more about access control, visit the [Manage access control for managed feature store](./how-to-setup-access-control-feature-store.md) resource.
285285

286286
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_and_cli/1.Develop-feature-set-and-register.ipynb?name=assign-aad-ds-role-cli)]
287287

@@ -300,45 +300,45 @@ In these steps, you build a feature set named `transactions` that has rolling wi
300300
A feature set specification is a self-contained definition of a feature set that you can locally develop and test. Here, you create these rolling window aggregate features:
301301

302302
* `transactions three-day count`
303-
* `transactions amount three-day sum`
304303
* `transactions amount three-day avg`
304+
* `transactions amount three-day sum`
305305
* `transactions seven-day count`
306-
* `transactions amount seven-day sum`
307306
* `transactions amount seven-day avg`
307+
* `transactions amount seven-day sum`
308308

309309
Review the feature transformation code file: *featurestore/featuresets/transactions/transformation_code/transaction_transform.py*. Note the rolling aggregation defined for the features. This is a Spark transformer.
310310

311-
To learn more about the feature set and transformations, see [What is managed feature store?](./concept-what-is-managed-feature-store.md).
311+
To learn more about the feature set and transformations, visit the [What is managed feature store?](./concept-what-is-managed-feature-store.md) resource.
312312

313313
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/1.Develop-feature-set-and-register.ipynb?name=develop-txn-fset-locally)]
314314

315315
1. Export as a feature set specification.
316316

317317
To register the feature set specification with the feature store, you must save that specification in a specific format.
318318

319-
Review the generated `transactions` feature set specification. Open this file from the file tree to see the specification: *featurestore/featuresets/accounts/spec/FeaturesetSpec.yaml*.
319+
Review the generated `transactions` feature set specification. Open this file from the file tree to see the *featurestore/featuresets/accounts/spec/FeaturesetSpec.yaml* specification.
320320

321321
The specification contains these elements:
322322

323-
* `source`: A reference to a storage resource. In this case, it's a Parquet file in a blob storage resource.
323+
* `source`: A reference to a storage resource. In this case, it's a parquet file in a blob storage resource.
324324
* `features`: A list of features and their datatypes. If you provide transformation code, the code must return a DataFrame that maps to the features and datatypes.
325325
* `index_columns`: The join keys required to access values from the feature set.
326326

327-
To learn more about the specification, see [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md) and [CLI (v2) feature set YAML schema](./reference-yaml-feature-set.md).
327+
To learn more about the specification, visit the [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md) and [CLI (v2) feature set YAML schema](./reference-yaml-feature-set.md) resources.
328328

329-
Persisting the feature set specification offers another benefit: the feature set specification can be source controlled.
329+
Persisting the feature set specification offers another benefit: the feature set specification supports source control.
330330

331331
[!notebook-python[] (~/azureml-examples-main/sdk/python/featurestore_sample/notebooks/sdk_only/1.Develop-feature-set-and-register.ipynb?name=dump-transactions-fs-spec)]
332332

333333
## Register a feature store entity
334334

335-
As a best practice, entities help enforce use of the same join key definition across feature sets that use the same logical entities. Examples of entities include accounts and customers. Entities are typically created once and then reused across feature sets. To learn more, see [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md).
335+
As a best practice, entities help enforce use of the same join key definition, across feature sets that use the same logical entities. Examples of entities include accounts and customers. Entities are typically created once and then reused across feature sets. To learn more, visit the [Understanding top-level entities in managed feature store](./concept-top-level-entities-in-managed-feature-store.md).
336336

337337
### [SDK track](#tab/SDK-track)
338338

339339
1. Initialize the feature store CRUD client.
340340

341-
As explained earlier in this tutorial, `MLClient` is used for creating, reading, updating, and deleting a feature store asset. The notebook code cell sample shown here searches for the feature store that you created in an earlier step. Here, you can't reuse the same `ml_client` value that you used earlier in this tutorial, because it's scoped at the resource group level. Proper scoping is a prerequisite for feature store creation.
341+
As explained earlier in this tutorial, `MLClient` is used to create, read, update, and delete a feature store asset. The notebook code cell sample shown here searches for the feature store that you created in an earlier step. Here, you can't reuse the same `ml_client` value that you used earlier in this tutorial, because that value is scoped at the resource group level. Proper scoping is a prerequisite for feature store creation.
342342

343343
In this code sample, the client is scoped at feature store level.
344344

@@ -392,7 +392,7 @@ The Storage Blob Data Reader role must be assigned to your user account on the o
392392

393393
:::image type="content" source="media/tutorial-get-started-with-feature-store/offline-store-information.png" lightbox="media/tutorial-get-started-with-feature-store/offline-store-information.png" alt-text="Screenshot that shows offline store account information on feature store Overview page.":::
394394

395-
For more information about access control, see [Manage access control for managed feature store](./how-to-setup-access-control-feature-store.md).
395+
For more information about access control, visit the [Manage access control for managed feature store](./how-to-setup-access-control-feature-store.md) resource.
396396

397397
Execute this code cell for role assignment. The permissions might need some time to propagate.
398398

@@ -404,7 +404,7 @@ The Storage Blob Data Reader role must be assigned to your user account on the o
404404

405405
:::image type="content" source="media/tutorial-get-started-with-feature-store/offline-store-information.png" lightbox="media/tutorial-get-started-with-feature-store/offline-store-information.png" alt-text="Screenshot that shows offline store account information on feature store Overview page.":::
406406

407-
For more information about access control, see [Manage access control for managed feature store](./how-to-setup-access-control-feature-store.md).
407+
For more information about access control, visit the [Manage access control for managed feature store](./how-to-setup-access-control-feature-store.md) resource.
408408

409409
Execute this code cell for role assignment. The permissions might need some time to propagate.
410410

@@ -436,7 +436,7 @@ The Storage Blob Data Reader role must be assigned to your user account on the o
436436

437437
## Enable offline materialization on the `transactions` feature set
438438

439-
After feature set materialization is enabled, you can perform a backfill. You can also schedule recurrent materialization jobs. For more information, see [the third tutorial in the series](./tutorial-enable-recurrent-materialization-run-batch-inference.md).
439+
After feature set materialization is enabled, you can perform a backfill. You can also schedule recurrent materialization jobs. For more information, visit the [the third tutorial in the series](./tutorial-enable-recurrent-materialization-run-batch-inference.md) resource.
440440

441441
### [SDK track](#tab/SDK-track)
442442

0 commit comments

Comments
 (0)