@@ -18,41 +18,41 @@ This document describes the top level entities in the managed feature store.
:::image type="content" source="media/concept-managed-feature-store/concepts.png" alt-text="Diagram depicting the main components of managed feature store.":::
-For more information on the managed feature store, see [What is managed feature store?](concept-what-is-managed-feature-store.md)
+For more information on the managed feature store, visit the [What is managed feature store?](./concept-what-is-managed-feature-store.md) resource.
## Feature store
-You can create and manage feature sets through a feature store. Feature sets are a collection of features. You can optionally associate a materialization store (offline store connection) with a feature store, to regularly precompute and persist the features. It can make feature retrieval during training or inference faster and more reliable.
+You can create and manage feature sets through a feature store. A feature set is a collection of features. You can optionally associate a materialization store (offline store connection) with a feature store, to precompute and persist the features on a regular basis. This approach can make feature retrieval during training or inference faster and more reliable.

-For more information about the configuration, see [CLI (v2) feature store YAML schema](reference-yaml-feature-store.md)
+For more information about the configuration, visit the [CLI (v2) feature store YAML schema](./reference-yaml-feature-store.md) resource.
## Entities
-Entities encapsulate the index columns for logical entities in an enterprise. Examples of entities include account entity, customer entity, etc. Entities help enforce, as best practice, the use of the same index column definitions across the feature sets that use the same logical entities.
+An entity encapsulates the index columns for logical entities in an enterprise. Examples of entities include account entity, customer entity, etc. Entities help enforce, as a best practice, the use of the same index column definitions across the feature sets that use the same logical entities.

-Entities are typically created once and then reused across feature-sets. Entities are versioned.
+Entities are typically created once, and then reused across featuresets. Entities are versioned.

-For more information about the configuration, see [CLI (v2) feature entity YAML schema](reference-yaml-feature-entity.md)
+For more information about the configuration, visit the [CLI (v2) feature entity YAML schema](reference-yaml-feature-entity.md) resource.
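
The reuse of one versioned entity across feature sets can be sketched in plain Python. This is only an illustration of the idea: `Entity`, `FeatureSetRef`, and the `accountID` column are hypothetical names chosen for this example, and none of them is the managed feature store SDK.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class Entity:
    """Hypothetical stand-in for a versioned entity (not the SDK class)."""
    name: str
    version: str
    index_columns: Tuple[str, ...]


@dataclass(frozen=True)
class FeatureSetRef:
    """Hypothetical feature set that takes its index columns from entities."""
    name: str
    entities: Tuple[Entity, ...]

    def index_columns(self) -> Tuple[str, ...]:
        # Index columns are not redefined here; they come from the entities.
        return tuple(col for e in self.entities for col in e.index_columns)


# Define the entity once ...
account = Entity(name="account", version="1", index_columns=("accountID",))

# ... and reuse it across feature sets.
transactions = FeatureSetRef(name="transactions", entities=(account,))
accounts = FeatureSetRef(name="accounts", entities=(account,))

# Both feature sets now share the same index column definition.
same_index = transactions.index_columns() == accounts.index_columns()
```

Because both feature sets point at the same entity object, a change to the index column definition would be made in one place, as a new entity version, rather than in every feature set.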
## Feature set specification and asset
-Feature sets are a collection of features generated by applying transformations on source system data. Feature sets encapsulate a source, the transformation function, and the materialization settings. We currently support PySpark feature transformation code.
+A feature set is a collection of features generated by the application of a transformation on source system data. Feature sets encapsulate a source, the transformation function, and the materialization settings. We currently support PySpark feature transformation code.

-Start by creating a feature set specification. A feature set specification is a self-contained definition of a feature set that you can locally develop and test.
+First, create a feature set specification. A feature set specification is a self-contained definition of a feature set that you can locally develop and test.

-A feature set specification typically consists of the following parameters:
+A feature set specification typically consists of these parameters:
- `source`: What source(s) does this feature map to
- `transformation` (optional): The transformation logic, applied to the source data, to create features. In our case, we use Spark as the supported compute.
- Names of the columns that represent the `index_columns` and the `timestamp_column`: These names are required when users try to join feature data with observation data (more about this later)
-- `materialization_settings`(optional): Required, to cache the feature values in a materialization store for efficient retrieval.
+- `materialization_settings`(optional): Required if you want to cache the feature values in a materialization store for efficient retrieval.
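
The parameters listed above can be pictured as fields of a single object. The following is a minimal plain-Python sketch, not the SDK type: `FeatureSetSpecSketch`, `add_spend_flag`, the placeholder source path, and the `offline_enabled` key are all assumptions made for illustration, and the transformation is a plain-Python stand-in for the PySpark code the feature store supports.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List, Optional

Row = Dict[str, Any]


@dataclass
class FeatureSetSpecSketch:
    """Hypothetical mirror of the parameters listed above (not the SDK type)."""
    source: str                        # where the raw data lives
    index_columns: List[str]           # keys used to join with observation data
    timestamp_column: str              # event time of each record
    transformation: Optional[Callable[[List[Row]], List[Row]]] = None
    materialization_settings: Optional[Dict[str, Any]] = None

    def to_features(self, rows: List[Row]) -> List[Row]:
        # Without a transformation, the source columns are used as they are.
        return self.transformation(rows) if self.transformation else rows


def add_spend_flag(rows: List[Row]) -> List[Row]:
    # Plain-Python stand-in for PySpark transformation code.
    return [{**r, "is_large_spend": r["amount"] > 100} for r in rows]


spec = FeatureSetSpecSketch(
    source="path/to/transactions",  # placeholder, not a real location
    index_columns=["accountID"],
    timestamp_column="timestamp",
    transformation=add_spend_flag,
    materialization_settings={"offline_enabled": True},  # hypothetical key
)

# Develop and test the spec locally on a small sample.
sample = [{"accountID": "A1", "timestamp": "2023-01-01", "amount": 250}]
features = spec.to_features(sample)
```

Keeping `transformation` and `materialization_settings` optional mirrors the list above: a spec with only a source, index columns, and a timestamp column is still a complete definition.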
-After development and testing the feature set spec in your local/dev environment, you can register the spec as a feature set asset with the feature store. The feature set asset provides managed capabilities, such as versioning and materialization.
+After you develop and test the feature set spec in your local/dev environment, you can register the spec as a feature set asset with the feature store. The feature set asset provides managed capabilities, for example versioning and materialization.

-For more information about the feature set YAML specification, see [CLI (v2) feature set specification YAML schema](reference-yaml-featureset-spec.md)
+For more information about the feature set YAML specification, visit the [CLI (v2) feature set specification YAML schema](reference-yaml-featureset-spec.md) resource.
## Feature retrieval specification
-A feature retrieval specification is a portable definition of a feature list associated with a model. It can help streamline machine learning model development and operationalization. A feature retrieval specification is typically an input to the training pipeline. It helps generate the training data. It can be packaged with the model. Additionally, inference step uses it to look up the features. It integrates all phases of the machine learning lifecycle. Changes to your training and inference pipeline can be minimized as you experiment and deploy.
+A feature retrieval specification is a portable definition of a feature list that is associated with a model. It can help streamline machine learning model development and operationalization. A feature retrieval specification is typically an input to the training pipeline. It helps generate the training data. You can package it with the model. Additionally, the inference step uses it to look up the features. It integrates all phases of the machine learning lifecycle. Changes to your training and inference pipeline can be minimized as you experiment and deploy.
Use of a feature retrieval specification and the built-in feature retrieval component are optional. You can directly use the `get_offline_features()` API if you want.
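
The way one portable feature list serves both training and inference can be sketched in plain Python. The names here (`FeatureRef`, `FeatureRetrievalSpecSketch`, `select_features`, and the feature names) are hypothetical and exist only for this illustration; the sketch does not call `get_offline_features()` or any other SDK API.

```python
from dataclasses import dataclass
from typing import Any, Dict, List, Tuple


@dataclass(frozen=True)
class FeatureRef:
    """Hypothetical pointer to one feature in a versioned feature set."""
    feature_set: str
    version: str
    feature_name: str


@dataclass(frozen=True)
class FeatureRetrievalSpecSketch:
    """Hypothetical portable feature list that travels with a model."""
    features: Tuple[FeatureRef, ...]

    def column_names(self) -> List[str]:
        return [f.feature_name for f in self.features]


def select_features(spec: FeatureRetrievalSpecSketch, row: Dict[str, Any]) -> Dict[str, Any]:
    # Training and inference both call this with the same spec,
    # so they always see the same features in the same order.
    return {name: row[name] for name in spec.column_names()}


retrieval_spec = FeatureRetrievalSpecSketch(
    features=(
        FeatureRef("transactions", "1", "spend_7d_sum"),
        FeatureRef("accounts", "1", "account_age_days"),
    )
)

# Training step: build a training record from joined feature data.
training_row = select_features(
    retrieval_spec,
    {"spend_7d_sum": 120.0, "account_age_days": 400, "unused_column": "x"},
)

# Inference step: look up the same features through the same spec.
inference_row = select_features(
    retrieval_spec,
    {"spend_7d_sum": 15.5, "account_age_days": 12},
)
```

Because the feature list lives in one object, swapping a feature during experimentation changes the spec only, while the training and inference code that consumes it stays the same.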
-For more information about the feature retrieval YAML specification, see [CLI (v2) feature retrieval specification YAML schema](reference-yaml-feature-retrieval-spec.md).
+For more information about the feature retrieval YAML specification, visit the [CLI (v2) feature retrieval specification YAML schema](reference-yaml-feature-retrieval-spec.md) resource.