Skip to content

Commit 33b3c54

Browse files
authored
odh-23178-2 feature store SME review comments (#889)
* odh-23178-2 feature store SME review comments * odh-23178-2 address PR and code rabbit comments
1 parent ab9c986 commit 33b3c54

13 files changed

+46
-47
lines changed

assemblies/defining-ml-features.adoc

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,12 +2,16 @@
22

33
ifdef::context[:parent-context: {context}]
44

5+
:context: featurestore-defining
6+
57
[id="defining-ml-features_{context}"]
6-
= Appendix A: Defining machine learning features
8+
= Defining machine learning features
79

810
[role='_abstract']
911
As part of the Feature Store workflow, ML engineers are responsible for identifying data sources and defining features of interest.
1012

13+
include::modules/setting-up-your-working-environment.adoc[leveloffset=+1]
14+
1115
include::modules/about-feature-definitions.adoc[leveloffset=+1]
1216

1317
include::modules/specifying-the-data-source-for-features.adoc[leveloffset=+1]
@@ -16,6 +20,7 @@ include::modules/about-organizing-features-by-using-entities.adoc[leveloffset=+1
1620

1721
include::modules/creating-feature-views.adoc[leveloffset=+1]
1822

23+
include::modules/making-features-available-for-real-time-inference.adoc[leveloffset=+1]
1924

2025
ifdef::parent-context[:context: {parent-context}]
2126
ifndef::parent-context[:!context:]

assemblies/overview-of-ml-features-and-feature-store.adoc

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ For more information about the support scope of {org-name} Technology Preview fe
2424
====
2525
endif::[]
2626

27-
A machine learning (ML) feature is a measurable property or attribute within a dataset that a machine learning model can analyze to learn patterns and make decisions. Examples of features include a customer's purchase history, demographic data like age and location, weather conditions, and financial market data. You can use these features to train models for tasks such as personalized product recommendations, fraud detection, and predictive maintenance.
27+
A machine learning (ML) feature is a measurable property or attribute within a data set that a machine learning model can analyze to learn patterns and make decisions. Examples of features include a customer's purchase history, demographic data like age and location, weather conditions, and financial market data. You can use these features to train models for tasks such as personalized product recommendations, fraud detection, and predictive maintenance.
2828

2929
Feature Store is a {productname-long} component that provides a centralized repository that stores, manages, and serves machine learning features for both training and inference purposes.
3030

@@ -41,12 +41,13 @@ include::modules/feature-store-workflow.adoc[leveloffset=+1]
4141
* For example Feature Store CRD configurations, see the
4242
link:https://github.com/feast-dev/feast/tree/master/infra/feast-operator/config/samples[Feast Operator configuration samples^].
4343

44-
* For details about the Feast API, see the
44+
* For details about the Feast CRD APIs, see the
4545
link:https://github.com/feast-dev/feast/blob/stable/infra/feast-operator/docs/api/markdown/ref.md[Feast API documentation^].
4646

4747
* For information on how to implement machine learning features, see the
4848
link:https://docs.feast.dev/getting-started/concepts[Feast documentation^].
4949

50+
* For end-to-end use case examples of how Feature Store can benefit your AI/ML workflows, see https://docs.feast.dev/getting-started/use-cases[Feast Getting Started: Use Cases].
5051

5152
ifdef::parent-context[:context: {parent-context}]
5253
ifndef::parent-context[:!context:]

assemblies/setting-up-feature-store.adoc

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ As a cluster administrator, you must complete the following tasks to set up Feat
1010
. Enable the Feature Store component.
1111
. Create a data science project and add a Feature Store instance.
1212
. Initialize the Feature Store instance.
13-
. Make features available to data scientists for them to use for model inference.
13+
. Make features available to data scientists for model training and inference.
1414

1515
include::modules/before-you-begin.adoc[leveloffset=+1]
1616

@@ -22,8 +22,6 @@ include::modules/adding-feature-definitions-and-initializing-your-feature-store-
2222

2323
include::modules/specifying-files-to-ignore.adoc[leveloffset=+2]
2424

25-
include::modules/making-features-available-for-real-time-inference.adoc[leveloffset=+1]
26-
2725
include::modules/viewing-feature-store-objects-in-the-web-based-ui.adoc[leveloffset=+1]
2826

2927

modules/about-feature-definitions.adoc

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -3,9 +3,9 @@
33
[id='about-feature-definitions_{context}']
44
= About feature definitions
55

6-
A machine learning feature is a measurable property or attribute within a dataset that a machine learning model can analyze to learn patterns and make decisions. In Feature Store, you define a feature by defining the name and data type of an attribute.
6+
A machine learning feature is a measurable property or _field_ within a data set that a machine learning model can analyze to learn patterns and make decisions. In Feature Store, you define a feature by defining the name and data type of a field.
77

8-
A feature definition is a schema that includes the name and data type of an attribute, as shown in the following example:
8+
A feature definition is a schema that includes the field name and data type, as shown in the following example:
99

1010
[source,python]
1111
----
@@ -20,7 +20,7 @@ credit_card_amount_due = Field(
2020

2121
For a list of supported data types for fields in Feature Store, see the link:https://rtd.feast.dev/en/master/feast.html#module-feast.types[`feast.types` module] in the Feast documentation.
2222

23-
In addition to field name and data type, a feature view can include additional metadata, specified as descriptions of features, as shown in the following example:
23+
In addition to field name and data type, a feature definition can include additional metadata, specified as descriptions of features, as shown in the following example:
2424

2525
[source,python]
2626
----

modules/about-organizing-features-by-using-entities.adoc

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@ customer = Entity(name='dob_ssn', join_keys=['dob_ssn'])
1414

1515
The entity name uniquely identifies the entity. The join key identifies the physical primary key on which feature values are joined together for feature retrieval.
1616

17-
The following table shows example data with a single entity column (`dob_ssn`) and two feature columns (`credit_card_due` and `rating`).
17+
The following table shows example data with a single entity column (`dob_ssn`) and two feature columns (`credit_card_due` and `bankruptcies`).
1818

1919
.Example data showing an entity and features
2020
[cols="1,3,2,3,2"]

modules/adding-feature-definitions-and-initializing-your-feature-store-instance.adoc

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,8 @@ When you initialize the Feature Store instance, Feature Store completes the foll
1616

1717
* Syncs the metadata about objects to the feature registry. If a registry does not exist, Feature Store creates one. The default registry is a simple Protobuf binary file on disk (locally or in an object store).
1818

19-
* Creates or updates all necessary Feature Store infrastructure. The exact infrastructure that Feature Store creates depends on the provider configuration that you have set in `feature_store`.yaml. For example, setting `local` as your provider results in the creation of a sqlite online store.
19+
* Creates or updates all necessary Feature Store infrastructure. The exact infrastructure that Feature Store creates depends on the provider configuration that you have set in `feature_store.yaml`.
20+
For example, when you specify `local` as your provider, Feature Store creates the infrastructure on the local cluster.
2021
+
2122
*Note:* When you use a cloud provider, such as Google Cloud Platform or Amazon Web Service, the `feast apply` command creates cloud infrastructure that might incur costs for your organization.
2223

@@ -37,8 +38,8 @@ When you initialize the Feature Store instance, Feature Store completes the foll
3738
. Click the `feast` pod and then select *Pod details*.
3839
. Scroll down to see the online container. This container is the deployment for the online server, and it makes the feature server REST API available in the OpenShift cluster.
3940
. Scroll up and then click *Terminal*.
40-
. Copy the feature definition (`.py`) file to your Feature Store namespace.
41-
. To create a feature registry, set up data connections, and add the feature definitions to the registry, run the following command:
41+
. Copy the feature definition (`.py`) file to your Feature Store directory.
42+
. To create a feature registry and add the feature definitions to the registry, run the following command:
4243
+
4344
----
4445
feast apply

modules/creating-a-feature-store-instance-in-a-data-science-project.adoc

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@
66
[role='_abstract']
77
You can add an instance of Feature Store to a data science project by creating a custom resource definition (CRD) in the OpenShift console.
88

9-
The following example shows the minimum requirements for a Feature Store CRD YAML file:
9+
The following example shows the minimum requirements for a Feature Store CR YAML file:
1010

1111
[.lines_space]
1212
[.console-input]
@@ -28,7 +28,7 @@ spec:
2828

2929
* You have set up your database infrastructure for the online store, offline store, and registry.
3030
+
31-
For an example of setting up and running PostgreSQL (for the registry) and Redis (for the online store), see the Featire Store Operator quick start example: https://github.com/feast-dev/feast/tree/stable/examples/operator-quickstart.
31+
For an example of setting up and running PostgreSQL (for the registry) and Redis (for the online store), see the Feature Store Operator quick start example: https://github.com/feast-dev/feast/tree/stable/examples/operator-quickstart.
3232

3333

3434
ifndef::upstream[]
@@ -70,7 +70,7 @@ The `spec.feastProjectDir` references a Feature Store project that is in the Git
7070

7171
. Click *Create*.
7272

73-
When you create the Feature Store CRD file in OpenShift, Feature Store starts a remote online feature server, and configures a default registry and an offline store with the local provider.
73+
When you create the Feature Store CR in OpenShift, Feature Store starts a remote online feature server, and configures a default registry and an offline store with the local provider.
7474

7575
A _provider_ is a customizable interface that provides default Feature Store components, such as the registry, offline store, and online store, that target a specific environment, ensuring that these components can work together seamlessly. The local provider uses the following default settings:
7676

@@ -158,6 +158,6 @@ The `feature_store.yaml` file defines the following components:
158158

159159
.Next steps
160160

161-
* Optionally, you can customize the default configurations for the offline store, online store, or registry by editing the YAML configuration for the Feature Store CRD, as described in _Customizing your feature store configuration_.
161+
* Optionally, you can customize the default configurations for the offline store, online store, or registry by editing the YAML configuration for the Feature Store CR, as described in _Customizing your feature store configuration_.
162162

163163
* Give your data scientists access to the data science project so that they can create a workbench. and provide them with a copy of the `feature_store.yaml` file so that they can add it to their workbench IDE, such as Jupyter.

modules/feature-store-workflow.adoc

Lines changed: 12 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -7,30 +7,29 @@ The Feature Store workflow involves the following tasks for ML engineers, OpenSh
77

88
*Note:* This Feature Store workflow describes a local implementation. A production client-server architecture with full authorization and role assignments is planned for a future release.
99

10-
11-
*ML Engineer*
12-
13-
* Creates a feature definition file.
14-
* Defines the data sources and other feature store objects
15-
16-
For more information, see _Appendix A: Defining features_.
17-
18-
1910
*Cluster administrator*
2011

2112
. Installs OpenShift AI.
2213
. Enables the Feature Store component by using the Feature Store operator.
2314
. Creates a data science project.
2415
. In the data science project, creates a Feature Store instance by using a `feast.yaml` file that specifies the offline and online stores.
25-
. Sets up Feature Store, making features available to data scientists to use for model inference
16+
. Sets up Feature Store so that ML Engineers and data scientists can push and retrieve features to use for model training and inference.
2617

2718
For more information, see _Chapter 2. Configuring Feature Store_.
2819

20+
*ML Engineer*
21+
22+
* Creates a feature definition file.
23+
* Defines the data sources and other feature store objects.
24+
* Makes features available for real-time inference.
25+
26+
For more information, see _Chapter 3: Defining features_.
27+
2928
*Data scientist*
3029

3130
. Creates a workbench.
3231
. Obtains the `feature_store.yaml` from the cluster administrator.
33-
. Installs Feature Store in their IDE environment, for example Jupyter.
34-
. Uses `feast` commands to retrieve features for model training.
32+
. Installs the Feature Store Python SDK in their IDE environment, for example Jupyter.
33+
. Uses `feast` Python APIs to retrieve features for model training in the workbench.
3534

36-
For more information, see _Chapter 3. Retrieving features for model training_.
35+
For more information, see _Chapter 4. Retrieving features for model training_.

modules/making-features-available-for-real-time-inference.adoc

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -11,10 +11,11 @@ Materialization ensures that the same features are available for both model trai
1111

1212
.Prerequisites
1313

14-
* You have the `cluster-admin` role in {openshift-platform}.
14+
* In your IDE environment, you have installed the Feature Store Python SDK as described in _Setting up your working environment_.
1515

16-
* You have registered the feature views that you want to materialize, as described in _Adding feature definitions and initializing your Feature Store instance_.
16+
* Your cluster administrator has pushed feature data to the online store.
1717

18+
* Optionally, you have cloned a Git repository that includes your model training code.
1819

1920
.Procedure
2021

modules/retrieving-online-features-for-model-inference.adoc

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -11,13 +11,13 @@ When the cluster administrator creates a `FeatureStore` CR, by default, it runs
1111

1212
A feature service is an object that represents a logical group of features from one or more feature views. With a feature service, your ML models can access features from within a feature view as needed.
1313

14-
Typically, you create one feature service per model version, allowing for tracking of the features used by models. The features retrieved from the online store can belong to multiple feature views.
14+
Typically, you create one feature service per model version, allowing for tracking of the features used by models. The features retrieved from the online store can belong to many feature views.
1515

16-
*Note:* When you apply a feature service, an actual service is not deployed.
16+
*Note:* When you apply a feature service, an actual feature store server is not deployed.
1717

1818
.Prerequisites
1919

20-
* In your IDE environment, you have installed Feature Store and added the `feature_store.yaml` file provided by your cluster administrator, as described in _Setting up your working environment_.
20+
* In your IDE environment, you have installed the Feature Store Python SDK and added a `feature_store.yaml` file, as described in _Setting up your working environment_.
2121

2222
* Your cluster administrator has pushed feature data to the online store.
2323

0 commit comments

Comments
 (0)