| subcategory | Machine Learning |
|---|
The following arguments are supported:
full_name(string, required) - The full three-part name (catalog, schema, name) of the featureprovider_config(ProviderConfig, optional) - Configure the provider for management through account provider.
workspace_id(string,required) - Workspace ID which the resource belongs to. This workspace must be part of the account which the provider is configured with.
The following attributes are exported:
description(string) - The description of the featurefilter_condition(string, deprecated) - Deprecated: Use DeltaTableSource.filter_condition or KafkaSource.filter_condition instead. Kept for backwards compatibility. The filter condition applied to the source data before aggregationfull_name(string) - The full three-part name (catalog, schema, name) of the featurefunction(Function) - The function by which the feature is computedinputs(list of string, deprecated) - Deprecated: Use AggregationFunction.inputs instead. Kept for backwards compatibility. The input columns from which the feature is computedlineage_context(LineageContext) - Lineage context information for this feature. WARNING: This field is primarily intended for internal use by Databricks systems and is automatically populated when features are created through Databricks notebooks or jobs. Users should not manually set this field as incorrect values may lead to inaccurate lineage tracking or unexpected behavior. This field will be set by feature-engineering client and should be left unset by SDK and terraform userssource(DataSource) - The data source of the featuretime_window(TimeWindow, deprecated) - Deprecated: Use Function.aggregation_function.time_window instead. Kept for backwards compatibility. The time window in which the feature is computed
variant_expr_path(string) - String representation of the column name or variant expression path. For nested fields, the leaf value is what will be present in materialized tables and expected to match at query time. For example, the leaf node of value:trip_details.location_details.pickup_zip is pickup_zip
offset(string) - The offset of the continuous window (must be non-positive)window_duration(string) - The duration of the continuous window (must be positive)
delta_table_source(DeltaTableSource)kafka_source(KafkaSource)
dataframe_schema(string) - Schema of the resulting dataframe after transformations, in Spark StructType JSON format (from df.schema.json()). Required if transformation_sql is specified. Example: {"type":"struct","fields":[{"name":"col_a","type":"integer","nullable":true,"metadata":{}},{"name":"col_c","type":"integer","nullable":true,"metadata":{}}]}entity_columns(list of string, deprecated) - Deprecated: Use Feature.entity instead. Kept for backwards compatibility. The entity columns of the Delta tablefilter_condition(string) - Single WHERE clause to filter delta table before applying transformations. Will be row-wise evaluated, so should only include conditionals and projectionsfull_name(string) - The full three-part (catalog, schema, table) name of the Delta tabletimeseries_column(string, deprecated) - Deprecated: Use Feature.timeseries_column instead. Kept for backwards compatibility. The timeseries column of the Delta tabletransformation_sql(string) - A single SQL SELECT expression applied after filter_condition. Should contains all the columns needed (eg. "SELECT , col_a + col_b AS col_c FROM x.y.z WHERE col_a > 0" would havetransformation_sql", col_a + col_b AS col_c") If transformation_sql is not provided, all columns of the delta table are present in the DataSource dataframe
extra_parameters(list of FunctionExtraParameter, deprecated) - Deprecated: Use the function oneof with AggregationFunction instead. Kept for backwards compatibility. Extra parameters for parameterized functionsfunction_type(string, deprecated) - Deprecated: Use the function oneof with AggregationFunction instead. Kept for backwards compatibility. The type of the function. Possible values are:APPROX_COUNT_DISTINCT,APPROX_PERCENTILE,AVG,COUNT,FIRST,LAST,MAX,MIN,STDDEV_POP,STDDEV_SAMP,SUM,VAR_POP,VAR_SAMP
key(string) - The name of the parametervalue(string) - The value of the parameter
job_id(integer) - The job ID where this API invokedjob_run_id(integer) - The job run ID where this API was invoked
entity_column_identifiers(list of ColumnIdentifier, deprecated) - Deprecated: Use Feature.entity instead. Kept for backwards compatibility. The entity column identifiers of the Kafka sourcename(string) - Name of the Kafka source, used to identify it. This is used to look up the corresponding KafkaConfig object. Can be distinct from topic nametimeseries_column_identifier(ColumnIdentifier, deprecated) - Deprecated: Use Feature.timeseries_column instead. Kept for backwards compatibility. The timeseries column identifier of the Kafka source
job_context(JobContext) - Job context information including job ID and run IDnotebook_id(integer) - The notebook ID where this API was invoked
slide_duration(string) - The slide duration (interval by which windows advance, must be positive and less than duration)window_duration(string) - The duration of the sliding window
continuous(ContinuousWindow)sliding(SlidingWindow)tumbling(TumblingWindow)
window_duration(string) - The duration of each tumbling window (non-overlapping, fixed-duration windows)