Skip to content

Data source name format #1041

@abhineet13

Description

@abhineet13

Background [Optional]

Hi, the data source tab lists data source name as the last string in storage path, for example
spark.read.parquet("s3://bucket/folder/partition=2020-01-01") will have a data source name of "partition=2020-01-01" and
val df= spark.read.parquet("s3://bucket/folder")
df.createOrReplaceTempView("df")
spark.sql("select * from df where partition=2020-01-01") will create a data source name of "folder" in Spline UI.

Question

Is it possible to keep data source naming consistent at folder level, otherwise for daily jobs Spline UI will show multiple data source names for each partition.

Thanks

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    Status

    Done

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions