Skip to content

Conversation

@steFaiz
Copy link
Contributor

@steFaiz steFaiz commented Jan 8, 2026

Purpose

Add firstRowId, writeCols and IsBlob columns to Files System Table.

Tests

None

API and Format

None

Documentation

None

new DataField(17, "file_source", DataTypes.STRING()),
new DataField(18, "first_row_id", DataTypes.BIGINT()),
new DataField(19, "write_cols", DataTypes.ARRAY(DataTypes.STRING())),
new DataField(20, "blob", DataTypes.BOOLEAN())));
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Remove this, just get info from file name?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for your advise! I was thinking that this is convenient for sql. For example, if we want to calculate all blob's storage, we just need to write where T.blob = true instead of where T.fileName LIKE '%s.blob'.
Would you be okay with keeping this boolean field, or do you prefer avoid persisting redundant metadata?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This system table should just reflect the schema of data file meta, so I think it is better to remove this.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for your explaination! I've adapted the code.

@JingsongLi
Copy link
Contributor

+1

@JingsongLi JingsongLi merged commit 07e3e9a into apache:master Jan 9, 2026
12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants