Open
Description
Would it be possible (and would it make sense to you, @platypii?) to generate only part of a file? That is, if we make a local edit to a table, could we know which parts of the Parquet file will be modified? This would work well with CDC (content-defined chunking) and Xet (see https://huggingface.co/blog/parquet-cdc).
I guess it's not straightforward, since an edit can change the page, the row group metadata, and the file metadata. But just as hyparquet lets us read a localized part of a Parquet file, this would let us write a Parquet file surgically.
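To make the idea concrete, here is a rough sketch of the kind of answer I have in mind: given a description of the file layout and the location of an edit, return the byte ranges that would need to be rewritten. Everything here is hypothetical: the `FileLayout` types and the `affectedRanges` function are not part of hyparquet or hyparquet-writer, and the model is deliberately simplified (it assumes the rewritten page keeps its size, so no later offsets shift, and that the row group metadata lives in the footer).

```typescript
// Hypothetical, simplified model of a Parquet file layout (not a real API).
interface PageInfo { firstRow: number; numRows: number; offset: number; length: number }
interface ColumnChunkInfo { column: string; pages: PageInfo[] }
interface RowGroupInfo { columns: ColumnChunkInfo[] }
interface FileLayout { rowGroups: RowGroupInfo[]; footerOffset: number; footerLength: number }
interface ByteRange { start: number; end: number; reason: string }

// Return the byte ranges touched by editing one cell (column, absolute row index).
// Assumption: the edited page is re-encoded to the same length.
function affectedRanges(layout: FileLayout, column: string, row: number): ByteRange[] {
  const ranges: ByteRange[] = []
  for (const rowGroup of layout.rowGroups) {
    for (const chunk of rowGroup.columns) {
      if (chunk.column !== column) continue
      for (const page of chunk.pages) {
        if (row >= page.firstRow && row < page.firstRow + page.numRows) {
          ranges.push({ start: page.offset, end: page.offset + page.length, reason: 'data page' })
        }
      }
    }
  }
  // The footer holds the file and row group metadata (e.g. column statistics),
  // so it is treated as modified whenever any page changes.
  if (ranges.length > 0) {
    ranges.push({
      start: layout.footerOffset,
      end: layout.footerOffset + layout.footerLength,
      reason: 'footer metadata',
    })
  }
  return ranges
}

// Made-up layout: one row group, column "a" split into two pages, column "b" in one page.
const exampleLayout: FileLayout = {
  rowGroups: [{
    columns: [
      { column: 'a', pages: [
        { firstRow: 0, numRows: 100, offset: 4, length: 100 },
        { firstRow: 100, numRows: 100, offset: 104, length: 120 },
      ] },
      { column: 'b', pages: [{ firstRow: 0, numRows: 200, offset: 224, length: 300 }] },
    ],
  }],
  footerOffset: 524,
  footerLength: 80,
}

const edited = affectedRanges(exampleLayout, 'a', 150)
```

With this made-up layout, editing row 150 of column `a` touches only the second page of `a` (bytes 104 to 224) and the footer (bytes 524 to 604). In practice a re-encoded page will often change size, which shifts every later offset, and that is exactly where CDC-based deduplication would help.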