Skip to content

Write partial file portions? #14

@severo

Description

@severo

Would it be possible (and would it make sense to you @platypii?) to be able to generate only part of a file? Ie: if we do a local edit to a table, know which parts of the Parquet file will be modified? It would work well when using CDC and Xet (see https://huggingface.co/blog/parquet-cdc).

I guess it's not straightforward, since an edit can change the page, the row group metadata, and the file metadata, but, similarly to hyparquet letting us read a localized part of a Parquet file, it would help us write a Parquet file surgically.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions