Checkpoint creation time #30

@connieksun

Hi again! Is there any way to make checkpoint creation faster? Our current Delta table has ~300,000 underlying Parquet files (1.6 TB) and a `_delta_log` with ~15,000 transaction files. About 7,000 files (40 GB) are added every day. The oxbow Lambda takes 13-14 minutes to create each checkpoint, and we worry it will soon hit the 15-minute Lambda maximum. Any ideas for how we can reduce the time needed for checkpointing? Thank you!
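
For a rough sense of how much headroom is left, here is a back-of-envelope sketch using the numbers above. It rests on two assumptions that have not been measured: that checkpoint time grows linearly with the number of Parquet files in the table, and that no files are removed or compacted in the meantime. The function name `days_until_timeout` is made up for this illustration and is not part of oxbow.

```python
def days_until_timeout(current_files, files_added_per_day,
                       current_minutes, limit_minutes=15.0):
    """Estimate days until checkpointing reaches the time limit.

    ASSUMPTION: checkpoint time scales linearly with file count,
    and the file count only grows (no compaction or vacuum).
    """
    minutes_per_file = current_minutes / current_files
    # File count at which the linear model reaches the limit.
    files_at_limit = limit_minutes / minutes_per_file
    return (files_at_limit - current_files) / files_added_per_day


# Figures from this issue: ~300,000 files, ~7,000 added per day,
# checkpoints currently taking 13-14 minutes.
for minutes in (13.0, 14.0):
    days = days_until_timeout(300_000, 7_000, minutes)
    print(f"at {minutes:.0f} min today: about {days:.1f} days of headroom")
```

Under those assumptions the estimate comes out to roughly three to seven days, which is why this feels urgent. If checkpoint time actually tracks something else, such as the number of transaction files in the log, the estimate would differ.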

P.S. Sorry for opening all these issues! Thanks for your quick responses!
