-
Notifications
You must be signed in to change notification settings - Fork 5
Description
Incremental processing is an approach to process new or changed data in workflows. The key advantage is that it only incrementally processes data that are newly added or updated to a dataset, instead of re-processing the complete dataset. This not only reduces the cost of compute resources but also reduces the execution time in a significant manner. When workflow execution has a shorter duration, chances of failure and manual intervention reduce. It also improves the engineering productivity by simplifying the existing pipelines and unlocking the new patterns.
We should explore incremental processing techniques whereby new data is incrementally integrated into our data product on Iceberg and a good starting point would be looking at what NetFlix is doing with Mastro:
Metadata
Metadata
Assignees
Labels
Type
Projects
Status