
Conversation


@Amogh-Bharadwaj (Contributor) commented Dec 5, 2025

Why

Suppose you have a Postgres mirror running in CDC with a pull batch size of 1.
On the source, the following operations are performed on a table t1 in quick succession:

```sql
ALTER TABLE t1 ADD COLUMN good_column TIMESTAMP DEFAULT CURRENT_TIMESTAMP;
INSERT INTO t1 DEFAULT VALUES;
ALTER TABLE t1 ADD COLUMN lost_column TIMESTAMP DEFAULT CURRENT_TIMESTAMP;
```

PeerDB syncs these operations as follows:

  1. Receives the relation message for the addition of good_column and the INSERT.
    1.1) Checks the schema of this table as stored in catalog, sees that good_column is not present, and marks good_column as a delta to be synced (see the sketch after this list).
  2. Does not receive a relation message for the addition of lost_column, because no DML followed it (Postgres only emits a relation message ahead of DML that touches the table).
  3. Syncs the INSERT to the destination and adds good_column to the destination.
  4. Performs applySchemaDelta, which fetches the schema of table t1 from Postgres and stores it in catalog. Note that lost_column is stored here as well, since it is part of the source table at this point.
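
To make step 1.1 concrete, here is a minimal sketch of the delta-detection idea in Go; the names are hypothetical, not PeerDB's actual types:

```go
// Hypothetical sketch of step 1.1: compare the columns in a relation
// message against the schema recorded in catalog; any column the
// catalog does not know about becomes a delta to sync downstream.
type ColumnInfo struct {
	Name string
	Type string
}

func columnsToAdd(relationCols []ColumnInfo, catalogCols map[string]ColumnInfo) []ColumnInfo {
	var deltas []ColumnInfo
	for _, col := range relationCols {
		if _, known := catalogCols[col.Name]; !known {
			deltas = append(deltas, col)
		}
	}
	return deltas
}
```

The bug below follows directly: once lost_column lands in catalogCols via applySchemaDelta's fetch from Postgres, columnsToAdd can never report it again.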

Now, when another insert is performed:

```sql
INSERT INTO t1 DEFAULT VALUES;
```

PeerDB now:

  1. Receives the relation message for the addition of lost_column and the INSERT.
    1.1) Checks the schema of this table as stored in catalog, sees that lost_column is already present, and moves on.
  2. Errors out when inserting into the target tables, with a message resembling column "lost_column" not found in destination.

This happens because we never identified lost_column as a delta to be added to the destination.

What

This PR changes the implementation of applySchemaDeltas so that it no longer touches Postgres and instead simply does:

updated columns for t1 in catalog = current columns of t1 in catalog + columns added in this batch's delta

This way, when the second INSERT comes along in the example above, PeerDB sees that lost_column is not present in catalog and adds it to the list of schema deltas to be synced.
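
In Go terms, the core of the new behavior is a plain merge. A minimal sketch, reusing the hypothetical ColumnInfo type from the sketch above:

```go
// Merge this batch's deltas into the catalog schema without consulting
// Postgres. Columns that were never surfaced as a delta (like
// lost_column above) stay absent from catalog, so a later relation
// message still detects them as new.
func applyDeltaToCatalog(catalogCols map[string]ColumnInfo, delta []ColumnInfo) map[string]ColumnInfo {
	updated := make(map[string]ColumnInfo, len(catalogCols)+len(delta))
	for name, col := range catalogCols {
		updated[name] = col
	}
	for _, col := range delta {
		updated[col.Name] = col
	}
	return updated
}
```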

Note: Refactors the table OID migration code so that we can share a common function to update table schemas in catalog.

  • E2E tests pending
  • Functional reproduction pending

@jgao54 (Contributor) commented Dec 5, 2025

@Amogh-Bharadwaj love this change! -- i.e. rely on our own state (of schema) as the base instead of fetching from the source db to apply schema deltas, since many things can result in schema changes between SyncRecords runs.

Review thread on the following code:

```go
	return tableNameSchemaMapping, nil
}

func (a *FlowableActivity) applySchemaDeltas(
```
@jgao54 commented:

did a quick code search and i see a few other places that are using options.tableMapping (i.e. latest schema). wondering if we have any other logic that relies on the latest schema when it should rely on the catalog schema, and whether those introduce any subtle edge cases like this one

@Amogh-Bharadwaj (Author) replied:

Good call out, will take a look

Review thread on this diff:

```diff
 }

-	err := internal.UpdateTableOIDsInTableSchemaInCatalog(
+	err = internal.UpdateTableSchemasInCatalog(
```
@jgao54 commented Dec 5, 2025:

For MigratePostgresTableOIDs:

```go
// MIGRATION: Migrate Postgres table OIDs to catalog before starting/resuming the flow
migrateCtx := workflow.WithActivityOptions(ctx, workflow.ActivityOptions{
	StartToCloseTimeout: 1 * time.Hour,
	HeartbeatTimeout:    2 * time.Minute,
})
```

iiuc, on pause-restart we also sync what's in the source db to our catalog. so there may be a chance here too that, if there are schema changes between pause and resume, we end up syncing catalog to the latest schema, and ApplySchemaDelta could miss column additions.

I'm wondering what the motivation is for always syncing the schema to latest on pause/restart, vs. just letting CDC do the catch-up via ApplySchemaDelta.

Note this doesn't block your PR, just an observation and maybe a follow-up item.

@Amogh-Bharadwaj (Author) commented Dec 5, 2025:

> we also sync what's in the source db to our catalog

Actually, this activity doesn't touch Postgres: it moves the OIDs from the state object to catalog. The OIDs in state are populated during the initial setup flow and can be extended by the setup flows of table additions, but they are independent of schema changes, so we should be good?

P.S.: This migration is what enables cancellation of table additions (added recently); it can be removed after a certain period of time.
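
For illustration, a hypothetical sketch of the distinction being made here: the migration only copies OIDs that already live in workflow state into catalog, and never queries Postgres:

```go
// Hypothetical sketch: copy table OIDs from the workflow state object
// into the catalog entries. No Postgres query is involved, so schema
// changes on the source cannot leak into catalog through this path.
type CatalogTableEntry struct {
	OID     uint32
	Columns []string
}

func migrateOIDsToCatalog(stateOIDs map[string]uint32, catalog map[string]CatalogTableEntry) {
	for table, oid := range stateOIDs {
		entry := catalog[table]
		entry.OID = oid // columns stay exactly as previously recorded
		catalog[table] = entry
	}
}
```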

jgao54 added a commit that referenced this pull request Dec 12, 2025
This pull request introduces a more robust approach for applying schema
deltas to the catalog. It is a follow-up to
#3768 (and addresses the race
condition described in the PR description), with e2e test coverage, and
with the new approach shadow-applied for validation before it is fully
enabled, since this PR introduces some non-trivial changes to db state
that are hard to debug/roll back.

Changes:
- Moved the previous `applySchemaDeltas` logic to `applySchemaDeltasV1` and
introduced a separate `applySchemaDeltasV2`. Note that
`applySchemaDeltasV2` applies changes in-memory and works in conjunction
with a `ReadModifyWriteTableSchemasToCatalog` method to support
transactions with read-modify-write patterns.
- Added a feature flag (`PEERDB_APPLY_SCHEMA_DELTA_TO_CATALOG`) and
logic to choose between the legacy (v1) and new (v2) approaches for
applying schema deltas to the catalog. For now the FF is **disabled** by
default, which means the old approach continues to be used as the source
of truth.
- Shadow-applied the new v2 method by adding temporary validation
utilities that compare the results of the v1 and v2 schema delta
approaches, logging discrepancies and reporting metrics for monitoring.
- Improved end-to-end tests to cover the race condition where columns
added without subsequent DML operations could be lost in the destination
schema, verifying catalog correctness after each sync.
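
As a rough sketch of the gating and shadow validation described above (the flag name comes from this commit message; the types and signatures here are hypothetical and simplified):

```go
import (
	"context"
	"log"
	"os"
	"reflect"
)

// SchemaDelta is a hypothetical stand-in for the real delta type.
type SchemaDelta struct{ TableName string }

// applySchemaDeltas chooses between the two approaches. v1 re-reads
// schemas from the source (legacy); v2 merges deltas in memory. Both
// are passed in here so the sketch stays self-contained.
func applySchemaDeltas(
	ctx context.Context,
	deltas []SchemaDelta,
	v1, v2 func(context.Context, []SchemaDelta) (map[string][]string, error),
) error {
	v2Result, v2Err := v2(ctx, deltas)
	if os.Getenv("PEERDB_APPLY_SCHEMA_DELTA_TO_CATALOG") == "true" {
		return v2Err // flag on: v2 is the source of truth
	}
	v1Result, v1Err := v1(ctx, deltas)
	// Shadow validation: v1 stays authoritative; discrepancies are logged.
	if v1Err == nil && v2Err == nil && !reflect.DeepEqual(v1Result, v2Result) {
		log.Printf("schema delta shadow mismatch: v1=%v v2=%v", v1Result, v2Result)
	}
	return v1Err
}
```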
