[Postgres] Optimize transaction replication throughput #228
Previously, if we received 100k transactions each containing a single operation, we would commit/flush to the bucket storage 100k times. Flushing is slow: on a storage cluster under load it can take 100-500ms per flush, limiting throughput to roughly 2-10 transactions per second.
Lucky for us, Postgres already chunks messages together in the replication stream. So now we look ahead in the current chunk to see if there are any more commit messages. If there are, we only flush/commit on the last one. This means if we get many transactions in a single chunk, they are all batched together in a single flush to the bucket storage.
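Roughly, the look-ahead works like the minimal sketch below. This is not the actual implementation; the `ReplicationMessage` type and the `applyToBatch`/`flushToBucketStorage` helpers are illustrative placeholders for the real batch/storage APIs.

```ts
// Sketch: flush only on the last commit message in a replication chunk.
// All names here are illustrative, not the real service APIs.

type ReplicationMessage =
  | { tag: 'begin' }
  | { tag: 'insert' | 'update' | 'delete'; data: unknown }
  | { tag: 'commit'; lsn: string };

async function processChunk(
  chunk: ReplicationMessage[],
  applyToBatch: (msg: ReplicationMessage) => void,
  flushToBucketStorage: (lsn: string) => Promise<void>
): Promise<void> {
  // Look ahead: find the last commit in this chunk - the only flush point.
  const lastCommitIndex = chunk.map((m) => m.tag).lastIndexOf('commit');

  for (let i = 0; i < chunk.length; i++) {
    const msg = chunk[i];
    if (msg.tag === 'insert' || msg.tag === 'update' || msg.tag === 'delete') {
      // Accumulate the change in the in-memory batch; no I/O yet.
      applyToBatch(msg);
    } else if (msg.tag === 'commit' && i === lastCommitIndex) {
      // Only the last commit in the chunk triggers a (slow) flush, so every
      // transaction in this chunk shares a single flush to bucket storage.
      await flushToBucketStorage(msg.lsn);
    }
    // Commits before the last one fall through without flushing; their
    // changes stay in the batch until the final flush.
  }
}
```

The effect is that intermediate commits within a chunk no longer trigger storage I/O; one flush at the last commit persists everything up to that point.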
For reference, we already have similar behavior in our MongoDB replication implementation.
In theory we could also batch transactions across multiple replication chunks, but I don't expect significant further gains from that, and it could increase complexity and memory usage.
This also adds Postgres 17 (released September 2024) to the test matrix.