Bucket priorities #192

simolus3 · 2025-02-03T15:53:01Z

This implements the bucket priorities proposal. When buckets with different priorities are involved:

- We only guarantee consistency within each priority: clients may see changes from higher-priority buckets before changes from lower-priority buckets, even if there is a causal relationship between them.
- (Not a concern for the sync service.) For the highest priority, clients are allowed to upload local changes before they have received a complete checkpoint.

Priorities are currently declared with a static literal column named _priority in a parameter query, e.g.

bucket_definitions:
  global_todos:
    parameters: SELECT 0 as _priority;
    data:
      - SELECT * FROM todos
  global_lists: # implicit default priority 1
    data:
      - SELECT * FROM lists

At the moment, the implementation simply groups buckets into their priorities and then synchronizes each priority in a batch (instead of using a batch for all buckets like before).
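
The grouping step described above could look roughly like the following. This is a minimal sketch, not the code in sync.ts: the `BucketDescription` shape, the `planSync` helper and the `data:` / `partial_complete:` message strings are hypothetical names used only for illustration. It assumes lower numbers mean higher priority, with 0 being the highest.

```typescript
// Hypothetical shape: the real service types are not shown in this PR.
interface BucketDescription {
  bucket: string;
  priority: number; // lower number = higher priority, 0 is the highest
}

// Group buckets by priority and return the groups ordered from the
// highest priority (smallest number) to the lowest.
function groupByPriority(
  buckets: BucketDescription[]
): Array<[number, BucketDescription[]]> {
  const groups = new Map<number, BucketDescription[]>();
  for (const b of buckets) {
    const group = groups.get(b.priority);
    if (group) {
      group.push(b);
    } else {
      groups.set(b.priority, [b]);
    }
  }
  return Array.from(groups.entries()).sort((x, y) => x[0] - y[0]);
}

// Produce the order in which data would be sent: one batch per priority,
// followed by a (hypothetical) marker that this priority is complete.
function planSync(buckets: BucketDescription[]): string[] {
  const messages: string[] = [];
  for (const [priority, group] of groupByPriority(buckets)) {
    for (const b of group) {
      messages.push(`data:${b.bucket}`);
    }
    messages.push(`partial_complete:${priority}`);
  }
  return messages;
}

console.log(
  planSync([
    { bucket: 'global_lists', priority: 3 },
    { bucket: 'global_todos', priority: 0 }
  ])
);
```

With a single priority in use, the plan collapses to one batch, which matches the previous behavior of syncing all buckets together.
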
An interesting improvement might be to interrupt lower-priority sync work when higher-priorities have new data. For instance, in a flow like:

1. New checkpoint sent to client.
2. No changed todos to send to client, immediately send partial complete message for priority 0.
3. Start syncing a large amount of lists rows.
4. Before being done with step 3, a new checkpoint with new todos comes in.

Here, we might want to stop sending the rows from step 3 so that we can send the new checkpoint first. The rows already sent are not lost: the client has received those operations, and we can later resume from that state.
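
One way to picture the interruption is a sender that checks for a newer checkpoint between batches and reports where to resume. This is a sketch under assumptions, not the service's implementation: `sendLowPriorityRows`, `SyncProgress`, `hasNewCheckpoint` and `send` are hypothetical names, and the real code is asynchronous.

```typescript
interface SyncProgress {
  sent: number; // operations sent during this call
  resumeFrom: number; // index of the next operation to send
  interrupted: boolean; // true if we stopped early for a new checkpoint
}

// Send operations in batches, stopping between batches as soon as a newer
// checkpoint is available. Already-sent operations stay valid on the
// client, so a later call can continue from `resumeFrom`.
function sendLowPriorityRows(
  operations: string[],
  startAt: number,
  batchSize: number,
  hasNewCheckpoint: () => boolean,
  send: (batch: string[]) => void
): SyncProgress {
  let position = startAt;
  while (position < operations.length) {
    // Check before each batch so a batch is never cut in half.
    if (hasNewCheckpoint()) {
      return { sent: position - startAt, resumeFrom: position, interrupted: true };
    }
    const batch = operations.slice(position, position + batchSize);
    send(batch);
    position += batch.length;
  }
  return { sent: position - startAt, resumeFrom: position, interrupted: false };
}
```

A caller would handle the new checkpoint (including any higher-priority data) when `interrupted` is true, then call the function again with `startAt` set to the returned `resumeFrom`.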

changeset-bot · 2025-02-03T15:53:07Z

🦋 Changeset detected

Latest commit: b683952

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 10 packages

Name	Type
@powersync/service-core	Patch
@powersync/service-sync-rules	Minor
@powersync/service-core-tests	Patch
@powersync/service-module-mongodb-storage	Patch
@powersync/service-module-mongodb	Patch
@powersync/service-module-mysql	Patch
@powersync/service-module-postgres-storage	Patch
@powersync/service-module-postgres	Patch
@powersync/service-image	Patch
test-client	Patch

.changeset/tall-peas-cough.md

rkistner

Looks good overall! Added some minor comments on the implementation.

If I understand the implementation correctly, it will effectively change nothing if bucket priorities are not specified, which is great.

For specifying bucket priorities, could you also add support for this syntax?

bucket_definitions:
  bulk_data:
    priority: 2
    parameters: select id as project_id from projects where ...

I think this would be simpler for the majority of cases, leaving the as _priority form only for cases where developers need more control.

packages/service-core/src/sync/sync.ts

rkistner · 2025-02-05T11:51:02Z

After looking at the consistency implications in the client implementation, I think it would be better to make the default priority 3 instead of 1.

If the client gets a partial checkpoint, we only sync PUT operations, ignoring REMOVEs, which changes the consistency properties a little. I think that's a decent trade-off in many cases, but the developer needs to be aware of it.

Now imagine a developer starts with only having a priority 1 bucket (the default). Now they want to sync some bulk data as well, so they add a new bucket with priority 2. The issue is that this now affects the consistency properties of bucket 1, despite not touching it in the sync rules.

If we instead make priority 3 (the lowest) the default, the developer has to explicitly modify the original bucket to assign it a higher priority, which makes the implications more obvious.

Thoughts?

simolus3 · 2025-02-05T15:24:47Z

> Now imagine a developer starts with only having a priority 1 bucket (the default). Now they want to sync some bulk data as well, so they add a new bucket with priority 2. The issue is that this now affects the consistency properties of bucket 1, despite not touching it in the sync rules.

I agree that making the lowest priority the default is a good choice to make sure changing the priority of some buckets doesn't affect unrelated buckets 👍
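
To make the effect of the default concrete, here is a small sketch of resolving a bucket's effective priority. The `BucketDefinition` shape and both helper functions are hypothetical. The sketch only models the bucket-level `priority` key and leaves out the `_priority` column, since this thread does not say how the two interact.

```typescript
// Lowest priority, used when a bucket does not declare one.
const DEFAULT_PRIORITY = 3;

// Hypothetical, simplified view of a bucket definition.
interface BucketDefinition {
  name: string;
  priority?: number; // bucket-level `priority:` key, if present
}

function effectivePriority(def: BucketDefinition): number {
  return def.priority ?? DEFAULT_PRIORITY;
}

function effectivePriorities(defs: BucketDefinition[]): { [name: string]: number } {
  const result: { [name: string]: number } = {};
  for (const def of defs) {
    result[def.name] = effectivePriority(def);
  }
  return result;
}

// Adding a second bucket without a priority leaves both on the default,
// so the existing bucket keeps its previous consistency behavior.
console.log(effectivePriorities([{ name: 'user_data' }, { name: 'bulk_data' }]));

// Raising a bucket's priority requires an explicit edit to that bucket.
console.log(effectivePriorities([{ name: 'user_data', priority: 1 }, { name: 'bulk_data' }]));
```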

package.json

packages/service-core/src/sync/sync.ts

rkistner

Happy to merge this.

As mentioned offline, there is a fix in f60c705 that should be included before releasing this.

rkistner reviewed Feb 4, 2025

simolus3 added 10 commits February 6, 2025 12:52

Add bucket priority to parameter query

14c08f7

Only include priority in checkpoint message

dc8e14c

Migrate bucket ids -> decsription in sync-rules

f55e36a

Sync buckets in priority order

14fedf2

Format

5acfa6d

Add service changeset

15283d4

Set default priorit to 3

3f0ae53

Allow defining priorities on a bucket level

c09bae7

Use Map.groupBy

f998b56

Adapt new json_array tests

7b1fcde

simolus3 force-pushed the feat/bucket-priorities branch from b8b7836 to 7b1fcde Compare February 6, 2025 12:02

rkistner mentioned this pull request Feb 7, 2025

Optimize incremental sync: Phase 1 #197

Merged

simolus3 and others added 4 commits February 7, 2025 13:53

Interrupt syncing low-priority buckets

d7065b1

Test storage on github actions.

8f514de

Update test expectations

45dd8d3

Also update mongodb storage snapshot

4d9d4a0

rkistner reviewed Feb 10, 2025

simolus3 added 3 commits February 10, 2025 15:35

Fix aborting low-priority syncs

2e17504

Include sync order snapshot for postgres

db9ebac

Don't require partial sync for interruption

28e8c0f

simolus3 marked this pull request as ready for review February 10, 2025 15:09

benitav mentioned this pull request Feb 13, 2025

Sync bucket priorities powersync-ja/powersync-docs#112

Merged

rkistner approved these changes Feb 13, 2025

Merge remote-tracking branch 'origin/main' into feat/bucket-priorities

b683952

rkistner merged commit 7b1ba31 into main Feb 18, 2025
29 checks passed

rkistner deleted the feat/bucket-priorities branch February 18, 2025 12:41
