
Conversation

@Mantisus
Collaborator

@Mantisus Mantisus commented Nov 11, 2025

Description

  • This PR adds new buffer tables to improve the handling of metadata records. The key change is that metadata updates are now accumulated in a buffer and applied only when get_metadata is called. With the old behavior, metadata records were updated instantly within a transaction, which led to waiting for locks to be released in high-concurrency situations.
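The buffering idea can be sketched as follows. This is a minimal illustration using sqlite3, not the PR's actual SQLAlchemy models; the table and column names (dataset_metadata, metadata_buffer, item_count, delta) are hypothetical:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE dataset_metadata (id TEXT PRIMARY KEY, item_count INTEGER)")
conn.execute("CREATE TABLE metadata_buffer (dataset_id TEXT, delta INTEGER)")
conn.execute("INSERT INTO dataset_metadata VALUES ('ds1', 0)")

def record_item(dataset_id: str) -> None:
    # Fast append-only insert; no contention on the metadata row itself.
    conn.execute("INSERT INTO metadata_buffer VALUES (?, 1)", (dataset_id,))

def get_metadata(dataset_id: str) -> int:
    # Compact the accumulated deltas into the metadata row only on read.
    conn.execute(
        "UPDATE dataset_metadata SET item_count = item_count + "
        "(SELECT COALESCE(SUM(delta), 0) FROM metadata_buffer WHERE dataset_id = ?) "
        "WHERE id = ?",
        (dataset_id, dataset_id),
    )
    conn.execute("DELETE FROM metadata_buffer WHERE dataset_id = ?", (dataset_id,))
    conn.commit()
    return conn.execute(
        "SELECT item_count FROM dataset_metadata WHERE id = ?", (dataset_id,)
    ).fetchone()[0]

for _ in range(5):
    record_item("ds1")
print(get_metadata("ds1"))  # → 5
```

Writers only ever append to the buffer table, so they never queue up behind a lock on the single metadata row.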

Issues

@Mantisus Mantisus self-assigned this Nov 11, 2025
Collaborator

@vdusek vdusek left a comment


Please use a PR type without an exclamation mark

@Mantisus Mantisus changed the title perf!: Optimize metadata records processing in 'SqlStorageClient` perf: Optimize metadata records processing in 'SqlStorageClient` Nov 11, 2025
@Mantisus Mantisus marked this pull request as ready for review November 17, 2025 14:03
@Mantisus Mantisus requested review from janbuchar and vdusek November 17, 2025 14:03
@janbuchar
Collaborator

Interesting! I'd imagine that transactions consisting of e.g., an insertion to the dataset_items table and an update to dataset metadata wouldn't lock the metadata table for that long - you can commit right after the update to metadata.

Also, the buffering approach is faster because the buffer table gets a row for each increment and those get compacted later on, correct?

@Mantisus
Collaborator Author

update to dataset metadata wouldn't lock the metadata table for that long

They will create many short-lived locks, and with a large number of clients inserting new records under high concurrency, this effect accumulates.
This is exactly what @ericvg97 pointed out - #1533 (comment)

Although, of course, the strongest impact is on RequestQueue.

Yes, insert operations into the buffer table are quite fast. We can then simply apply the aggregated result to update the metadata record.
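The compaction step amounts to one aggregate query over the buffered rows instead of many per-row updates on the metadata record. A hypothetical illustration (the schema and field names are assumptions, not the PR's actual tables; counters are summed, timestamps take the latest value):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE rq_buffer (queue_id TEXT, handled_delta INTEGER, accessed_at TEXT)"
)
# Three buffered increments for the same request queue.
rows = [
    ("q1", 1, "2025-11-11T10:00"),
    ("q1", 1, "2025-11-11T10:05"),
    ("q1", 0, "2025-11-11T10:02"),
]
conn.executemany("INSERT INTO rq_buffer VALUES (?, ?, ?)", rows)

# One aggregation replaces three separate UPDATEs of the metadata row.
total, latest = conn.execute(
    "SELECT SUM(handled_delta), MAX(accessed_at) FROM rq_buffer WHERE queue_id = ?",
    ("q1",),
).fetchone()
print(total, latest)  # → 2 2025-11-11T10:05
```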

@Mantisus Mantisus changed the title perf: Optimize metadata records processing in 'SqlStorageClient` perf: Optimize metadata records processing in SqlStorageClient Nov 18, 2025
@janbuchar
Collaborator

update to dataset metadata wouldn't lock the metadata table for that long

They will create many short-lived locks, and with a large number of clients inserting new records under high concurrency, this effect accumulates. This is exactly what @ericvg97 pointed out - #1533 (comment)

Although, of course, the strongest impact is on RequestQueue.

I see, thanks. And is there any chance that the lock is held for too long because of how we work with SQLAlchemy? In other words, would it be better if we just executed SQL such as insert ...; update ...; commit in one go? If yes, it might be worth trying before adding three new tables to the whole thing.
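The alternative being suggested can be sketched as keeping the existing schema but committing immediately after the metadata update, so the metadata-row lock is held as briefly as possible. A minimal sqlite3 sketch with hypothetical table names, not the project's real code:

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE dataset_items (dataset_id TEXT, data TEXT)")
conn.execute("CREATE TABLE dataset_metadata (id TEXT PRIMARY KEY, item_count INTEGER)")
conn.execute("INSERT INTO dataset_metadata VALUES ('ds1', 0)")
conn.commit()

def push_item(dataset_id: str, data: str) -> None:
    # insert ...; update ...; commit as one short transaction.
    conn.execute("INSERT INTO dataset_items VALUES (?, ?)", (dataset_id, data))
    conn.execute(
        "UPDATE dataset_metadata SET item_count = item_count + 1 WHERE id = ?",
        (dataset_id,),
    )
    conn.commit()  # releases the metadata-row lock right away

push_item("ds1", "a")
push_item("ds1", "b")
print(
    conn.execute("SELECT item_count FROM dataset_metadata WHERE id = 'ds1'").fetchone()[0]
)  # → 2
```

Even with an immediate commit, every writer still serializes on the single metadata row, which is the contention the buffer-table approach avoids.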

@Mantisus
Collaborator Author

it might be worth trying before adding three new tables to the whole thing.

I will test this approach.



Development

Successfully merging this pull request may close these issues.

Updating request queue metadata performs full table scan in SQL storage
