
[server][improve] Add WAL cache to optimize replication. #794

Open
dao-jun wants to merge 8 commits into oxia-db:main from dao-jun:dev/add_wal_cache

Conversation

@dao-jun (Contributor) commented Oct 27, 2025

Add a WAL LogEntry cache to improve replication.
Bypass the page cache and eliminate deserialization overhead for tailing reads of the WAL.

Under ideal conditions, the oxia_server_wal_read_latency_milliseconds_sum can be 0.
(metrics screenshot)
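To illustrate the idea (a minimal sketch only, with hypothetical names such as entryCache, not the PR's actual code): the WAL keeps a small in-memory cache of recently appended, already-decoded entries keyed by offset, so a tailing reader can serve the entry straight from memory instead of going through the page cache and decoding it again.

```go
// Minimal sketch (hypothetical names, not the PR's code): a small FIFO cache
// of recently appended log entries, keyed by offset. The writer populates it
// on append; a tailing reader checks it before falling back to the segment
// file on disk.
package wal

import "sync"

type LogEntry struct {
	Offset int64
	Value  []byte
}

type entryCache struct {
	mu      sync.RWMutex
	entries map[int64]*LogEntry // offset -> decoded entry
	order   []int64             // insertion order, used for FIFO eviction
	maxSize int
}

func newEntryCache(maxSize int) *entryCache {
	return &entryCache{entries: make(map[int64]*LogEntry), maxSize: maxSize}
}

// Put stores a freshly appended entry, evicting the oldest one when full.
func (c *entryCache) Put(e *LogEntry) {
	c.mu.Lock()
	defer c.mu.Unlock()
	if len(c.order) > 0 && len(c.order) >= c.maxSize {
		oldest := c.order[0]
		c.order = c.order[1:]
		delete(c.entries, oldest)
	}
	c.entries[e.Offset] = e
	c.order = append(c.order, e.Offset)
}

// Get returns the cached entry for the offset, or nil when the reader has
// fallen behind the cached window and must read the segment file instead.
func (c *entryCache) Get(offset int64) *LogEntry {
	c.mu.RLock()
	defer c.mu.RUnlock()
	return c.entries[offset]
}
```

A follower that tails the log right behind the writer should almost always hit such a cache, which is what drives the read-latency metric toward zero.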

Testing the WAL via wal-perf:
before: (benchmark screenshot)
after: (benchmark screenshot)

The read/write throughput increases by about 32%.

Signed-off-by: dao-jun <daojun@apache.org>
@merlimat (Collaborator) left a comment:


That's a cool addition, just a couple of comments.

)

const (
	logEntryCacheSize int = 32
@merlimat (Collaborator) commented on this diff:

It would be better if we could set a weigher and use a maximum number of bytes instead of a number of entries here.

@dao-jun (Contributor, Author) replied:

Resolved. But I don't understand what "set a weigher" means; could you please explain in detail?
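For readers following the thread: a "weigher" is the common cache-library term (e.g. Guava and Caffeine call it a weigher, ristretto calls it cost) for a function that assigns each entry a size, so the cache is bounded by total bytes rather than by entry count. A minimal hand-rolled sketch of the idea, not the library call actually used in the PR:

```go
// Sketch of byte-weighted eviction (hypothetical code, not the PR's cache):
// the "weight" of each cached entry is the size of its payload, and eviction
// removes the oldest entries until the total stays under a byte budget.
package wal

import "sync"

type weightedEntryCache struct {
	mu         sync.Mutex
	entries    map[int64][]byte // offset -> serialized entry
	order      []int64          // insertion order, for FIFO eviction
	totalBytes int64
	maxBytes   int64
}

func newWeightedEntryCache(maxBytes int64) *weightedEntryCache {
	return &weightedEntryCache{entries: make(map[int64][]byte), maxBytes: maxBytes}
}

func (c *weightedEntryCache) Put(offset int64, payload []byte) {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.entries[offset] = payload
	c.order = append(c.order, offset)
	c.totalBytes += int64(len(payload))
	// Evict from the head until the cache fits the configured byte budget.
	for c.totalBytes > c.maxBytes && len(c.order) > 0 {
		oldest := c.order[0]
		c.order = c.order[1:]
		if old, ok := c.entries[oldest]; ok {
			c.totalBytes -= int64(len(old))
			delete(c.entries, oldest)
		}
	}
}
```

With this scheme a cap like 2 MiB costs the same memory whether entries are large or small, which is harder to guarantee with a fixed entry count such as 32.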

Signed-off-by: dao-jun <daojun@apache.org>
@dao-jun (Contributor, Author) commented Oct 28, 2025

(wal-perf benchmark screenshot)

Perf test after addressing the review comments; the performance is still OK.

@mattisonchao (Member) left a comment:


LGTM +1

It would be better if you could consider this.

Oxia is a sharded system, so we need to think more about cache memory control. Currently every shard has its own WAL, and I am not sure if or when we will go for a sharding WAL (IMO we should, to avoid the mmap cost of many open segments), but either way we should pay attention to the cost.

  • We need a global cache to avoid using shards_num * 2MB (100 shards = 200 MiB, 1,000 shards ~ 2 GiB). This would still be very useful when we migrate to a sharding WAL.
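One way to picture the suggestion (a hypothetical sketch, not code from this PR): a single process-wide cache with one global byte budget, keyed by (shardID, offset), so the memory cost stays fixed no matter how many shards the server hosts.

```go
// Hypothetical sketch of a server-wide cache shared by all shards' WALs,
// keyed by (shardID, offset). Byte-budget eviction would work exactly like
// the weighted sketch earlier in this thread and is omitted here.
package wal

import "sync"

type shardOffset struct {
	ShardID int64
	Offset  int64
}

type SharedEntryCache struct {
	mu      sync.RWMutex
	entries map[shardOffset][]byte
}

var (
	sharedOnce  sync.Once
	sharedCache *SharedEntryCache
)

// Shared returns the single process-wide instance that every shard's WAL
// writer and tailing reader would use, instead of one cache per shard.
func Shared() *SharedEntryCache {
	sharedOnce.Do(func() {
		sharedCache = &SharedEntryCache{entries: make(map[shardOffset][]byte)}
	})
	return sharedCache
}

func (c *SharedEntryCache) Get(shardID, offset int64) ([]byte, bool) {
	c.mu.RLock()
	defer c.mu.RUnlock()
	v, ok := c.entries[shardOffset{ShardID: shardID, Offset: offset}]
	return v, ok
}
```

The arithmetic in the comment above is the motivation: with per-shard caches, memory grows linearly with the shard count, while a shared cache keeps it at one fixed budget.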

@dao-jun (Contributor, Author) commented Oct 28, 2025 (quoting the comment above):

LGTM +1

It would be better if you could consider this.

Oxia is a sharded system, so we need to think more about cache memory control. Currently every shard has its own WAL, and I am not sure if or when we will go for a sharding WAL (IMO we should, to avoid the mmap cost of many open segments), but either way we should pay attention to the cost.

  • We need a global cache to avoid using shards_num * 2MB (100 shards = 200 MiB, 1,000 shards ~ 2 GiB). This would still be very useful when we migrate to a sharding WAL.

I've considered this. Even with a future single-WAL instance, we would still strive to distribute memory evenly among the shards; otherwise there will still be cache penetration, which is not much different from the current implementation.

@mattisonchao (Member) commented Oct 28, 2025 (quoting the reply above):

I've considered this. Even with a future single-WAL instance, we would still strive to distribute memory evenly among the shards; otherwise there will still be cache penetration, which is not much different from the current implementation.

Well... after thinking about it more, I don't think the write cache can help us very much in this case, because reads always happen after the data sync. If the write traffic is very large, the 2 MiB buffer will never work as expected.

Indeed, your benchmark shows some improvement, but that logic is different from Oxia's. Let me change some of the logic to make it match Oxia's implementation. Plus, you could also use your implementation in cluster benchmarking to see if there is any improvement in oxia_server_wal_read_latency_milliseconds_sum.
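To put rough, illustrative numbers on that concern (assumed figures, not from the PR): at a sustained write rate of 100 MB/s, a 2 MiB buffer holds only about 20 ms worth of entries, so any reader that lags the writer by more than roughly 20 ms misses the cache and falls back to the segment files.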
