Refresh PG statistics after log replay on restart #398
JacksonYao287 wants to merge 2 commits into eBay:main
Conversation
Add refresh_pg_statistics() to recalculate and update PG statistics (active_blob_count, tombstone_blob_count, total_occupied_blk_count) after log replay completes. This ensures statistics accuracy after system crashes or restarts. The function is called in on_log_replay_done() before the raft group joins, scanning the index table and chunks to recompute all three statistics from actual data. Statistics are persisted at the next periodic checkpoint.
According to the suggestion from @zhiteng, I made this change to force a refresh of the PG key metrics on restart.
de.total_occupied_blk_count.store(total_occupied, std::memory_order_relaxed);
});
LOGI("Refreshed statistics for pg={}: active_blobs={}, tombstone_blobs={}, occupied_blocks={}", pg_id, active_count,
Better to log the original value as well, plus a keyword like "corrected", for debugging.
Codecov Report: ❌ Patch coverage is

Additional details and impacted files:

@@            Coverage Diff             @@
##             main     #398      +/-  ##
===========================================
- Coverage   63.15%   52.86%   -10.30%
===========================================
  Files          32       36        +4
  Lines        1900     5272     +3372
  Branches      204      656      +452
===========================================
+ Hits         1200     2787     +1587
- Misses        600     2194     +1594
- Partials      100      291      +191
EXPECT_GT(pg_stats.used_bytes, 0) << "Used bytes should be greater than 0";
uint64_t used_bytes_after = pg_stats.used_bytes;
Better to corrupt the statistics one more time here.
auto hs_pg = dynamic_cast< HSHomeObject::HS_PG* >(_obj_inst->_pg_map[pg_id].get());
ASSERT_NE(hs_pg, nullptr);
// Manually corrupt statistics to simulate desync
s/desync/inconsistency
LOGI("Refreshed statistics for pg={}: active_blobs={}, tombstone_blobs={}, occupied_blocks={}", pg_id, active_count,
     tombstone_count, total_occupied);
LOGI("[corrected] Refreshed statistics for pg={}: active_blobs={} (original={}), tombstone_blobs={} (original={}), "
The "[corrected]" tag should not be printed if we didn't actually correct anything.
} else {
    active_count++;
}
return false; // Continue scanning
It can hit OOM here. Each blob entry takes 24B+; assuming we have 10GB of memory, that is up to ~400M blobs, and with an 8KB minimal blob size it can hit OOM.
Can we clean up the dummy_output during pagination?
LOGINFO("Created shard {}", shard_info.id);
// Put some blobs to populate statistics using put_blobs
const uint32_t num_active_blobs = 10;
If we have to handle pagination ourselves, the number should be large enough to ensure the B-tree actually has multiple pages.