[Hot State] Add a separate StateMerkleDb for hot state #18385

wqfish · 2025-12-23T23:01:45Z

In order to compute the hot state root hash for each block (including the
speculative ones), we have to be able to return Merkle proof for any given key
and version. The most straightforward way to do this is to add another instance
of StateMerkleDb in StateStore.

For now, we always delete and reset the DB on restart. This will give us more
flexibility to make changes.

Note

Adds a hot-state storage path to enable Merkle proofs for speculative blocks.

Introduces optional hot_state_merkle_db in AptosDB/StateStore; proof APIs now take use_hot_state to read from hot or regular state
Extends StateMerkleDb to support hot vs. regular instances (separate folder/CF names), deletion-on-restart, and checkpointing for both
Configuration: new HotStateConfig { max_items_per_shard, delete_on_restart }, path overrides for hot_state_merkle_db, and StorageDirPaths getters
AptosDB::open/open_internal/open_dbs gain reset_hot_state and return the optional hot DB; all call sites updated (genesis, tests, benchmarks, tools)
Storage interface and mocks updated (add use_hot_state params); new HotStateError for missing/misconfigured hot state
DB debugger and truncate tools updated to handle new DB shape; default test/bench runners reset hot state on open

^{Written by Cursor Bugbot for commit 97680f2. This will update automatically on new commits. Configure here.}

storage/aptosdb/src/state_merkle_db.rs

config/src/config/storage_config.rs

storage/aptosdb/src/state_merkle_db.rs

This part of the config is exposed via the `get_dir_paths` API.

In order to compute the hot state root hash for each block (including the speculative ones), we have to be able to return Merkle proof for any given key and version. The most straightforward way to do this is to add another instance of `StateMerkleDb` in `StateStore`. For now, we always delete and reset the DB on restart. This will give us more flexibility to make changes.

github-actions · 2026-01-08T05:56:37Z

✅ Forge suite `realistic_env_max_load` success on `97680f2cb8cc28da7b2e71dc533a960114048624`

two traffics test: inner traffic : committed: 13526.98 txn/s, latency: 2788.92 ms, (p50: 2700 ms, p70: 3000, p90: 3100 ms, p99: 3600 ms), latency samples: 5038880
two traffics test : committed: 100.00 txn/s, latency: 754.34 ms, (p50: 700 ms, p70: 800, p90: 800 ms, p99: 1200 ms), latency samples: 1700
Latency breakdown for phase 0: ["MempoolToBlockCreation: max: 2.302, avg: 2.197", "ConsensusProposalToOrdered: max: 0.168, avg: 0.166", "ConsensusOrderedToCommit: max: 0.046, avg: 0.042", "ConsensusProposalToCommit: max: 0.213, avg: 0.208"]
Max non-epoch-change gap was: 0 rounds at version 0 (avg 0.00) [limit 4], 0.43s no progress at version 5773192 (avg 0.07s) [limit 15].
Max epoch-change gap was: 0 rounds at version 0 (avg 0.00) [limit 4], 0.27s no progress at version 2433051 (avg 0.27s) [limit 16].
Test Ok

github-actions · 2026-01-08T05:57:15Z

✅ Forge suite `compat` success on `a09bb94430a970de7bc45fe0d29bd33fd2e5a7d4` ==> `97680f2cb8cc28da7b2e71dc533a960114048624`

Compatibility test results for a09bb94430a970de7bc45fe0d29bd33fd2e5a7d4 ==> 97680f2cb8cc28da7b2e71dc533a960114048624 (PR)
1. Check liveness of validators at old version: a09bb94430a970de7bc45fe0d29bd33fd2e5a7d4
compatibility::simple-validator-upgrade::liveness-check : committed: 9915.40 txn/s, latency: 3643.75 ms, (p50: 2800 ms, p70: 3000, p90: 3600 ms, p99: 13300 ms), latency samples: 418140
2. Upgrading first Validator to new version: 97680f2cb8cc28da7b2e71dc533a960114048624
compatibility::simple-validator-upgrade::single-validator-upgrade : committed: 5948.29 txn/s, latency: 5682.05 ms, (p50: 6200 ms, p70: 6400, p90: 6500 ms, p99: 6800 ms), latency samples: 208040
3. Upgrading rest of first batch to new version: 97680f2cb8cc28da7b2e71dc533a960114048624
compatibility::simple-validator-upgrade::half-validator-upgrade : committed: 5837.77 txn/s, latency: 5774.56 ms, (p50: 6400 ms, p70: 6500, p90: 6600 ms, p99: 6700 ms), latency samples: 200720
4. upgrading second batch to new version: 97680f2cb8cc28da7b2e71dc533a960114048624
compatibility::simple-validator-upgrade::rest-validator-upgrade : committed: 10038.49 txn/s, latency: 3278.13 ms, (p50: 3100 ms, p70: 3800, p90: 4600 ms, p99: 5100 ms), latency samples: 332520
5. check swarm health
Compatibility test for a09bb94430a970de7bc45fe0d29bd33fd2e5a7d4 ==> 97680f2cb8cc28da7b2e71dc533a960114048624 passed
Test Ok

github-actions · 2026-01-08T05:59:54Z

✅ Forge suite `framework_upgrade` success on `a09bb94430a970de7bc45fe0d29bd33fd2e5a7d4` ==> `97680f2cb8cc28da7b2e71dc533a960114048624`

Compatibility test results for a09bb94430a970de7bc45fe0d29bd33fd2e5a7d4 ==> 97680f2cb8cc28da7b2e71dc533a960114048624 (PR)
Upgrade the nodes to version: 97680f2cb8cc28da7b2e71dc533a960114048624
framework_upgrade::framework-upgrade::full-framework-upgrade : committed: 2172.24 txn/s, submitted: 2179.75 txn/s, failed submission: 7.51 txn/s, expired: 7.51 txn/s, latency: 1342.24 ms, (p50: 1200 ms, p70: 1500, p90: 1800 ms, p99: 2400 ms), latency samples: 196760
framework_upgrade::framework-upgrade::full-framework-upgrade : committed: 2186.62 txn/s, submitted: 2193.15 txn/s, failed submission: 6.52 txn/s, expired: 6.52 txn/s, latency: 1353.69 ms, (p50: 1200 ms, p70: 1500, p90: 1800 ms, p99: 2700 ms), latency samples: 194400
5. check swarm health
Compatibility test for a09bb94430a970de7bc45fe0d29bd33fd2e5a7d4 ==> 97680f2cb8cc28da7b2e71dc533a960114048624 passed
Upgrade the remaining nodes to version: 97680f2cb8cc28da7b2e71dc533a960114048624
framework_upgrade::framework-upgrade::full-framework-upgrade : committed: 2334.28 txn/s, submitted: 2343.07 txn/s, failed submission: 8.79 txn/s, expired: 8.79 txn/s, latency: 1253.84 ms, (p50: 1200 ms, p70: 1500, p90: 1700 ms, p99: 2100 ms), latency samples: 212461
Test Ok

This commit implements the logic to compute root hashes for hot state. It's an initial version with a number of TODO items, but it's a reasonable start. Most of the logic is gated behind a flag so we are not enabling this in mainnet yet, until we run things in testnet for some time and gain more confidence. How this roughly works: - `State::update` computes the changes to hot state. They are saved in `ExecutionOutput`. - The above changes are passed to `StateSummary::update` and used to compute in-memory `SparseMerkleTree`. - The resulting Merkle trees are committed to persisted database (`hot_state_merkle_db` introduced in #18385) so the proofs can be used in turn for future `StateSummary::update`.

This was referenced Dec 23, 2025

[Hot State] Use config to replace hard-coded parameters #18366

Merged

[Hot State] Delete unused code #18365

Merged

wqfish force-pushed the pr18385 branch 3 times, most recently from e43e00d to bf1a4dd Compare December 24, 2025 00:08

wqfish changed the title ~~[Hot State] Add separate StateMerkleDb for hot state~~ [Hot State] Add a separate StateMerkleDb for hot state Dec 24, 2025

wqfish force-pushed the pr18385 branch 2 times, most recently from e92b3e4 to 6c89feb Compare December 24, 2025 00:32

wqfish marked this pull request as ready for review December 24, 2025 00:33

wqfish requested review from 0xmaayan, JoshLind, banool, grao1991, gregnazario and lightmark as code owners December 24, 2025 00:33

wqfish requested a review from zekun000 December 24, 2025 00:33

cursor bot reviewed Dec 24, 2025

View reviewed changes

storage/aptosdb/src/state_merkle_db.rs Outdated Show resolved Hide resolved

wqfish force-pushed the pr18385 branch from 6c89feb to 575a496 Compare December 24, 2025 04:52

This was referenced Dec 26, 2025

[Layered Map] Expose inner layers #18353

Merged

[Hot State] Compute root hash for hot state #18390

Merged

wqfish added the CICD:run-e2e-tests when this label is present github actions will run all land-blocking e2e tests from the PR label Dec 27, 2025

wqfish force-pushed the pr18385 branch from 575a496 to df9f996 Compare December 27, 2025 20:14

cursor bot reviewed Dec 27, 2025

View reviewed changes

config/src/config/storage_config.rs Show resolved Hide resolved

This comment has been minimized.

Sign in to view

wqfish force-pushed the pr18385 branch from df9f996 to 7eea285 Compare January 1, 2026 19:28

cursor bot reviewed Jan 1, 2026

View reviewed changes

storage/aptosdb/src/state_merkle_db.rs Show resolved Hide resolved

This comment has been minimized.

Sign in to view

wqfish force-pushed the pr18385 branch from f2b1802 to 2fe33a8 Compare January 8, 2026 04:02

[Storage Config] Make db path override related fields private

affa1f1

This part of the config is exposed via the `get_dir_paths` API.

wqfish force-pushed the pr18385 branch 2 times, most recently from 375c1a9 to 19548d3 Compare January 8, 2026 04:38

This comment has been minimized.

Sign in to view

wqfish force-pushed the pr18385 branch from 19548d3 to 97680f2 Compare January 8, 2026 05:06

wqfish enabled auto-merge (rebase) January 8, 2026 05:16

This comment has been minimized.

Sign in to view

wqfish merged commit 5524f4d into main Jan 8, 2026
101 of 104 checks passed

wqfish deleted the pr18385 branch January 8, 2026 06:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Hot State] Add a separate StateMerkleDb for hot state #18385

[Hot State] Add a separate StateMerkleDb for hot state #18385

Uh oh!

wqfish commented Dec 23, 2025 •

edited by cursor bot

Loading

Uh oh!

Uh oh!

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

github-actions bot commented Jan 8, 2026

Uh oh!

github-actions bot commented Jan 8, 2026

Uh oh!

github-actions bot commented Jan 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Hot State] Add a separate StateMerkleDb for hot state #18385

[Hot State] Add a separate StateMerkleDb for hot state #18385

Uh oh!

Conversation

wqfish commented Dec 23, 2025 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

Uh oh!

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

This comment has been minimized.

github-actions bot commented Jan 8, 2026

✅ Forge suite realistic_env_max_load success on 97680f2cb8cc28da7b2e71dc533a960114048624

Uh oh!

github-actions bot commented Jan 8, 2026

✅ Forge suite compat success on a09bb94430a970de7bc45fe0d29bd33fd2e5a7d4 ==> 97680f2cb8cc28da7b2e71dc533a960114048624

Uh oh!

github-actions bot commented Jan 8, 2026

✅ Forge suite framework_upgrade success on a09bb94430a970de7bc45fe0d29bd33fd2e5a7d4 ==> 97680f2cb8cc28da7b2e71dc533a960114048624

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

wqfish commented Dec 23, 2025 •

edited by cursor bot

Loading

✅ Forge suite `realistic_env_max_load` success on `97680f2cb8cc28da7b2e71dc533a960114048624`

✅ Forge suite `compat` success on `a09bb94430a970de7bc45fe0d29bd33fd2e5a7d4` ==> `97680f2cb8cc28da7b2e71dc533a960114048624`

✅ Forge suite `framework_upgrade` success on `a09bb94430a970de7bc45fe0d29bd33fd2e5a7d4` ==> `97680f2cb8cc28da7b2e71dc533a960114048624`