Bug Fix: Make batch production non-optional during recovery. Add sequencer proof processing tests by preston-evans98 · Pull Request #2605 · Sovereign-Labs/sovereign-sdk

preston-evans98 · 2026-03-18T19:06:23Z

Description

This PR rewrites and expands the sequencer proof processing test suite to include resync and recovery, as well as handling of proofs that arrive while an open batch is in progress. This should fix the flaky test as well as improving coverage.

The new tests uncovered a pre-existing bug in recovery mode. The sequencer used produce_batch_if_convenient, but assumed that the requested batches were always produced. Trigger a failure was easier thanks to proofs (the blob sender is much more likely to be busy) which allowed the new tests to detect the issue. This PR fixes the issue by removing the "is_convenient" checks from batch production during recovery mode only.

I have updated CHANGELOG.md with a new entry if my PR makes any breaking changes or fixes a bug. If my PR removes a feature or changes its behavior, I provide help for users on how to migrate to the new behavior.
I have carefully reviewed all my Cargo.toml changes before opening the PRs. (Are all new dependencies necessary? Is any module dependency leaked into the full-node (hint: it shouldn't)?)

Linked Issues

Enable proof processing in sequencer #2571

Expose zk proof progress through a shared manager status and a controlled prover/blueprint path so sequencer tests can hold and release aggregate proof publication deterministically. Refactor proof progress tracking into a helper so status updates stay coupled to state changes.

bkolad · 2026-03-19T12:25:57Z

crates/full-node/sov-stf-runner/src/processes/zk_manager/mod.rs

            // Update the next height to receive
            self.stf_info_receiver
                .inc_next_height_to_receive_by(num_proofs_to_create as u64);
+            self.proof_progress.finish_aggregate_proof_creation();


begin_aggregate_proof_creation sets aggregate_proof_in_flight = true, and finish_aggregate_proof_creation clears it. However, if create_aggregate_proof_with_retries or publish_proof_blob_with_metadata fails in this code block, the flag is never reset.

I think this is fine as the ZkProofManager just exits if this happens

bkolad · 2026-03-19T12:55:41Z

crates/utils/sov-test-utils/src/test_rollup.rs

 /// background to test node APIs.
 #[derive(Clone)]
 pub struct RollupBuilder<R: FullNodeBlueprint<Native>> {
+    blueprint: Arc<R>,


This will make RollupBuilder stateful, which means some state can persist across restarts.

builder.with_blueprint(..); let rollup = builder.start() let builder = rollup.shutdown() / rollup.restart() rollup_after_restart = builder.star() rollup_after_restart <- inherited r`eady_proof_count`

In this flow, rollup_after_restart may inherit state from the previous run. For example, in ManualProofPostingRtAgnosticBlueprint, we keep shared state such as ready_proof_count, available_releases, and is_open. That shared state would also be inherited by rollup_after_restart.

Would it be possible to add an Option<R> in start_test_rollup. So we would provide the bluerint only when TestRollup is created?

Or we could jut add another method start_test_rollup_with_manual_proof_posting and call ManualProofPostingRtAgnosticBlueprint::<TestSpec, RT>::new_with_control() in it.

bkolad · 2026-03-19T13:27:50Z

crates/full-node/sov-sequencer/src/preferred/mod.rs

-                    .trigger_batch_production_if_convenient_msg(
-                        "recover_and_catch_up:dump_catchup_batches",
-                    )
+                    .trigger_batch_production_msg("recover_and_catch_up:dump_catchup_batches")


One of the checks that is skipped now is:

if !self.seq_config.automatic_batch_production { warn!("Skipping batch production due to settings"); return; }

This means that the sequencer will produce batches during recovery, although it is configured to never do it. This only matters in tests, but it can still be confusing. Let's log an error if we hit recovery and automatic_batch_production = true?

Another option: we could panic when this happens, and cfg!(debug_assertions) = true

preston-evans98 added 6 commits March 17, 2026 19:05

Improve tests

97e6edc

Fix sequencer recovery batch production bug

ac85ae7

Fmt

4ba0f31

Cleanup test code

84012ad

Hopefully fix flake. Add logging on worrying failure

0c93c06

bkolad reviewed Mar 19, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug Fix: Make batch production non-optional during recovery. Add sequencer proof processing tests#2605

Bug Fix: Make batch production non-optional during recovery. Add sequencer proof processing tests#2605
preston-evans98 wants to merge 6 commits intodevfrom
preston/sequencer-proof-tests

preston-evans98 commented Mar 18, 2026

Uh oh!

bkolad Mar 19, 2026

Uh oh!

bkolad Mar 19, 2026

Uh oh!

bkolad Mar 19, 2026 •

edited

Loading

Uh oh!

bkolad Mar 19, 2026

Uh oh!

bkolad Mar 19, 2026 •

edited

Loading

Uh oh!

bkolad Mar 19, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

preston-evans98 commented Mar 18, 2026

Description

Linked Issues

Uh oh!

bkolad Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

bkolad Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

bkolad Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

bkolad Mar 19, 2026

Choose a reason for hiding this comment

Uh oh!

bkolad Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

bkolad Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

bkolad Mar 19, 2026 •

edited

Loading

bkolad Mar 19, 2026 •

edited

Loading

bkolad Mar 19, 2026 •

edited

Loading