Implement the `RPCBlockHeaderSubscriber` for indexing finalized results #728

m-Peter · 2025-01-21T17:17:59Z

Closes: #727

Description

For contributor use:

Targeted PR against master branch
Linked to Github issue with discussion and accepted design OR link to spec that describes this work.
Code follows the standards mentioned here.
Updated relevant documentation
Re-reviewed Files changed in the Github PR explorer
Added appropriate labels

Summary by CodeRabbit

New Features
- Two experimental CLI flags to enable soft-finality behavior and optional sealing verification.
- Live block-tracking ingestion with optional sealing verification and backfill support.
- Persistent event-hash store and batched DB operations to support verification and recovery.
Bug Fixes / Improvements
- Clearer, context-rich error messages for trace collection/storage.
- Improved cross-spork block/event retrieval and subscriptions; added accessors for block event payloads.
Tests
- Expanded and refactored cross-spork client tests for range/gap scenarios.
Chores
- More verbose, consolidated dependency tidying in build checks.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

coderabbitai · 2025-01-21T17:18:07Z

Walkthrough

Adds experimental soft-finality indexing: new RPCBlockTrackingSubscriber and SealingVerifier, EventsHash Pebble storage and batch helpers, CrossSporkClient methods for block headers/events, CLI flags to enable features, and related bootstrap, storage, model, and test updates.

Changes

Cohort / File(s)	Summary
Configuration & CLI `config/config.go`, `cmd/run/cmd.go`	Add `ExperimentalSoftFinalityEnabled` and `ExperimentalSealingVerificationEnabled` config fields and CLI flags; wire flags into runtime config.
Bootstrap & Orchestration `bootstrap/bootstrap.go`	Wire `EventsHash` into storages; choose between `RPCBlockTrackingSubscriber` (with optional `SealingVerifier`) and `RPCEventSubscriber` based on experimental flags; use events hash to set/remove processed sealed height when forcing start.
Sealing Verification `services/ingestion/sealing_verifier.go`	New `SealingVerifier` engine to compute/compare hashes for unsealed vs sealed events, backfill sealed history, maintain caches, and advance processed sealed height.
Block Tracking & Subscription `services/ingestion/block_tracking_subscriber.go`	New `RPCBlockTrackingSubscriber` to subscribe to block headers from start height, backfill spork ranges, fetch events per block, coordinate with `SealingVerifier`, and handle reconnection/error/recovery.
Cross-Spork Client `services/requester/cross-spork_client.go`, `services/requester/cross-spork_client_test.go`	Add `GetEventsForBlockHeader`, `SubscribeBlockHeadersFromStartHeight`, spork-root helpers (`IsSporkRootBlockHeight`, `CurrentSporkRootHeight`), update initialization to capture spork root height, and extend tests for spork-range gaps.
Event Hash Storage (Pebble) `storage/pebble/events_hash.go`, `storage/pebble/keys.go`, `storage/pebble/db.go`, `storage/pebble/storage.go`	New `EventsHash` type and keys (`eventsHashKey`, `sealedEventsHeightKey`); add `WithBatch` helper, batched set/remove processed sealed height, and internal `delete` helper.
Engine / Models / Tests `services/ingestion/engine.go`, `models/events.go`, `services/testutils/mock_client.go`	Improve transaction trace error messages, add `CadenceEvents.BlockEvents()` accessor, and include `SporkRootBlockHeight` in test mock node version info.
Build / Makefile `Makefile`	Use verbose `go mod tidy -v` and consolidate test tidy invocation.

Sequence Diagram(s)

sequenceDiagram
    autonumber
    actor Bootstrap
    participant BTS as RPCBlockTrackingSubscriber
    participant SV as SealingVerifier
    participant Cross as CrossSporkClient
    participant EH as EventsHash
    Note over Bootstrap,BTS: startup & wiring based on experimental flags

    Bootstrap->>BTS: Subscribe(ctx, startHeight)
    BTS->>Cross: SubscribeBlockHeadersFromStartHeight(startHeight)
    alt Backfill phase
        BTS->>Cross: Get historical block headers/events
        BTS->>SV: notify backfill complete
    end
    loop Live streaming
        Cross-->>BTS: BlockHeader
        BTS->>Cross: GetEventsForBlockHeader(header)
        Cross-->>BTS: Block events (unsealed)
        BTS->>SV: AddFinalizedBlock(unsealed events)
        SV->>EH: Read/Set processed sealed height
        SV->>Cross: (sealed stream subscription) sealed events arrive
        SV->>SV: compute sealed/unsealed hashes & verify
        alt hashes match
            SV->>EH: SetProcessedSealedHeight(height)
        else mismatch
            SV->>BTS: signal verification error / recovery
        end
    end

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Areas needing extra attention:

services/ingestion/sealing_verifier.go — concurrency, caches, backfill, canonical hashing correctness.
services/ingestion/block_tracking_subscriber.go — backfill/subscribe/reconnect logic and verifier interaction.
services/requester/cross-spork_client.go — spork-root height handling and client selection across ranges.
storage/pebble/events_hash.go & db.go — batch atomicity, removal logic, and error handling.

Possibly related PRs

Sync soft-finality branch with changes from main #848 — Very closely related; overlaps EventsHash, SealingVerifier, RPCBlockTrackingSubscriber and wiring.
Update flow-go #673 — Related event ingestion and cross-spork request handling changes.
Batch index updates #332 — Related pebble batch/storage API changes used by EventsHash.

Suggested labels

Feature, EVM

Suggested reviewers

zhangchiqing
peterargue
janezpodhostnik

Poem

🐇 I hop through hashes, light and spry,

I match sealed echoes to unsealed sky,
I guard each bloom of event and height,
Soft-finality hums into the night,
A rabbit's wink — the index stays right.

Pre-merge checks and finishing touches

❌ Failed checks (2 warnings)

Check name	Status	Explanation	Resolution
Title check	⚠️ Warning	The title references 'RPCBlockHeaderSubscriber' but the PR implements 'RPCBlockTrackingSubscriber' - the actual implemented class name differs from the title.	Update the title to accurately reflect the implemented component name: 'Implement the RPCBlockTrackingSubscriber for indexing finalized results' or similar.
Docstring Coverage	⚠️ Warning	Docstring coverage is 50.00% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Linked Issues check	✅ Passed	All core requirements from issue #727 are implemented: subscribing to finalized block headers via RPCBlockTrackingSubscriber, retrieving EVM events per block via GetEventsForBlockHeader, and indexing them normally with optional sealing verification.
Out of Scope Changes check	✅ Passed	All changes directly support the core objective. Supporting infrastructure includes storage abstractions (EventsHash, pebble.WithBatch), configuration flags for experimental features, and test utilities - all necessary for the feature implementation.

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch mpeter/poc-index-finalized-block-results

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

peterargue

added a couple small comments, but otherwise this looks good. let's get it running against one of the live networks.

services/ingestion/block_header_subscriber.go

m-Peter · 2025-01-23T15:29:54Z

@peterargue Thanks for the review 🙌 I've addressed all the comments.
Did you perhaps have the chance to take a look at the behavior described in #727 (comment) ? This is puzzling to me 🤔

services/ingestion/block_tracking_subscriber.go

…ersFromStartHeight

…or function resuse

…empty EVM blocks

Co-authored-by: Peter Argue <[email protected]>

…ange

…lity-with-main

Sync Soft Finality with changes from `main` branch

…lized-block-results

…finality [Back-port] Bump flow-go version to `v0.43.0`

coderabbitai

Actionable comments posted: 0

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)

go.mod (1)
3-3: Invalid go directive: use major.minor (no patch).

The go version in go.mod must be like 1.25, not 1.25.0. This breaks tooling.

Apply:
-go 1.25.0
+go 1.25
tests/go.mod (1)
3-3: Invalid go directive in tests module.

Same as root: use 1.25, not 1.25.0.

Apply:
-go 1.25.0
+go 1.25

🧹 Nitpick comments (4)

tests/go.mod (4)
14-14: Version drift: zerolog differs between root (v1.33.0) and tests (v1.34.0).

Not harmful, but avoid dual versions unless needed. Consider aligning to one version across modules.

Apply (either direction):
-	github.com/rs/zerolog v1.34.0
+	github.com/rs/zerolog v1.33.0
181-184: Prometheus stack minor bumps only in tests: consider aligning with root.

To reduce duplicate versions and metric wiring surprises, align client_golang, common, and procfs with the root module.

233-233: Duplicate YAML modules in graph.

Both go.yaml.in/yaml/v2 (v2.4.2) and gopkg.in/yaml.v2 (v2.4.0 at Line 252) appear. Prefer a single path if possible; if transitive, consider bumping deps to converge, then go mod tidy.

249-250: gRPC/protobuf patch drift vs root.

Tests use grpc v1.75.1 and protobuf v1.36.8; root has v1.75.0 and v1.36.7. Harmless, but aligning reduces duplicate loads and avoids subtle ABI issues.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between a8f7437 and efd12b0.

⛔ Files ignored due to path filters (2)

go.sum is excluded by !**/*.sum
tests/go.sum is excluded by !**/*.sum

📒 Files selected for processing (2)

go.mod (2 hunks)
tests/go.mod (7 hunks)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)

GitHub Check: Lint
GitHub Check: Test

🔇 Additional comments (4)

go.mod (2)

14-14: flow-go bump to v0.43.0: confirm API compatibility.

v0.43.0 likely changes Access API/proto shapes. Please confirm RPCBlockHeaderSubscriber/CrossSporkClient compile and run against this version in CI, and that protobuf-gen outputs were refreshed.

148-156: Contract/template/protobuf version sync looks good; verify event types.

Upgrading core-contracts, templates, NFT, and flow protobuf to newer versions is consistent with finalized-header indexing. Please verify emitted EVM event type identifiers and payload fields used by the indexer didn’t change, and run go mod tidy -compat=1.25 to lock a coherent graph.

tests/go.mod (2)

10-10: Emulator pseudo‑version: does it expose finalized‑header subscription?

You noted E2E is blocked because the Emulator lacks this endpoint. This update suggests a very recent pseudo‑version; please confirm whether it now implements finalized‑header subscription. If not, gate/skip tests that depend on it to keep CI green.

12-12: Align flow-go version with root module (OK).

Tests now use v0.43.0, matching the root. Good for API parity.

Run go mod tidy -compat=1.25 in both modules to ensure the graph is minimal and consistent.

…lized-block-results

coderabbitai

Actionable comments posted: 0

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

bootstrap/bootstrap.go (1)

618-634: API verification confirms the atomicity concern is valid: SetProcessedSealedHeight creates its own batch and cannot participate in atomic writes, while SetLatestCadenceHeight accepts an external batch.

The issue is real. SetProcessedSealedHeight(height uint64) only accepts the height parameter and internally calls WithBatch(e.store, func(batch *pebbleDB.Batch) error {...}), creating and committing its own batch immediately. In contrast, SetLatestCadenceHeight(height uint64, batch *pebble.Batch) accepts an external batch parameter, allowing it to participate in the deferred atomic commit. If batch.Commit() fails after SetProcessedSealedHeight has already persisted, the two stores will be left inconsistent. The suggested deferred approach is the correct solution given the API constraints.

🧹 Nitpick comments (2)

bootstrap/bootstrap.go (2)
119-137: Use the provided ctx instead of context.Background

Propagate cancellation/timeouts from StartEventIngestion.

Apply:
-	latestCadenceBlock, err := b.client.GetLatestBlock(context.Background(), true)
+	latestCadenceBlock, err := b.client.GetLatestBlock(ctx, true)
@@
-	_, err = b.client.GetBlockHeaderByHeight(context.Background(), latestCadenceHeight)
+	_, err = b.client.GetBlockHeaderByHeight(ctx, latestCadenceHeight)
157-187: Nil-verifier safety is already properly handled; logging suggestion remains optional

The existing code already guards all verifier dereferences with nil-checks (block_tracking_subscriber.go lines 110 and 195), and StartEventIngestion is correctly called after StartAPIServer. The suggested logging additions from the diff have not been applied—these remain optional for improved observability.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between a404eab and 6552a69.

📒 Files selected for processing (1)

bootstrap/bootstrap.go (5 hunks)

🧰 Additional context used

🧬 Code graph analysis (1)

bootstrap/bootstrap.go (5)

storage/pebble/events_hash.go (2)

EventsHash (14-16)

NewEventsHash (18-22)

services/ingestion/event_subscriber.go (2)

EventSubscriber (27-33)

NewRPCEventSubscriber (49-65)

services/ingestion/sealing_verifier.go (2)

SealingVerifier (30-54)

NewSealingVerifier (57-82)

services/ingestion/block_tracking_subscriber.go (1)

NewRPCBlockTrackingSubscriber (47-65)

models/errors/errors.go (1)

ErrStorageNotInitialized (36-36)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)

GitHub Check: Test
GitHub Check: Lint

🔇 Additional comments (3)

bootstrap/bootstrap.go (3)

60-61: Add EventsHash to Storages — LGTM

Field is appropriately scoped and enables verifier wiring.

607-608: Create EventsHash — LGTM

Initialization matches the Storage instance.

697-698: Wire EventsHash into Storages — LGTM

Storage bundle exposes EventsHash for verifier/consumers.

[Back-port] Update to latest `flow-go` for Forte upgrade

Handle missing evm block in spork root block

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (4)

services/ingestion/block_tracking_subscriber.go (2)
143-294: Consider adding exponential backoff for reconnection attempts.

The reconnection logic on Line 279 attempts to reconnect on NotFound errors without any backoff delay. Consider adding exponential backoff to avoid tight reconnection loops during transient failures.

Example enhancement:
+	reconnectDelay := 200 * time.Millisecond
+	maxReconnectDelay := 5 * time.Second
+
 				if err := connect(lastReceivedHeight + 1); err != nil {
+					time.Sleep(reconnectDelay)
+					reconnectDelay = reconnectDelay * 2
+					if reconnectDelay > maxReconnectDelay {
+						reconnectDelay = maxReconnectDelay
+					}
 					eventsChan <- models.NewBlockEventsError(
353-369: Add maximum retry limit to prevent infinite loops.

The retry loop on Lines 353-369 can run indefinitely on persistent NotFound or ResourceExhausted errors. Consider adding a maximum retry count with exponential backoff to prevent infinite loops during prolonged outages.

Example enhancement:
+	const maxRetries = 30
+	retryCount := 0
+	retryDelay := 200 * time.Millisecond
+
 	// retry until we get the block from an execution node that has the events
 	for {
 		evts, err = r.client.GetEventsForBlockHeader(
 			ctx,
 			eventType,
 			blockHeader,
 		)
 		if err != nil {
 			// retry after a short pause
 			if status.Code(err) == codes.NotFound || status.Code(err) == codes.ResourceExhausted {
+				retryCount++
+				if retryCount > maxRetries {
+					return flow.BlockEvents{}, fmt.Errorf("max retries exceeded after %d attempts: %w", maxRetries, err)
+				}
-				time.Sleep(200 * time.Millisecond)
+				time.Sleep(retryDelay)
+				retryDelay = retryDelay * 2
+				if retryDelay > 5*time.Second {
+					retryDelay = 5*time.Second
+				}
 				continue
 			}
storage/pebble/events_hash.go (1)
47-62: Document the sequential insertion assumption.

The BatchRemoveAboveHeight method assumes event hashes are inserted sequentially with no gaps (line 52-53 comment). Consider adding a top-level comment on the EventsHash struct documenting this invariant, as it's critical for the correctness of this deletion logic.

Example enhancement:
+// EventsHash provides storage for per-height event hashes.
+//
+// INVARIANT: Event hashes are stored sequentially with no gaps in height.
+// This invariant is relied upon by BatchRemoveAboveHeight for efficient cleanup.
 type EventsHash struct {
 	store *Storage
 }
services/ingestion/sealing_verifier.go (1)

40-48: Consider monitoring cache sizes for production deployment.

The unsealedBlocksToVerify and sealedBlocksToVerify maps have no explicit size bounds. While normal operation should keep these small (cleaned up in verifyBlock at lines 415-416), a sustained divergence between sealed and unsealed streams could cause unbounded growth.

For production use, consider adding metrics to monitor cache sizes and alerts for divergence thresholds.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between e302a12 and 5c97d88.

📒 Files selected for processing (9)

bootstrap/bootstrap.go (5 hunks)
models/events.go (1 hunks)
services/ingestion/block_tracking_subscriber.go (1 hunks)
services/ingestion/sealing_verifier.go (1 hunks)
services/requester/cross-spork_client.go (7 hunks)
services/requester/cross-spork_client_test.go (2 hunks)
services/testutils/mock_client.go (1 hunks)
storage/pebble/events_hash.go (1 hunks)
storage/pebble/storage.go (1 hunks)

🧰 Additional context used

🧠 Learnings (2)

📚 Learning: 2025-10-06T10:14:49.706Z

Learnt from: m-Peter
PR: onflow/flow-evm-gateway#890
File: services/ingestion/event_subscriber.go:235-239
Timestamp: 2025-10-06T10:14:49.706Z
Learning: In services/ingestion/event_subscriber.go, when reconnecting after disconnect errors (DeadlineExceeded, Internal, Unavailable), the subscription should reconnect at lastReceivedHeight rather than lastReceivedHeight+1. This avoids errors when the next height doesn't exist yet, and duplicate event processing is safe because the ingestion engine is explicitly designed to be idempotent (storage uses batch.Set() which overwrites existing entries).

Applied to files:

services/ingestion/block_tracking_subscriber.go

📚 Learning: 2025-03-07T01:35:09.751Z

Learnt from: peterargue
PR: onflow/flow-evm-gateway#772
File: services/requester/keystore/key_store.go:50-62
Timestamp: 2025-03-07T01:35:09.751Z
Learning: In the flow-evm-gateway codebase, panics are acceptable in scenarios where immediate detection of critical bugs is desired during development and testing, particularly for invariant violations that should never occur in a correctly functioning system (e.g., when a key is available but locked in the keystore implementation).

Applied to files:

services/ingestion/block_tracking_subscriber.go

🧬 Code graph analysis (6)

storage/pebble/events_hash.go (3)

storage/pebble/storage.go (1)

Storage (13-16)

storage/pebble/db.go (1)

WithBatch (65-84)

models/errors/errors.go (2)

ErrEntityNotFound (38-38)

ErrStorageNotInitialized (36-36)

services/ingestion/sealing_verifier.go (5)

services/ingestion/engine.go (1)

Engine (40-57)

models/engine.go (2)

EngineStatus (19-23)

NewEngineStatus (25-31)

services/requester/cross-spork_client.go (1)

CrossSporkClient (111-116)

storage/pebble/events_hash.go (1)

EventsHash (14-16)

models/errors/errors.go (3)

ErrStorageNotInitialized (36-36)

ErrDisconnected (24-24)

ErrEntityNotFound (38-38)

services/ingestion/block_tracking_subscriber.go (6)

services/ingestion/event_subscriber.go (2)

RPCEventSubscriber (37-47)

NewRPCEventSubscriber (49-65)

services/ingestion/sealing_verifier.go (1)

SealingVerifier (30-54)

services/requester/cross-spork_client.go (1)

CrossSporkClient (111-116)

services/requester/keystore/key_store.go (1)

KeyLock (21-37)

models/events.go (3)

BlockEvents (213-216)

NewBlockEventsError (257-261)

NewSingleBlockEvents (240-255)

models/errors/errors.go (1)

ErrDisconnected (24-24)

bootstrap/bootstrap.go (5)

storage/pebble/events_hash.go (2)

EventsHash (14-16)

NewEventsHash (18-22)

services/ingestion/event_subscriber.go (2)

EventSubscriber (27-33)

NewRPCEventSubscriber (49-65)

services/ingestion/sealing_verifier.go (2)

SealingVerifier (30-54)

NewSealingVerifier (57-82)

services/ingestion/block_tracking_subscriber.go (1)

NewRPCBlockTrackingSubscriber (48-66)

models/errors/errors.go (1)

ErrStorageNotInitialized (36-36)

services/requester/cross-spork_client_test.go (2)

services/testutils/mock_client.go (1)

SetupClientForRange (64-77)

services/requester/cross-spork_client.go (1)

NewCrossSporkClient (120-178)

services/requester/cross-spork_client.go (1)

models/events.go (1)

BlockEvents (213-216)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Test

🔇 Additional comments (33)

services/testutils/mock_client.go (1)

100-101: LGTM! Test mock aligned with spork root height changes.

Setting both NodeRootBlockHeight and SporkRootBlockHeight to startHeight provides consistent test behavior and aligns with the broader PR changes that introduce spork root height handling in the CrossSporkClient.

storage/pebble/storage.go (1)

61-65: LGTM! Batch delete helper follows established patterns.

The new delete method mirrors the existing set method's signature and batch-oriented design, enabling atomic deletion operations used by the EventsHash storage cleanup logic.

models/events.go (1)

155-158: LGTM! Clean accessor for underlying Flow events.

The new BlockEvents() getter provides a straightforward way to access the internal flow.BlockEvents, supporting the integration with the SealingVerifier workflow.

services/requester/cross-spork_client_test.go (1)

76-172: LGTM! Enhanced test coverage for spork boundary scenarios.

The test restructuring into explicit subtests improves clarity and adds valuable coverage for:

Continuous range validation between past and current sporks

Gap detection between spork root and node root heights

Gap detection between past spork end and current spork root

These scenarios align well with the updated CrossSporkClient validation logic.

bootstrap/bootstrap.go (5)

60-60: LGTM! EventsHash field added to Storages.

The new EventsHash field integrates the events hash storage mechanism into the Storages struct, enabling verification workflows.

157-187: LGTM! Subscriber selection logic properly gated by experimental flags.

The conditional logic appropriately selects between RPCBlockTrackingSubscriber (with optional SealingVerifier) and the original RPCEventSubscriber based on the experimental flags. The verifier is correctly instantiated only when both soft finality and sealing verification are enabled.

607-607: LGTM! EventsHash properly initialized.

The EventsHash is correctly constructed using NewEventsHash(store) during storage setup.

621-636: LGTM! Proper cleanup of EventsHash when forcing start height.

The logic correctly:

Retrieves the processed sealed height

Updates it to ForceStartCadenceHeight if it's higher

Removes event hashes above the forced height

This ensures consistency when reindexing from a specific height.

700-700: LGTM! EventsHash included in returned Storages.

The EventsHash is properly included in the returned Storages struct, completing the wiring.

services/ingestion/block_tracking_subscriber.go (6)

24-25: LGTM! Well-defined error types for specific failure scenarios.

The error definitions clearly distinguish between system transaction failures and spork root block edge cases.

42-46: LGTM! Clear struct composition with optional verifier.

The struct properly extends RPCEventSubscriber and includes an optional SealingVerifier for experimental verification workflows.

48-66: LGTM! Constructor properly initialized.

The constructor correctly composes the base RPCEventSubscriber and includes the optional verifier.

73-137: LGTM! Subscribe method handles backfill and verifier lifecycle.

The implementation correctly:

Handles backfill for past sporks

Feeds backfilled events to the verifier

Starts the verifier after backfilling (since backfilled data is sealed)

Transitions to live subscription

The verifier lifecycle management is appropriate for the PoC workflow.

296-342: LGTM! Event fetching logic properly handles block and transaction events.

The implementation correctly:

Fetches block events and checks for empty transaction hash

Handles ErrSystemTransactionFailed gracefully by continuing to fetch transaction events

Combines block and transaction events for processing

The error handling for system transaction failures is appropriate.

380-410: LGTM! Proper validation of block executed events.

The logic correctly:

Validates the expected number of block events

Handles the spork root block edge case

Verifies system transaction status when events are missing

Returns appropriate errors for unexpected scenarios

The defensive checks ensure data integrity.

storage/pebble/events_hash.go (5)

14-22: LGTM! Clean constructor for EventsHash storage.

The struct and constructor follow established pebble storage patterns.

24-35: LGTM! Store method uses batch operations for atomicity.

The method correctly uses WithBatch to ensure atomic storage of event hashes with proper error wrapping.

37-44: LGTM! GetByHeight properly retrieves and converts hashes.

The method correctly retrieves the stored bytes and converts them to flow.Identifier with appropriate error handling.

64-74: LGTM! Proper handling of uninitialized storage.

The ProcessedSealedHeight method correctly distinguishes between uninitialized storage (ErrStorageNotInitialized) and other errors.

76-87: LGTM! Batch variants support atomic operations.

Both SetProcessedSealedHeight and BatchSetProcessedSealedHeight are properly implemented, with the former delegating to the batch variant via WithBatch.

services/requester/cross-spork_client.go (6)

112-115: LGTM! Struct updated with spork root height field.

The currentSporkRootHeight field properly captures the spork root block height for boundary checks.

126-149: LGTM! Proper distinction between spork root and node root heights.

The initialization logic correctly:

Captures both SporkRootBlockHeight and NodeRootBlockHeight

Logs a warning when they differ (indicating missing blocks in the gap)

Documents the rationale for using SporkRootBlockHeight for checks

This handles the edge case where nodes are bootstrapped after the spork root block.

163-170: LGTM! Validation ensures continuous height ranges.

The validation correctly checks that:

Past spork clients form a continuous range

The last past spork ends at the node root block height (the first available height on the node)

This ensures no gaps in the queryable height range.

182-193: LGTM! New public methods expose spork root height information.

The new methods IsSporkRootBlockHeight and CurrentSporkRootHeight provide clean APIs for querying spork boundary information used by the block tracking subscriber.

288-298: LGTM! GetEventsForBlockHeader enables finalized block queries.

The method correctly uses GetEventsForBlockIDs (rather than the height range endpoint) to allow querying finalized but unsealed blocks, which is essential for the soft-finality feature.

300-315: LGTM! Block header subscription properly exposed.

The SubscribeBlockHeadersFromStartHeight method correctly:

Delegates to the appropriate client for the start height

Casts to grpc.Client to access the subscription API

Returns appropriate errors if the cast fails

This enables the block tracking subscriber to follow finalized blocks.

services/ingestion/sealing_verifier.go (7)

56-82: Constructor initialization looks correct.

The logic to compute lastProcessedUnsealedHeight from startHeight is appropriate, correctly handling the edge case when startHeight is 0.

139-164: Connection retry logic is reasonable for a PoC.

The string-based error matching (lines 153-154) is fragile but may be the only option if the Access API doesn't provide structured error types. The infinite retry with logging should make issues observable during operation.

Consider verifying whether the Access API provides structured error types that could replace the string matching for more robust error handling.

228-295: Backfill logic correctly handles missing block events.

The length check at line 263 correctly compares the number of BlockEvents entries (one per height) rather than individual events. The accumulation logic (lines 267-288) appropriately handles the edge case where EVM block events are missing by grouping transactions into the next available block.

297-347: Sequential ordering and caching logic is sound.

The CompareAndSwap check (lines 304-308) ensures sealed events are processed in strict sequential order, which is appropriate for this verification use case. The dual condition at line 324 correctly handles both missing unsealed data and rollback scenarios.

The logger.Fatal usage (line 334) was previously flagged in past reviews.

349-393: Unsealed event processing mirrors sealed event logic correctly.

The operation ordering (lines 362-371) is correct: store the hash before updating the height counter to maintain consistency. The symmetric structure with onSealedEvents makes the verification logic easier to understand.

429-477: Hash calculation and canonical ordering implemented correctly.

The canonical sorting (lines 464-469) by transaction index and event index ensures deterministic hash computation. The conversion logic properly handles all required fields including transaction IDs and indices.

102-123: Storage initialization logic handles edge cases correctly.

The initialization at lines 115-117 properly handles the case when startHeight is 0, ensuring lastVerifiedHeight is set appropriately. The fallback to initialize storage (lines 118-120) when ErrStorageNotInitialized is encountered is a good defensive pattern.

coderabbitai · 2025-10-22T15:01:34Z

services/ingestion/sealing_verifier.go

+		case sealedEvents, ok := <-eventsChan:
+			if !ok {
+				if ctx.Err() != nil {
+					return nil
+				}
+				return fmt.Errorf("failed to receive block events: %w", err)
+			}


⚠️ Potential issue | 🔴 Critical

Critical: Undefined variable err at line 192.

When eventsChan closes unexpectedly (line 188), the error message at line 192 attempts to wrap an undefined err variable. The err is only in scope within the errChan case (line 201).

Apply this diff to fix the undefined variable:

case sealedEvents, ok := <-eventsChan: if !ok { if ctx.Err() != nil { return nil } - return fmt.Errorf("failed to receive block events: %w", err) + return fmt.Errorf("sealed events channel closed unexpectedly") }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

case sealedEvents, ok := <-eventsChan:

if !ok {

if ctx.Err() != nil {

return nil

}

return fmt.Errorf("failed to receive block events: %w", err)

}

case sealedEvents, ok := <-eventsChan:

if !ok {

if ctx.Err() != nil {

return nil

}

return fmt.Errorf("sealed events channel closed unexpectedly")

}

🤖 Prompt for AI Agents

In services/ingestion/sealing_verifier.go around lines 187 to 193, the error return wraps an undefined variable `err` when eventsChan is closed; replace the invalid fmt.Errorf(..., err) with a concrete error (for example fmt.Errorf("failed to receive block events: channel closed") or errors.New("events channel closed")) so the function returns a valid error value when the events channel closes unexpectedly.

…lized-block-results

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

bootstrap/bootstrap.go (1)
630-647: Force cadence height enhancements are well-implemented.

The logic correctly handles resetting event hash state when forcing a start height:

Conditionally resets processed sealed height only when it's ahead of the forced height (line 640-644)

Always removes event hash entries above the forced height (line 645-647)

Proper error handling with descriptive messages and wrapping

Batch operations ensure atomicity

Consider clarifying the error message on line 642 to better reflect the operation:
-				return nil, nil, fmt.Errorf("failed to set latest verified sealed height: %w", err)
+				return nil, nil, fmt.Errorf("failed to set processed sealed height to forced start height: %w", err)
The current message says "latest verified sealed height" but we're actually setting it to the forced start height, which may not be the latest.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2862171 and 5baae43.

📒 Files selected for processing (1)

bootstrap/bootstrap.go (5 hunks)

🧰 Additional context used

🧠 Learnings (1)

📚 Learning: 2025-10-06T10:14:49.706Z

Learnt from: m-Peter
Repo: onflow/flow-evm-gateway PR: 890
File: services/ingestion/event_subscriber.go:235-239
Timestamp: 2025-10-06T10:14:49.706Z
Learning: In services/ingestion/event_subscriber.go, when reconnecting after disconnect errors (DeadlineExceeded, Internal, Unavailable), the subscription should reconnect at lastReceivedHeight rather than lastReceivedHeight+1. This avoids errors when the next height doesn't exist yet, and duplicate event processing is safe because the ingestion engine is explicitly designed to be idempotent (storage uses batch.Set() which overwrites existing entries).

Applied to files:

bootstrap/bootstrap.go

🧬 Code graph analysis (1)

bootstrap/bootstrap.go (5)

storage/pebble/events_hash.go (2)

EventsHash (14-16)

NewEventsHash (18-22)

services/ingestion/event_subscriber.go (2)

EventSubscriber (27-33)

NewRPCEventSubscriber (49-65)

services/ingestion/sealing_verifier.go (2)

SealingVerifier (30-54)

NewSealingVerifier (57-82)

services/ingestion/block_tracking_subscriber.go (1)

NewRPCBlockTrackingSubscriber (48-66)

models/errors/errors.go (1)

ErrStorageNotInitialized (37-37)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)

GitHub Check: Lint
GitHub Check: Test

🔇 Additional comments (4)

bootstrap/bootstrap.go (4)

60-60: LGTM! EventsHash storage added to Storages.

The new EventsHash field is properly typed and consistent with the other storage fields in the struct.

618-618: LGTM! EventsHash initialized correctly.

The initialization follows the expected pattern and properly uses the store parameter.

711-711: LGTM! EventsHash properly wired into returned Storages.

The field is correctly added to the return struct, completing the integration of EventsHash into the storage initialization flow.

163-193: Based on the verification results, the original review comment makes valid architectural observations but the execution verification was incomplete. The script output confirms the structure but doesn't show how the verifier field is actually used in methods, which is necessary to fully verify nil-safety.

Code design is sound; verify verifier nil-handling in implementation.

The conditional subscriber creation follows good architectural patterns—the two-level feature flag (outer for soft finality, inner for sealing verification) is clean, and both paths create fresh subscriber instances starting from nextCadenceHeight.

However, confirm that NewRPCBlockTrackingSubscriber properly handles the optional nil verifier parameter in all method calls. The verifier field can be nil when ExperimentalSealingVerificationEnabled is false, and Go requires explicit nil checks before dereferencing. Verify that all methods using the verifier guard against nil or document that the parameter must never be nil.

Additionally, since both subscribers are instantiated fresh from the current height on each bootstrap, toggling ExperimentalSoftFinalityEnabled should not cause state inconsistencies—confirm no persistent subscriber-type mapping exists in storage.

…lized-block-results

coderabbitai

Actionable comments posted: 1

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 5baae43 and 48b6d0c.

📒 Files selected for processing (2)

cmd/run/cmd.go (3 hunks)
config/config.go (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

cmd/run/cmd.go

coderabbitai · 2025-11-20T09:14:04Z

config/config.go

+	// ExperimentalSoftFinalityEnabled enables the experimental soft finality feature which syncs
+	// EVM block and transaction data from the upstream Access node before the block is sealed.
+	// CAUTION: This feature is experimental and may return incorrect data in certain circumstances.
+	ExperimentalSoftFinalityEnabled bool
+	// ExperimentalSealingVerificationEnabled enables the experimental sealing verification feature
+	// which verifies the hash of the EVM events ingested by the requester engine match the hash
+	// of the events from the sealed block in the Flow network.
+	// CAUTION: This feature is experimental and will cause the node to halt if the events don't match.
+	ExperimentalSealingVerificationEnabled bool


⚠️ Potential issue | 🟡 Minor

🧩 Analysis chain

LGTM! The experimental configuration fields are well-documented.

The addition of these experimental feature flags is clear and includes appropriate CAUTION warnings about potential data issues and node halting behavior. The naming follows Go conventions and the documentation is explicit about the risks.

However, given the PR context noting that integration tests cannot currently run (Emulator limitation) and the known limitations with execution forks requiring manual operator intervention, please ensure comprehensive operator-facing documentation exists.

Run the following script to verify operator documentation exists for these experimental features:

🏁 Script executed:

#!/bin/bash # Description: Verify that operator documentation exists for the new experimental features # Search for documentation files that might cover these features echo "=== Searching for documentation files ===" fd -e md -e txt -e rst . | head -20 # Search for references to the experimental flags in documentation echo -e "\n=== Searching for soft finality documentation ===" rg -n -C3 --type=md "soft.finality|ExperimentalSoftFinalityEnabled" echo -e "\n=== Searching for sealing verification documentation ===" rg -n -C3 --type=md "sealing.verification|ExperimentalSealingVerificationEnabled" # Search for any operator guides or deployment docs echo -e "\n=== Searching for operator/deployment guides ===" rg -n -l --type=md "operator|deployment|configuration|experimental" | head -10

Length of output: 355

🏁 Script executed:

#!/bin/bash # Search for any references to the experimental features in the entire codebase echo "=== Searching for ExperimentalSoftFinalityEnabled references ===" rg -n "ExperimentalSoftFinalityEnabled" . | head -20 echo -e "\n=== Searching for ExperimentalSealingVerificationEnabled references ===" rg -n "ExperimentalSealingVerificationEnabled" . | head -20 echo -e "\n=== Checking README.md for experimental feature documentation ===" cat README.md | head -100 echo -e "\n=== Checking if there's any EXPERIMENTAL documentation pattern ===" rg -n -C2 "EXPERIMENTAL|experimental.*feature" --type=md | head -40

Length of output: 6591

🏁 Script executed:

#!/bin/bash # Check how the experimental flags are used in bootstrap.go echo "=== bootstrap.go lines 165-175 (ExperimentalSoftFinalityEnabled usage) ===" sed -n '160,180p' bootstrap/bootstrap.go echo -e "\n=== cmd/run/cmd.go lines 225-240 (Flag definition and assignment) ===" sed -n '225,240p' cmd/run/cmd.go echo -e "\n=== Check if there's any configuration documentation ===" find . -name "*config*" -type f | grep -E "\.(md|txt|rst)$" echo -e "\n=== Search for any mention of flags/configuration in docs ===" rg -n "flag|--experimental" README.md | head -20

Length of output: 2161

🏁 Script executed:

#!/bin/bash # Find where the experimental flags are defined as command-line flags echo "=== Searching for experimentalSoftFinalityEnabled flag definition ===" rg -n "experimentalSoftFinalityEnabled" . -B2 -A2 | grep -E "(flag|Flag|Define|cmd|command)" -A2 -B2 echo -e "\n=== Searching for experimentalSealingVerificationEnabled flag definition ===" rg -n "experimentalSealingVerificationEnabled" . -B2 -A2 | grep -E "(flag|Flag|Define|cmd|command)" -A2 -B2 echo -e "\n=== Check cmd/run/cmd.go for flag.Bool definitions ===" rg -n "flag.Bool|flag.String" cmd/run/cmd.go | tail -20 echo -e "\n=== Extract README.md configuration table (lines 231-280) ===" sed -n '231,280p' README.md

Length of output: 9751

Add experimental feature flags to README.md configuration table.

The code changes are sound and the experimental flags have appropriate runtime warnings in cmd/run/cmd.go (lines 299-300). However, these flags are missing from the README.md configuration documentation table (lines 231-280), where operators are directed to find all available configuration options.

Add entries for experimental-soft-finality-enabled and experimental-sealing-verification-enabled to the configuration table so operators have complete visibility of all available flags and their purposes.

🤖 Prompt for AI Agents

In config/config.go around lines 123 to 131, two experimental boolean flags (ExperimentalSoftFinalityEnabled and ExperimentalSealingVerificationEnabled) were added but the README.md configuration table (around lines 231-280) is missing entries for them; update the README.md config table to include rows for --experimental-soft-finality-enabled and --experimental-sealing-verification-enabled with concise descriptions matching the comments/warnings in cmd/run/cmd.go (mentioning that they are experimental and include the CAUTION text about possible incorrect data for soft finality and node halt for sealing verification), and ensure flag names, default values, and descriptions align with existing table formatting and style.

…lized-block-results

m-Peter self-assigned this Jan 21, 2025

This was referenced Jan 21, 2025

PoC to allow indexing unsealed finalized execution results #727

Closed

Bump flow-go-sdk & cadence versions onflow/flow-emulator#779

Merged

m-Peter force-pushed the mpeter/poc-index-finalized-block-results branch from 10a8bdd to 4fd4a5d Compare January 22, 2025 16:41

peterargue reviewed Jan 23, 2025

View reviewed changes

m-Peter marked this pull request as ready for review January 23, 2025 15:29

m-Peter requested review from janezpodhostnik and zhangchiqing as code owners January 23, 2025 15:29

This comment was marked as resolved.

Sign in to view

peterargue reviewed Jan 24, 2025

View reviewed changes

m-Peter added 7 commits January 27, 2025 10:34

Implement the RPCBlockHeaderSubscriber for indexing finalized results

a276412

Rename RPCBlockHeaderSubscriber to RPCBlockTrackingSubscriber

7d36012

Remove redundant fetching of block header prior to SubscribeBlockHead…

75a1d18

…ersFromStartHeight

Improve error handling in RPCBlockTrackingSubscriber subscription

ce83ebb

Simplify RPCBlockTrackingSubscriber by embedding RPCEventSubscriber f…

d159467

…or function resuse

Optimize RPCBlockTrackingSubscriber to avoid fetch EVM tx events for …

5e8f388

…empty EVM blocks

Improve length checks for EVM related events

bfe6188

m-Peter force-pushed the mpeter/poc-index-finalized-block-results branch from 314e7cc to bfe6188 Compare January 27, 2025 09:43

This comment was marked as resolved.

Sign in to view

Close events channel in RPCBlockTrackingSubscriber

037c234

This comment was marked as resolved.

Sign in to view

Improve error handling to use the gRPC status code instead of strings

6afde4d

This comment was marked as resolved.

Sign in to view

Simplify constructor of RPCBlockTrackingSubscriber

ff927f7

Co-authored-by: Peter Argue <[email protected]>

This comment was marked as resolved.

Sign in to view

m-Peter added 2 commits January 27, 2025 13:20

Remove redundant fields from RPCBlockTrackingSubscriber

4555263

Make us of GetEventsForBlockIDs method instead of GetEventsForHeightR…

34dbde5

…ange

m-Peter added 4 commits August 21, 2025 11:58

Merge remote-tracking branch 'origin/main' into mpeter/sync-soft-fina…

ed3f17b

…lity-with-main

Replace onflow/go-ethereum with ethereum/go-ethereum

f51de72

Merge remote-tracking branch 'origin/main' into mpeter/sync-soft-fina…

965b10b

…lity-with-main

Merge pull request #869 from onflow/mpeter/sync-soft-finality-with-main

fed891a

Sync Soft Finality with changes from `main` branch

This comment was marked as outdated.

Sign in to view

m-Peter added 5 commits September 11, 2025 19:49

Merge remote-tracking branch 'origin/main' into mpeter/poc-index-fina…

a8f7437

…lized-block-results

Merge remote-tracking branch 'origin/main' into mpeter/poc-index-fina…

eab1a53

…lized-block-results

Merge remote-tracking branch 'origin/main' into mpeter/poc-index-fina…

e3d06a2

…lized-block-results

Merge remote-tracking branch 'origin/main' into mpeter/poc-index-fina…

108e232

…lized-block-results

Merge pull request #882 from onflow/mpeter/bump-flow-go-version-soft-…

efd12b0

…finality [Back-port] Bump flow-go version to `v0.43.0`

coderabbitai bot reviewed Sep 17, 2025

View reviewed changes

m-Peter added 4 commits September 22, 2025 10:45

Merge remote-tracking branch 'origin/main' into mpeter/poc-index-fina…

a404eab

…lized-block-results

Merge remote-tracking branch 'origin/main' into mpeter/poc-index-fina…

733be55

…lized-block-results

Merge remote-tracking branch 'origin/main' into mpeter/poc-index-fina…

fffd7f8

…lized-block-results

Merge remote-tracking branch 'origin/main' into mpeter/poc-index-fina…

6552a69

…lized-block-results

coderabbitai bot reviewed Oct 20, 2025

View reviewed changes

m-Peter and others added 7 commits October 20, 2025 19:15

Merge pull request #905 from onflow/mpeter/soft-finality-update-flow-go

e302a12

[Back-port] Update to latest `flow-go` for Forte upgrade

Handle missing evm block in spork root block

60caf94

verify events even for spork root block

c754eb7

add support for backfilling past spork blocks in sealing verifier

ec1385e

verify backfilled blocks

cb7f210

delete stored hashes during force start height

5e17b0b

Merge pull request #908 from onflow/peter/fix-spork-root-sf

5c97d88

Handle missing evm block in spork root block

coderabbitai bot reviewed Oct 22, 2025

View reviewed changes

m-Peter added 2 commits October 30, 2025 12:43

Merge remote-tracking branch 'origin/main' into mpeter/poc-index-fina…

2862171

…lized-block-results

Merge remote-tracking branch 'origin/main' into mpeter/poc-index-fina…

5baae43

…lized-block-results

coderabbitai bot reviewed Nov 12, 2025

View reviewed changes

Merge remote-tracking branch 'origin/main' into mpeter/poc-index-fina…

48b6d0c

…lized-block-results

coderabbitai bot reviewed Nov 20, 2025

View reviewed changes

Merge remote-tracking branch 'origin/main' into mpeter/poc-index-fina…

28de201

…lized-block-results

Implement the RPCBlockHeaderSubscriber for indexing finalized results #728

Are you sure you want to change the base?

Implement the RPCBlockHeaderSubscriber for indexing finalized results #728

Uh oh!

Conversation

m-Peter commented Jan 21, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Jan 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested labels

Suggested reviewers

Poem

Pre-merge checks and finishing touches

Uh oh!

peterargue left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

m-Peter commented Jan 23, 2025

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as outdated.

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 22, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Nov 20, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Implement the `RPCBlockHeaderSubscriber` for indexing finalized results #728

Implement the `RPCBlockHeaderSubscriber` for indexing finalized results #728

m-Peter commented Jan 21, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Jan 21, 2025 •

edited

Loading