
Conversation

@HardMax71 (Owner) commented Jan 10, 2026


Summary by cubic

Flattened event data by removing the payload wrapper and moved the domain layer to Pydantic models with a typed event hierarchy. Standardized timestamps to datetime across APIs/SSE/services, unified enum usage, updated indexes/queries/serialization, and added a manual DLQ discard helper.

  • Refactors

    • Introduced BaseEvent/DomainEvent with shared EventMetadata; removed legacy event_metadata and renamed Event → DomainEvent.
    • Switched dataclasses → Pydantic across domain models; repositories/services now use model_dump/model_validate and top-level fields (payload.execution_id → execution_id), with a domain_event_adapter where needed.
    • Replaced DomainSettingsEvent with DomainUserSettingsChangedEvent and unified timestamps to datetime (no manual ISO strings) across health, SSE, EventBus, and consumer status.
    • Used enums directly (no .value casts) across APIs, services, and tests.
  • New Features

    • DLQ manager: added discard_message_manually with state validation (returns bool, logs, skips terminal states).
    • Added Kafka event schemas with broader coverage, including ExecutionAccepted, UserLogin, and NotificationPreferencesUpdated.
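The payload flattening summarized above can be pictured with a tiny sketch; the field names are illustrative, not the project's actual schema:

```python
# Hypothetical event document before the refactor: event-specific
# fields are nested under a "payload" wrapper.
before = {
    "event_id": "evt-1",
    "event_type": "execution_completed",
    "payload": {"execution_id": "exec-42", "pod_name": "runner-0"},
}

def flatten(doc: dict) -> dict:
    """Hoist payload fields to the top level and drop the wrapper."""
    doc = dict(doc)  # shallow copy so the input stays intact
    payload = doc.pop("payload", None) or {}
    return {**doc, **payload}

after = flatten(before)
```

After flattening, queries can target `execution_id` directly instead of `payload.execution_id`.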

Written for commit e2e95ef. Summary will update on new commits.

Summary by CodeRabbit

  • New Features

    • Added new event types: User Login, Notification Preferences Updated, and Execution Accepted.
    • Admin can now manually discard DLQ messages via a new action.
  • Improvements

    • Stronger, typed event handling across the system for more reliable event processing.
    • Timestamps standardized to native datetime objects for clearer, consistent timestamps in streams and exports.
    • Event storage and search improved for better query and replay accuracy.
  • Documentation

    • New architecture doc describing the event system and design.


@coderabbitai bot commented Jan 10, 2026

📝 Walkthrough

Mass migration from dataclasses to Pydantic BaseModel across domain, repo, service, and API layers; introduces a comprehensive typed DomainEvent system (discriminated union and TypeAdapter); removes payload wrapper in DB documents in favor of top-level event fields; switches many timestamps from ISO strings to datetime and uses enum members instead of .value.
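A minimal sketch of the discriminated-union pattern the walkthrough describes, using two hypothetical event classes (the real typed.py defines many more, and its exact field sets are assumptions here):

```python
from typing import Annotated, Literal, Union

from pydantic import BaseModel, Field, TypeAdapter

class ExecutionAcceptedEvent(BaseModel):
    event_type: Literal["execution_accepted"]
    execution_id: str

class UserLoginEvent(BaseModel):
    event_type: Literal["user_login"]
    user_id: str

# The discriminator routes a raw dict to the right class by event_type.
DomainEvent = Annotated[
    Union[ExecutionAcceptedEvent, UserLoginEvent],
    Field(discriminator="event_type"),
]
domain_event_adapter = TypeAdapter(DomainEvent)

event = domain_event_adapter.validate_python(
    {"event_type": "user_login", "user_id": "u-1"}
)
```

Deserialization through a single TypeAdapter gives every consumer the same typed view of events without per-type dispatch code.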

Changes

Cohort / File(s) / Summary

  • Typed domain events (backend/app/domain/events/typed.py, backend/app/domain/events/__init__.py)
    Adds comprehensive Pydantic-based event types, BaseEvent, EventMetadata, the DomainEvent discriminated union, and domain_event_adapter; reorganizes exports.
  • DB documents & indexes (backend/app/db/docs/event.py, backend/app/events/event_store.py)
    Removes the payload wrapper, enables extra="allow" so event-specific fields are stored at the top level, replaces payload-based sparse indexes with execution_id/pod_name, and updates store/get to use model_dump()/model_validate().
  • Repositories → domain models (backend/app/db/repositories/*: event_repository, admin_events_repository, replay_repository, user_settings_repository, execution_repository, notification_repository, admin_user_repository, resource_allocation_repository, saga_repository, saved_script_repository, sse_repository, user_repository)
    Systematic shift from dataclass/asdict/dict-unpacking to Pydantic model_dump()/model_validate(..., from_attributes=True) and to DomainEvent usage in event flows; the query key for execution_id moves to top-level execution_id.
  • Domain model migrations (backend/app/domain/*: execution, replay, user/settings, notification, saga, saved_script, sse, events/event_models.py; events/event_metadata.py removed)
    Replaces many dataclasses with Pydantic BaseModel (ConfigDict(from_attributes=True)), adds/renames models (e.g., DomainUserSettingsChangedEvent), and updates timestamps and defaults to Field(default_factory=...).
  • Kafka / infra events (backend/app/infrastructure/kafka/events/*, backend/app/infrastructure/kafka/mappings.py, backend/app/services/kafka_event_service.py)
    Adds new Kafka event classes (UserLoginEvent, NotificationPreferencesUpdatedEvent), maps EventType → Kafka classes, and changes Kafka publishing to use the domain adapter and model_dump() for payload/metadata.
  • Service and API changes (backend/app/services/event_service.py, backend/app/services/user_settings_service.py, backend/app/services/notification_service.py, backend/app/services/sse/*, backend/app/api/routes/events.py, backend/app/api/routes/replay.py, backend/app/api/routes/health.py, deploy.sh)
    Services now consume/produce DomainEvent/ArchivedEvent, derive correlation_id from metadata, use model_dump() for serialization, and emit datetimes instead of ISO strings; OpenAPI generation now uses create_app().
  • DLQ & manager (backend/app/dlq/manager.py)
    Minor header/typing cleanups and a new public method discard_message_manually(event_id: str, reason: str) -> bool.
  • Enums & metrics (backend/app/events/core/*, backend/app/services/*)
    Replaces .value usage with enum members for logging/metrics/status fields across many components.
  • Schema registry async (backend/app/events/schema/schema_registry.py)
    set_compatibility converted to async and called with await.
  • Tests & coverage (many files under backend/tests/, notably backend/tests/unit/domain/events/test_event_schema_coverage.py and updated SSE/integration tests)
    Adds event schema coverage tests; updates numerous tests to use DomainEvent, enum members, and datetime types; broad import/formatting adjustments.
  • Docs / nav (docs/architecture/event-system-design.md, mkdocs.yml)
    Adds a design doc describing the three-layer event architecture and test-based schema coverage; updates docs navigation.

Estimated code review effort

🎯 5 (Critical) | ⏱️ ~120+ minutes

Possibly related PRs

Suggested labels

enhancement

Poem

🐰 I hop through code with ears held high,

Dataclasses swapped for Pydantic sky.
Enums stand proud, timestamps keep their time,
Events now typed—each field in line.
A carrot toast: typed events, clean and spry. 🥕

🚥 Pre-merge checks | ✅ 1 | ❌ 2
❌ Failed checks (1 warning, 1 inconclusive)
  • Docstring Coverage (⚠️ Warning): docstring coverage is 37.08%, below the required 80.00% threshold. Resolution: write docstrings for the functions missing them.
  • Title check (❓ Inconclusive): the title 'chore: type fixes' is vague and generic, conveying little about the changeset's scope or primary change. Resolution: use a more descriptive title that captures the main refactoring, e.g., 'refactor: flatten event payloads and migrate models to Pydantic' or 'chore: refactor event system with dataclass-to-Pydantic migration'.
✅ Passed checks (1 passed)
  • Description Check (✅ Passed): check skipped because CodeRabbit's high-level summary is enabled.



@codecov-commenter commented Jan 10, 2026

⚠️ Please install the Codecov GitHub app to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

❌ Patch coverage is 93.20513% with 53 lines in your changes missing coverage. Please review.

Files with missing lines | Patch % | Lines
backend/app/dlq/manager.py | 17.64% | 14 Missing ⚠️
backend/app/db/repositories/event_repository.py | 57.14% | 9 Missing ⚠️
...p/db/repositories/admin/admin_events_repository.py | 40.00% | 6 Missing ⚠️
backend/app/events/event_store.py | 40.00% | 6 Missing ⚠️
backend/app/db/repositories/replay_repository.py | 20.00% | 4 Missing ⚠️
...end/app/db/repositories/notification_repository.py | 76.92% | 3 Missing ⚠️
backend/app/db/repositories/user_repository.py | 50.00% | 3 Missing ⚠️
backend/app/api/routes/events.py | 33.33% | 2 Missing ⚠️
backend/app/services/admin/admin_events_service.py | 60.00% | 2 Missing ⚠️
backend/app/db/repositories/sse_repository.py | 0.00% | 1 Missing ⚠️
... and 3 more
❗ Your organization needs to install the Codecov GitHub app to enable full functionality.
Flag | Coverage Δ
backend-e2e | 58.75% <82.94%> (+0.96%) ⬆️
backend-integration | 74.76% <91.53%> (+0.37%) ⬆️
backend-unit | 61.14% <82.43%> (+0.94%) ⬆️

Flags with carried forward coverage won't be shown.

Files with missing lines | Coverage Δ
backend/app/api/routes/admin/events.py | 45.19% <ø> (-0.53%) ⬇️
backend/app/api/routes/health.py | 100.00% <100.00%> (ø)
backend/app/api/routes/replay.py | 82.92% <ø> (-0.41%) ⬇️
backend/app/db/docs/event.py | 100.00% <100.00%> (ø)
...app/db/repositories/admin/admin_user_repository.py | 91.54% <100.00%> (-0.12%) ⬇️
...ackend/app/db/repositories/execution_repository.py | 92.59% <100.00%> (+1.36%) ⬆️
.../db/repositories/resource_allocation_repository.py | 70.58% <100.00%> (-1.64%) ⬇️
backend/app/db/repositories/saga_repository.py | 55.69% <100.00%> (-0.56%) ⬇️
...end/app/db/repositories/saved_script_repository.py | 100.00% <100.00%> (ø)
...nd/app/db/repositories/user_settings_repository.py | 84.44% <100.00%> (-0.34%) ⬇️
... and 49 more

... and 2 files with indirect coverage changes


@cubic-dev-ai bot left a comment

4 issues found across 15 files

Prompt for AI agents (all issues)

Check if these issues are valid — if so, understand the root cause of each and fix them.


<file name="backend/app/events/event_store.py">

<violation number="1" location="backend/app/events/event_store.py:155">
P1: Query change may break retrieval of existing events. Old documents have `execution_id` stored under `payload.execution_id`, but the query now looks at root level `execution_id`. Consider using a query that checks both paths for backwards compatibility: `{"$or": [{"execution_id": execution_id}, {"payload.execution_id": execution_id}, {"aggregate_id": execution_id}]}`</violation>
</file>

<file name="backend/app/services/admin/admin_events_service.py">

<violation number="1" location="backend/app/services/admin/admin_events_service.py:239">
P1: Using `dict(vars(event))` instead of `asdict(event)` won't recursively convert nested dataclasses like `metadata: EventMetadata` to dicts. This will cause the JSON export to contain string representations of objects instead of properly nested JSON structures. Consider using Pydantic's `model_dump()` method for Pydantic dataclasses, or keep `asdict()` if it was working correctly.</violation>
</file>

<file name="backend/app/domain/user/settings_models.py">

<violation number="1" location="backend/app/domain/user/settings_models.py:78">
P2: The `theme` field uses `str | None` instead of `Theme | None`, which is inconsistent with `DomainUserSettings` and `DomainUserSettingsUpdate` in the same file. If this is a "well-typed domain event", consider using `Theme | None` for type consistency.</violation>
</file>

<file name="backend/app/api/routes/events.py">

<violation number="1" location="backend/app/api/routes/events.py:350">
P1: This change alters the replay semantics: `event.model_extra` returns ALL extra fields, not just the `payload`. If the Event has other extra attributes stored, they would all be included in the published payload. Consider using `getattr(event, 'payload', {})` to access the specific payload attribute while satisfying type checkers.</violation>
</file>

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.

@coderabbitai bot left a comment
Actionable comments posted: 6

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (2)
backend/app/db/docs/event.py (1)

2-2: Fix CI: remove unused import (Any) flagged by Ruff.

This currently fails GitHub Actions (F401).

Proposed diff
-from typing import Any
backend/app/events/event_store.py (1)

117-121: Exclude Beanie internal fields before deserializing event data.

When retrieving events from MongoDB, doc.model_dump() includes Beanie's internal fields (id, revision_id) added by state management. These are silently ignored during deserialization because event models rely on Pydantic's default extra='ignore' behavior. This creates a brittle contract—if any event model is later configured with extra='forbid', deserialization will fail. Explicitly exclude these fields at the boundary.

Suggested fix (applies to 7 locations)
-        event = self.schema_registry.deserialize_json(doc.model_dump())
+        event = self.schema_registry.deserialize_json(doc.model_dump(exclude={"id", "revision_id"}))
-        events = [self.schema_registry.deserialize_json(doc.model_dump()) for doc in docs]
+        events = [
+            self.schema_registry.deserialize_json(doc.model_dump(exclude={"id", "revision_id"}))
+            for doc in docs
+        ]

Also applies to: 143–147, 160–165, 182–187, 203–207, 216–220, 240–243

🤖 Fix all issues with AI agents
In @backend/app/api/routes/events.py:
- Line 350: The code uses event.model_extra (payload=event.model_extra or {}),
but Event is a Pydantic dataclass and stores extras in __pydantic_extra__, so
replace the direct attribute access with a robust getter that first tries
event.model_extra, then event.__pydantic_extra__, and falls back to an empty
dict; for example, compute the payload via getattr checks on the event object
(checking "model_extra" then "__pydantic_extra__") before passing it into the
payload parameter.

In @backend/app/db/repositories/admin/admin_events_repository.py:
- Around line 194-197: The archival call uses **vars(event) which omits Pydantic
extra fields; update the dict merge to include event.__pydantic_extra__ as well
(e.g., replace **vars(event) with **{**vars(event), **(event.__pydantic_extra__
or {})}) so any extra fields on the Pydantic dataclass are preserved when
constructing the archived record (look for this pattern in
admin_events_repository where deleted_at and deleted_by are set).
- Line 273: The code reads extra Pydantic fields via event.model_extra which
raises AttributeError for Pydantic dataclasses; replace that access with the
pydantic internal extras (use getattr(event, "__pydantic_extra__", {}) or
equivalent) so exec_id = (event.model_extra or {}).get("execution_id") or
event.aggregate_id becomes safe; update the same pattern in
admin_events_repository.py (the line computing exec_id from event) to use
__pydantic_extra__ (or getattr) to avoid AttributeError, mirroring the fix
applied in events.py.

In @backend/app/events/event_store.py:
- Around line 51-53: When constructing EventDocument from the Pydantic model,
call model_dump with exclude_none=True to avoid storing explicit nulls that
defeat MongoDB sparse indexes; update the two sites where EventDocument is
created (the EventDocument(**event.model_dump(), stored_at=now,
ttl_expires_at=ttl) occurrences) to use event.model_dump(exclude_none=True) so
fields like execution_id and pod_name are omitted when None before inserting.

In @backend/app/services/admin/admin_events_service.py:
- Line 239: The event JSON export uses event_dict = dict(vars(event)) which
drops Pydantic extra fields stored in event.model_extra (so fields like
execution_id and pod_name are lost); update the assignment of event_dict in the
export logic (the variable named event_dict in admin_events_service.py) to merge
vars(event) with event.model_extra when present (i.e., combine the two dicts so
extra fields are included), following the same model_extra usage you saw in
admin_events_repository.py.
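The model_dump(exclude_none=True) behavior suggested for event_store.py can be illustrated with a hypothetical minimal model; the field names mirror the ones mentioned above but the class itself is a sketch:

```python
from typing import Optional

from pydantic import BaseModel

class EventDoc(BaseModel):
    event_id: str
    execution_id: Optional[str] = None
    pod_name: Optional[str] = None

doc = EventDoc(event_id="evt-1")

# Default dump stores explicit nulls, which a MongoDB sparse index
# would still index; exclude_none omits the fields entirely.
with_nulls = doc.model_dump()
without_nulls = doc.model_dump(exclude_none=True)
```

Omitting the fields (rather than writing null) is what lets a sparse index skip documents that have no execution context.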
🧹 Nitpick comments (1)
backend/app/domain/user/settings_models.py (1)

69-86: Make DomainUserSettingsChangedEvent align with emitted changes (Theme type + custom_settings).

If these events are the typed source-of-truth for replay/history, consider:

  • theme: Theme | None (not str | None) for consistent typing/validation.
  • add custom_settings: dict[str, Any] | None = None if it can be emitted (it is, via DomainUserSettingsUpdate).
Proposed diff
 class DomainUserSettingsChangedEvent:
@@
-    theme: str | None = None
+    theme: Theme | None = None
@@
     reason: str | None = None
     correlation_id: str | None = None
+    custom_settings: dict[str, Any] | None = None
📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 41dab25 and 9efb740.

📒 Files selected for processing (15)
  • backend/app/api/routes/events.py
  • backend/app/db/docs/event.py
  • backend/app/db/repositories/admin/admin_events_repository.py
  • backend/app/db/repositories/event_repository.py
  • backend/app/db/repositories/replay_repository.py
  • backend/app/db/repositories/user_settings_repository.py
  • backend/app/dlq/manager.py
  • backend/app/domain/events/event_models.py
  • backend/app/domain/replay/models.py
  • backend/app/domain/user/__init__.py
  • backend/app/domain/user/settings_models.py
  • backend/app/events/event_store.py
  • backend/app/services/admin/admin_events_service.py
  • backend/app/services/kafka_event_service.py
  • backend/app/services/user_settings_service.py
🧰 Additional context used
🧬 Code graph analysis (8)
backend/app/domain/replay/models.py (1)
backend/app/services/coordinator/queue_manager.py (1)
  • execution_id (30-31)
backend/app/db/repositories/user_settings_repository.py (3)
backend/app/domain/user/settings_models.py (2)
  • DomainUserSettings (35-46)
  • DomainUserSettingsChangedEvent (70-85)
backend/app/domain/events/query_builders.py (1)
  • limit (22-24)
backend/app/domain/events/event_models.py (1)
  • correlation_id (60-61)
backend/app/dlq/manager.py (2)
backend/app/dlq/models.py (1)
  • DLQMessageStatus (10-16)
backend/app/db/repositories/dlq_repository.py (1)
  • _doc_to_message (26-31)
backend/app/events/event_store.py (1)
backend/app/events/schema/schema_registry.py (1)
  • deserialize_json (178-193)
backend/app/domain/user/__init__.py (1)
backend/app/domain/user/settings_models.py (1)
  • DomainUserSettingsChangedEvent (70-85)
backend/app/services/user_settings_service.py (2)
backend/app/domain/user/settings_models.py (3)
  • DomainUserSettingsChangedEvent (70-85)
  • DomainUserSettings (35-46)
  • DomainSettingsHistoryEntry (89-96)
backend/app/db/repositories/user_settings_repository.py (1)
  • get_settings_events (31-59)
backend/app/services/admin/admin_events_service.py (1)
backend/tests/unit/services/idempotency/test_middleware.py (1)
  • event (32-36)
backend/app/domain/user/settings_models.py (2)
backend/app/services/coordinator/queue_manager.py (1)
  • user_id (34-35)
backend/app/domain/events/event_models.py (1)
  • correlation_id (60-61)
🪛 GitHub Actions: MyPy Type Checking
backend/app/api/routes/events.py

[error] 350-350: mypy check failed: 'Event' has no attribute 'model_extra' [attr-defined]. Command: 'uv run mypy --config-file pyproject.toml --strict .'

🪛 GitHub Actions: Ruff Linting
backend/app/db/docs/event.py

[error] 2-2: F401 'Any' imported but unused. Remove unused import. Found 1 error. Command failed: 'uv run ruff check . --config pyproject.toml'.

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (4)
  • GitHub Check: Build Frontend
  • GitHub Check: cubic · AI code reviewer
  • GitHub Check: Integration Tests
  • GitHub Check: E2E Tests
🔇 Additional comments (14)
backend/app/domain/events/event_models.py (1)

44-57: LGTM: Event model properly configured for extra fields.

The addition of extra="allow" configuration and explicit status/error fields aligns with the payload flattening strategy. The Pydantic dataclass will now store event-specific fields at the top level.

Note: Extra fields in Pydantic dataclasses are stored in __pydantic_extra__, not model_extra (which is only available on BaseModel). Code accessing extra fields must use event.__pydantic_extra__.

backend/app/domain/replay/models.py (1)

56-56: execution_id is properly indexed at the document root level.

The query path execution_id correctly targets the root-level field. EventDocument (backend/app/db/docs/event.py) defines a sparse ASCENDING index on execution_id (line 59) and includes it in the text search index (line 79). The model's extra="allow" configuration enables flexible storage of event-specific fields at the document root, supporting the payload flattening design.

backend/app/db/repositories/replay_repository.py (1)

94-94: LGTM: Simplified batch construction aligns with flattened event storage.

The change to use model_dump(exclude=...) directly is cleaner and aligns with the PR's goal of eliminating nested payload fields in favor of top-level extra fields.

backend/app/services/kafka_event_service.py (2)

244-252: Consistent with publish_event - payload spread as kwargs.

The change mirrors the pattern in publish_event at Line 93. The payload construction (Lines 239-241) explicitly excludes base fields, which reduces the risk of field name conflicts when spreading with **payload.


86-94: Event model is correctly configured to support **payload unpacking.

The Event model in backend/app/domain/events/event_models.py already has extra="allow" configured via @dataclass(config=ConfigDict(extra="allow")), which properly supports the unpacking of event-specific payload fields. Actual usage shows payload fields are kept separate from base Event fields, preventing field name collisions.

backend/app/dlq/manager.py (2)

327-327: LGTM: Explicit type annotation improves clarity.

Replacing the cast with a direct type annotation makes the code more explicit and type-safe.


491-513: LGTM: New manual discard method follows established patterns.

The new discard_message_manually method:

  • Mirrors the structure of retry_message_manually (Lines 447-460), maintaining consistency
  • Includes appropriate state guards against terminal states (DISCARDED, RETRIED)
  • Has proper error handling and logging
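A rough sketch of such a discard helper, with hypothetical names based only on the description above (the real manager works against MongoDB and logging infrastructure, not an in-memory dict):

```python
from dataclasses import dataclass
from enum import Enum
from typing import Dict, Optional

class DLQMessageStatus(str, Enum):
    PENDING = "pending"
    RETRIED = "retried"
    DISCARDED = "discarded"

# Terminal states are skipped, mirroring the guard described above.
TERMINAL_STATES = {DLQMessageStatus.RETRIED, DLQMessageStatus.DISCARDED}

@dataclass
class DLQMessage:
    event_id: str
    status: DLQMessageStatus = DLQMessageStatus.PENDING
    discard_reason: Optional[str] = None

def discard_message_manually(
    store: Dict[str, DLQMessage], event_id: str, reason: str
) -> bool:
    msg = store.get(event_id)
    if msg is None:
        return False  # unknown message: nothing to discard
    if msg.status in TERMINAL_STATES:
        return False  # already retried or discarded; skip
    msg.status = DLQMessageStatus.DISCARDED
    msg.discard_reason = reason
    return True

store = {
    "e1": DLQMessage("e1"),
    "e2": DLQMessage("e2", DLQMessageStatus.RETRIED),
}
ok = discard_message_manually(store, "e1", "stale payload")
blocked = discard_message_manually(store, "e2", "stale payload")
```

Returning bool for both the not-found and terminal-state cases matches the signature described in the changes table.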
backend/app/db/repositories/event_repository.py (4)

39-40: LGTM: Improved event storage with None filtering and timestamp defaults.

The changes improve robustness:

  • vars(event) with None filtering prevents storing null values
  • setdefault ensures stored_at is always populated

59-60: Consistent batch storage implementation.

The same pattern from store_event is applied here, with the optimization of using a single timestamp for the entire batch.


332-337: LGTM: Consistent use of model_dump for archival.

The archival process now uses model_dump(exclude=...) to copy event data, consistent with the PR's flattening approach.


147-151: Execution ID field and index verified.

The execution_id is properly indexed as a top-level field in EventDocument with a sparse index (idx_execution_id), supporting the query change. The field is stored via extra="allow" configuration and included in both ascending and text search indexes.

backend/app/domain/user/__init__.py (1)

17-17: All references to DomainSettingsEvent have been updated successfully.

The breaking change from DomainSettingsEvent to DomainUserSettingsChangedEvent is complete. The old name has been fully removed from the codebase, and the new name is used consistently across all files that depend on it: user_settings_service.py, user_settings_repository.py, and the domain layer exports.

backend/app/events/event_store.py (1)

155-161: Review the event storage design for legacy payload.* fields.

The current get_execution_events() query searches for top-level execution_id and aggregate_id fields. If the database contains legacy events with payload.execution_id (from an earlier schema version), the query will miss them.

The EventDocument schema indicates a flat structure (comment: "no payload wrapper needed"), and no instances of payload.execution_id appear in the codebase. However, the MongoDB aggregation queries in query_builders.py still reference $payload.duration_seconds and $payload.status, suggesting the possibility of legacy or mixed-format data.

Before relying on the current query, verify:

  • Whether legacy events with payload.* fields exist in the production database
  • If yes, either add a temporary {"payload.execution_id": execution_id} clause to the $or query or run a migration script to flatten existing documents
backend/app/db/repositories/user_settings_repository.py (1)

34-59: Both concerns in this review can be safely dismissed:

  1. str(et) matching: EventType is a StringEnum, so str(et) correctly returns the stored enum value (e.g., "user_settings_updated"). The filter works as intended.

  2. Payload structure: Payload fields are flattened to the top-level in EventDocument via **payload in KafkaEventService.publish_event (line 93-94). The user_id and changed_fields are stored as direct document fields, not nested. DomainUserSettingsChangedEvent will receive all required fields correctly.

The implementation is sound.

Likely an incorrect or invalid review comment.
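The str(et) vs .value equivalence discussed above can be checked with a minimal StringEnum stand-in (the project's actual base class may differ; the explicit __str__ here encodes the behavior the review asserts):

```python
from enum import Enum

class StringEnum(str, Enum):
    """Assumed base: __str__ returns the raw value, as the review notes."""
    def __str__(self) -> str:
        return self.value

class EventType(StringEnum):
    USER_SETTINGS_UPDATED = "user_settings_updated"

as_str = str(EventType.USER_SETTINGS_UPDATED)
as_value = EventType.USER_SETTINGS_UPDATED.value
```

Since both produce the stored string, the filter works either way; the later suggestion to prefer .value is purely about consistency with event_repository.py.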

…ersions, compatibility with beanie, etc

+ no manual datetime -> str conversion
+ single point of definition for class EventMetadata(BaseModel)
@coderabbitai bot left a comment

Actionable comments posted: 5

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (6)
backend/app/db/docs/event.py (1)

2-2: Remove unused import flagged by linter.

The pipeline failure indicates Any from typing is imported but unused.

🔧 Proposed fix
-from typing import Any
backend/app/events/core/consumer.py (1)

219-231: Fix ConsumerStatus.state type definition to accept ConsumerState enum.

The ConsumerStatus.state field is typed as str in backend/app/events/core/types.py (line 168), but the code now passes self._state which is a ConsumerState enum object. Update the type annotation to:

state: ConsumerState

The datetime changes to ConsumerMetricsSnapshot.last_message_time and last_updated are correct—they are already properly typed as datetime | None and match the datetime objects being passed.

backend/app/domain/replay/models.py (1)

31-45: Fix ReplayFilter.is_empty() ignoring exclude_event_types.

exclude_event_types isn’t considered, so a filter with only exclusions will be treated as “empty” (risking unintended broad replays/browses depending on caller behavior).

Proposed fix
     def is_empty(self) -> bool:
         return not any(
             [
                 self.event_ids,
                 self.execution_id,
                 self.correlation_id,
                 self.aggregate_id,
                 self.event_types,
+                self.exclude_event_types,
                 self.start_time,
                 self.end_time,
                 self.user_id,
                 self.service_name,
                 self.custom_query,
             ]
         )
backend/app/services/user_settings_service.py (1)

152-161: Custom settings updates won’t survive replay (event model drops custom_settings, apply ignores it).

update_custom_setting() emits changed_fields=["custom_settings", ...], but DomainUserSettingsChangedEvent doesn’t include custom_settings and is extra="ignore", so the value is lost; _apply_event() also filters it out. After cache expiry or restart, get_user_settings_fresh() will reconstruct without those changes.

Minimum fixes (pick one consistent approach):

  1. Event-source it: add custom_settings: dict[str, Any] | None to DomainUserSettingsChangedEvent, include it in _settings_fields, and apply it.
  2. If intentionally not event-sourced: on custom_settings updates, write a snapshot immediately (or store custom_settings elsewhere), and don’t rely on replay.
Partial fix in this file (still requires updating DomainUserSettingsChangedEvent)
-    _settings_fields = {"theme", "timezone", "date_format", "time_format", "notifications", "editor"}
+    _settings_fields = {"theme", "timezone", "date_format", "time_format", "notifications", "editor", "custom_settings"}

Also applies to: 206-219

backend/app/db/repositories/user_settings_repository.py (1)

30-57: Use .value for consistency with event_repository.py.

Both str(et) and et.value produce identical results because StringEnum.__str__() returns self.value. However, event_repository.py (line 100) uses .value consistently. Adopt the same pattern here for code clarity:

Fix
-            In(EventDocument.event_type, [str(et) for et in event_types]),
+            In(EventDocument.event_type, [et.value for et in event_types]),
backend/app/db/repositories/event_repository.py (1)

39-53: Use event.execution_id for the EXECUTION_ID span attribute, not aggregate_id.

The current implementation sets the span attribute from event.aggregate_id, but most domain events have execution_id as their primary execution identifier. The get_execution_events method confirms these are distinct concepts by querying both fields separately. This mislabels execution traces for observability.

The same issue exists in backend/app/events/event_store.py. Use getattr(event, "execution_id", "") to handle edge cases like UserSettingsUpdatedEvent which lack an execution context.

Proposed fix
         add_span_attributes(
             **{
                 str(EventAttributes.EVENT_TYPE): str(event.event_type),
                 str(EventAttributes.EVENT_ID): event.event_id,
-                str(EventAttributes.EXECUTION_ID): event.aggregate_id or "",
+                str(EventAttributes.EXECUTION_ID): getattr(event, "execution_id", "") or "",
             }
         )
🤖 Fix all issues with AI agents
In @backend/app/db/repositories/admin/admin_events_repository.py:
- Around line 178-185: The archive_event method creates EventArchiveDocument by
spreading event.model_dump() while also passing deleted_at and deleted_by
explicitly, causing duplicate keyword arguments; fix by calling
EventArchiveDocument with event.model_dump(exclude={'deleted_at','deleted_by'})
and then supplying deleted_at=datetime.now(timezone.utc) and
deleted_by=deleted_by, or alternatively remove the explicit
deleted_at/deleted_by args and rely on the values from event.model_dump();
update the archive_event function and reference EventArchiveDocument,
DomainEvent, and event.model_dump() accordingly.
- Around line 252-255: The loop over original_events currently reads
execution_id from event.model_extra (as in the snippet using exec_id =
(event.model_extra or {}).get("execution_id") or event.aggregate_id) which is
correct because EventDocument uses extra="allow"; to make presence/type safety
optional, either keep this access but add a short comment referencing
EventDocument's extra="allow", or validate/deserialize the raw event before
reading execution_id by calling domain_event_adapter.validate_python(event,
from_attributes=True) to get a typed domain event where execution_id is an
explicit field, then read execution_id from that object instead of model_extra.

In @backend/app/domain/events/typed.py:
- Around line 240-264: The DomainEvent discriminated union only lists 19 event
classes but the EventType enum contains ~54 values, so TypeAdapter(DomainEvent)
will fail for unmapped event_type values. Update the DomainEvent union (and the
TypeAdapter) to include corresponding event classes for all EventType members
(e.g., add classes for USER_REGISTERED, USER_LOGIN/LOGGED_IN/LOGGED_OUT,
USER_UPDATED/DELETED, all NOTIFICATION_*, SCRIPT_*, SAGA_*, *_COMMAND,
security/resource/system events, etc.), or prune unused EventType entries so
every EventType has a matching event class. Ensure each added class uses the
same discriminator key "event_type" and is imported/defined before the
DomainEvent union so that domain_event_adapter: TypeAdapter[DomainEvent] can
successfully deserialize all possible event_type values.
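A two-member sketch of the failure mode (assuming Pydantic v2; the real union has many more classes and uses the EventType enum rather than bare string literals):

```python
from typing import Annotated, Literal, Union

from pydantic import BaseModel, Field, TypeAdapter, ValidationError

# Hypothetical trimmed members of the discriminated union.
class ExecutionQueuedEvent(BaseModel):
    event_type: Literal["execution_queued"]
    execution_id: str

class ExecutionStartedEvent(BaseModel):
    event_type: Literal["execution_started"]
    execution_id: str

DomainEvent = Annotated[
    Union[ExecutionQueuedEvent, ExecutionStartedEvent],
    Field(discriminator="event_type"),
]
adapter = TypeAdapter(DomainEvent)

queued = adapter.validate_python(
    {"event_type": "execution_queued", "execution_id": "e1"}
)
try:
    # Any event_type without a matching class fails validation outright.
    adapter.validate_python({"event_type": "user_registered", "execution_id": "e1"})
    unmapped_accepted = True
except ValidationError:
    unmapped_accepted = False
```

Mapped tags deserialize to the concrete class; an unmapped tag like "user_registered" is rejected, which is exactly why every EventType member needs a corresponding class (or the enum needs pruning).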

In @backend/app/domain/saga/models.py:
- Line 5: Remove the unused computed_field import from the top-level import list
in models.py; update the import line that currently reads "from pydantic import
BaseModel, ConfigDict, Field, computed_field" to omit computed_field since
SagaListResult uses a standard @property for has_more, leaving the other imports
unchanged.

In @backend/app/services/admin/admin_events_service.py:
- Line 246: The runtime AttributeError occurs because EventFilter is a dataclass,
so event_filter.model_dump() is invalid. Replace the model_dump call with
dataclasses.asdict(event_filter) (import asdict from dataclasses) to
serialize the dataclass, or alternatively change EventFilter to a Pydantic
BaseModel in event_models so model_dump remains valid; update the serialization
in the code path that constructs "filters_applied" accordingly.
🧹 Nitpick comments (8)
backend/app/domain/saga/models.py (1)

66-69: Consider using computed_field for serialization.

Using @property works for attribute access but the has_more value won't be included when the model is serialized via model_dump(). If this field should be present in API responses, use @computed_field instead.

Optional: Use computed_field for serialization
-    @property
+    @computed_field
+    @property
     def has_more(self) -> bool:
         """Calculate has_more."""
         return (self.skip + len(self.sagas)) < self.total

This would also justify keeping the computed_field import.
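A runnable sketch of the difference (assuming Pydantic v2; this SagaListResult is trimmed to the three fields the property reads):

```python
from pydantic import BaseModel, computed_field

# With @computed_field stacked on @property, the derived value appears in
# model_dump() output; a plain @property would be silently omitted.
class SagaListResult(BaseModel):
    sagas: list = []
    skip: int = 0
    total: int = 0

    @computed_field
    @property
    def has_more(self) -> bool:
        return (self.skip + len(self.sagas)) < self.total

result = SagaListResult(sagas=["s1"], skip=0, total=3)
dumped = result.model_dump()  # contains the "has_more" key
```

Attribute access works either way; only serialization changes, which is the deciding factor if has_more is meant to reach API responses.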

backend/app/api/routes/events.py (1)

349-354: LGTM! Payload extraction aligns with event flattening.

The logic correctly extracts event-specific fields by excluding BaseEvent base fields and the event_type discriminator. This implements the PR's objective of removing the payload wrapper and working with top-level fields.

One optional consideration: event.model_dump() includes None values by default. If you want to exclude optional fields that are None from the replayed payload, consider using event.model_dump(exclude_none=True) on line 351. However, the current behavior may be intentional if None values carry semantic meaning.

backend/app/services/event_bus.py (1)

139-139: Prefer model_dump() over vars() for Pydantic models.

Using vars(event) directly accesses the instance dictionary, bypassing Pydantic's serialization logic. This can miss custom serializers, validators, or computed fields.

♻️ Proposed refactor
-                value = json.dumps(vars(event)).encode("utf-8")
+                value = json.dumps(event.model_dump(mode="json")).encode("utf-8")

This ensures proper serialization of datetime fields and respects any custom Pydantic configuration.
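A minimal repro of why vars() breaks here (assuming Pydantic v2; EventBusEvent is trimmed to two fields):

```python
import json
from datetime import datetime, timezone

from pydantic import BaseModel

# json.dumps(vars(event)) would raise TypeError on the datetime field;
# model_dump(mode="json") converts it to an ISO 8601 string first.
class EventBusEvent(BaseModel):
    event_type: str
    timestamp: datetime

event = EventBusEvent(event_type="pong", timestamp=datetime.now(timezone.utc))
value = json.dumps(event.model_dump(mode="json")).encode("utf-8")
```

mode="json" also runs any custom field serializers, which vars() bypasses entirely.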

backend/app/domain/replay/models.py (1)

47-88: Execution-id query flattening looks consistent; consider dropping redundant str() cast.

execution_id is already typed as str | None, so str(self.execution_id) is likely unnecessary unless callers pass non-strings. If you want the coercion, consider typing it as str | UUID | None instead.

backend/app/domain/saved_script/models.py (1)

31-39: Consider making DomainSavedScriptUpdate.updated_at optional (set it in repo/service instead).

As written, constructing DomainSavedScriptUpdate() will still advance updated_at, which can turn “no changes” into a write.

backend/app/db/repositories/user_settings_repository.py (1)

51-57: Guard e.metadata access if it can be null.

If EventDocument.metadata is optional in the schema, e.metadata.correlation_id can throw; consider getattr(e.metadata, "correlation_id", None).

backend/app/services/user_settings_service.py (1)

163-180: Consider using event.model_dump(exclude_none=True) for history value lookup.

This avoids “present-but-None” keys and makes lookups cleaner, especially if the model config changes around extras. (Pydantic v2 behavior detail.)

backend/app/domain/user/user_models.py (1)

106-127: Consider Pydantic-native validation for email/password (instead of is_valid).

E.g., EmailStr for email and constrained strings for password length; reduces duplicated “did we call is_valid?” callsites. (Pydantic v2 APIs.)
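A sketch of the Pydantic-native route (assuming Pydantic v2). EmailStr is the nicer option but pulls in the email-validator extra, so this illustration uses a regex-constrained string instead; both constraints fail at construction time rather than via a separate is_valid() call:

```python
from pydantic import BaseModel, Field, ValidationError

# Hypothetical trimmed DomainUserCreate with declarative validation.
class DomainUserCreate(BaseModel):
    email: str = Field(pattern=r"^[^@\s]+@[^@\s]+\.[^@\s]+$")
    password: str = Field(min_length=8)

user = DomainUserCreate(email="a@example.com", password="longenough")
try:
    DomainUserCreate(email="not-an-email", password="short")
    rejected = False
except ValidationError:
    rejected = True  # both the pattern and min_length constraints fail
```

Callsites then never need to remember an is_valid() check; an invalid instance simply cannot be constructed.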

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 9efb740 and 67f92b3.

📒 Files selected for processing (41)
  • backend/app/api/routes/events.py
  • backend/app/api/routes/health.py
  • backend/app/db/docs/event.py
  • backend/app/db/repositories/admin/admin_events_repository.py
  • backend/app/db/repositories/admin/admin_user_repository.py
  • backend/app/db/repositories/event_repository.py
  • backend/app/db/repositories/execution_repository.py
  • backend/app/db/repositories/notification_repository.py
  • backend/app/db/repositories/replay_repository.py
  • backend/app/db/repositories/resource_allocation_repository.py
  • backend/app/db/repositories/saga_repository.py
  • backend/app/db/repositories/saved_script_repository.py
  • backend/app/db/repositories/sse_repository.py
  • backend/app/db/repositories/user_repository.py
  • backend/app/db/repositories/user_settings_repository.py
  • backend/app/domain/admin/overview_models.py
  • backend/app/domain/admin/replay_updates.py
  • backend/app/domain/events/__init__.py
  • backend/app/domain/events/event_metadata.py
  • backend/app/domain/events/event_models.py
  • backend/app/domain/events/typed.py
  • backend/app/domain/execution/models.py
  • backend/app/domain/notification/models.py
  • backend/app/domain/replay/models.py
  • backend/app/domain/saga/models.py
  • backend/app/domain/saved_script/models.py
  • backend/app/domain/sse/models.py
  • backend/app/domain/user/settings_models.py
  • backend/app/domain/user/user_models.py
  • backend/app/events/core/consumer.py
  • backend/app/events/core/types.py
  • backend/app/schemas_pydantic/sse.py
  • backend/app/services/admin/admin_events_service.py
  • backend/app/services/event_bus.py
  • backend/app/services/event_service.py
  • backend/app/services/kafka_event_service.py
  • backend/app/services/notification_service.py
  • backend/app/services/sse/sse_service.py
  • backend/app/services/user_settings_service.py
  • backend/tests/unit/services/pod_monitor/test_monitor.py
  • backend/tests/unit/services/sse/test_sse_service.py
💤 Files with no reviewable changes (1)
  • backend/app/domain/events/event_metadata.py
🧰 Additional context used
🧬 Code graph analysis (24)
backend/app/db/repositories/admin/admin_user_repository.py (4)
backend/app/domain/user/user_models.py (3)
  • User (44-57)
  • UserListResult (83-91)
  • UserUpdate (60-80)
backend/app/db/repositories/user_repository.py (2)
  • get_user_by_id (22-24)
  • update_user (53-62)
backend/app/services/coordinator/queue_manager.py (1)
  • user_id (34-35)
backend/app/core/security.py (1)
  • get_password_hash (35-36)
backend/app/events/core/consumer.py (3)
backend/app/services/notification_service.py (1)
  • state (163-164)
backend/app/events/core/producer.py (1)
  • state (56-57)
backend/app/services/pod_monitor/monitor.py (1)
  • state (140-142)
backend/app/services/event_service.py (3)
backend/app/domain/events/typed.py (1)
  • ArchivedEvent (219-235)
backend/app/db/repositories/event_repository.py (1)
  • get_event (68-72)
backend/app/services/coordinator/queue_manager.py (1)
  • user_id (34-35)
backend/app/domain/admin/overview_models.py (1)
backend/app/domain/events/event_models.py (1)
  • EventStatistics (125-136)
backend/app/domain/replay/models.py (1)
backend/app/services/coordinator/queue_manager.py (1)
  • execution_id (30-31)
backend/app/services/admin/admin_events_service.py (2)
backend/tests/unit/services/idempotency/test_middleware.py (1)
  • event (32-36)
backend/app/domain/events/query_builders.py (1)
  • limit (22-24)
backend/app/services/user_settings_service.py (3)
backend/app/domain/user/settings_models.py (2)
  • DomainUserSettingsChangedEvent (72-89)
  • DomainUserSettings (34-47)
backend/app/db/repositories/user_settings_repository.py (1)
  • get_settings_events (30-57)
backend/app/domain/events/query_builders.py (1)
  • limit (22-24)
backend/app/api/routes/events.py (2)
backend/app/domain/events/typed.py (1)
  • BaseEvent (27-38)
backend/app/infrastructure/kafka/events/base.py (1)
  • BaseEvent (13-37)
backend/app/db/repositories/notification_repository.py (1)
backend/app/domain/notification/models.py (4)
  • DomainNotification (16-45)
  • DomainNotificationUpdate (96-108)
  • DomainNotificationSubscription (48-67)
  • DomainSubscriptionUpdate (111-126)
backend/app/db/repositories/saga_repository.py (1)
backend/app/domain/saga/models.py (1)
  • Saga (10-27)
backend/tests/unit/services/pod_monitor/test_monitor.py (1)
backend/app/db/repositories/event_repository.py (1)
  • store_event (39-52)
backend/app/db/repositories/admin/admin_events_repository.py (3)
backend/app/domain/replay/models.py (1)
  • ReplayFilter (11-88)
backend/app/db/repositories/replay_repository.py (1)
  • update_replay_session (71-79)
backend/app/domain/admin/replay_updates.py (1)
  • ReplaySessionUpdate (8-23)
backend/app/db/repositories/user_settings_repository.py (1)
backend/app/domain/user/settings_models.py (2)
  • DomainUserSettings (34-47)
  • DomainUserSettingsChangedEvent (72-89)
backend/app/db/repositories/resource_allocation_repository.py (1)
backend/app/domain/saga/models.py (1)
  • DomainResourceAllocation (133-147)
backend/app/domain/events/event_models.py (2)
backend/app/core/utils.py (1)
  • StringEnum (6-31)
backend/tests/unit/services/idempotency/test_middleware.py (1)
  • event (32-36)
backend/app/db/docs/event.py (1)
backend/app/domain/events/typed.py (1)
  • EventMetadata (13-24)
backend/app/db/repositories/sse_repository.py (2)
backend/app/db/repositories/execution_repository.py (1)
  • get_execution (27-35)
backend/app/domain/execution/models.py (1)
  • DomainExecution (22-37)
backend/app/db/repositories/execution_repository.py (1)
backend/app/domain/execution/models.py (2)
  • DomainExecution (22-37)
  • DomainExecutionUpdate (88-98)
backend/app/services/kafka_event_service.py (3)
backend/app/db/repositories/event_repository.py (1)
  • store_event (39-52)
backend/app/events/event_store.py (1)
  • store_event (46-72)
backend/app/infrastructure/kafka/mappings.py (1)
  • get_event_class_for_type (72-138)
backend/app/db/repositories/replay_repository.py (1)
backend/app/domain/replay/models.py (1)
  • ReplaySessionState (119-146)
backend/tests/unit/services/sse/test_sse_service.py (1)
backend/app/domain/execution/models.py (1)
  • ResourceUsageDomain (13-19)
backend/app/db/repositories/user_repository.py (3)
backend/app/domain/user/user_models.py (2)
  • User (44-57)
  • DomainUserCreate (129-139)
backend/app/db/repositories/admin/admin_user_repository.py (2)
  • create_user (25-28)
  • get_user_by_id (53-55)
backend/app/services/coordinator/queue_manager.py (1)
  • user_id (34-35)
backend/app/domain/events/typed.py (5)
backend/app/domain/enums/common.py (1)
  • Environment (27-33)
backend/app/domain/enums/storage.py (1)
  • StorageType (16-22)
backend/app/domain/execution/models.py (1)
  • ResourceUsageDomain (13-19)
backend/app/services/coordinator/queue_manager.py (2)
  • user_id (34-35)
  • execution_id (30-31)
backend/app/schemas_pydantic/admin_events.py (1)
  • progress_percentage (104-105)
backend/app/domain/events/__init__.py (2)
backend/app/domain/events/event_models.py (13)
  • EventBrowseResult (94-100)
  • EventDetail (104-109)
  • EventExportRow (183-196)
  • EventFilter (54-65)
  • EventListResult (83-90)
  • EventProjection (140-149)
  • EventQuery (69-79)
  • EventReplayInfo (153-160)
  • EventSortOrder (15-17)
  • EventStatistics (125-136)
  • EventSummary (44-50)
  • HourlyEventCount (113-115)
  • UserEventCount (119-121)
backend/app/domain/events/typed.py (20)
  • BaseEvent (27-38)
  • EventMetadata (13-24)
  • ExecutionAcceptedEvent (61-66)
  • ExecutionCompletedEvent (91-97)
  • ExecutionFailedEvent (100-108)
  • ExecutionQueuedEvent (69-73)
  • ExecutionRequestedEvent (44-58)
  • ExecutionRunningEvent (84-88)
  • ExecutionStartedEvent (76-81)
  • ExecutionTimeoutEvent (111-117)
  • PodCreatedEvent (131-135)
  • PodDeletedEvent (181-185)
  • PodFailedEvent (161-169)
  • PodRunningEvent (145-149)
  • PodScheduledEvent (138-142)
  • PodSucceededEvent (152-158)
  • PodTerminatedEvent (172-178)
  • ResultFailedEvent (199-203)
  • ResultStoredEvent (191-196)
  • UserSettingsUpdatedEvent (209-213)
🪛 GitHub Actions: MyPy Type Checking
backend/app/services/admin/admin_events_service.py

[error] 246-246: "EventFilter" has no attribute "model_dump" [attr-defined]

🪛 GitHub Actions: Ruff Linting
backend/app/domain/saga/models.py

[error] 5-5: F401: 'pydantic.computed_field' imported but unused. Remove unused import.

backend/app/db/docs/event.py

[error] 2-2: F401: 'typing.Any' imported but unused. Remove unused import.

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (4)
  • GitHub Check: Scan Backend
  • GitHub Check: cubic · AI code reviewer
  • GitHub Check: Integration Tests
  • GitHub Check: E2E Tests
🔇 Additional comments (93)
backend/app/db/repositories/saga_repository.py (5)

32-32: LGTM!

Correct migration from asdict(saga) to saga.model_dump() for Pydantic v2 serialization when constructing the Beanie document.


43-47: LGTM!

The use of Saga.model_validate(doc, from_attributes=True) is the correct Pydantic v2 pattern for constructing domain models from Beanie documents. Since Saga has ConfigDict(from_attributes=True), this properly reads attributes directly from the document instance.


62-62: LGTM!

Consistent application of model_validate for list comprehension matches the pattern used elsewhere in the repository.


74-74: LGTM!

Consistent with the other retrieval methods in this repository.


118-118: LGTM!

Completes the consistent migration to model_validate across all saga retrieval paths.

backend/app/domain/saga/models.py (2)

10-27: LGTM!

Clean migration to Pydantic BaseModel with proper use of ConfigDict(from_attributes=True) and Field(default_factory=...) for mutable defaults. The lambda-based datetime factories correctly generate UTC timestamps.


30-41: LGTM!

All remaining model classes are consistently migrated to Pydantic BaseModel with proper configuration. The use of ConfigDict(from_attributes=True) enables attribute-based construction from ORM/document objects across the codebase.

Also applies to: 44-53, 72-79, 82-93, 96-110, 113-130, 133-147, 150-160

backend/app/db/repositories/resource_allocation_repository.py (1)

12-18: LGTM!

Correct migration to Pydantic v2 patterns:

  • create_data.model_dump() properly serializes the Pydantic model for document construction
  • model_validate(doc, from_attributes=True) correctly constructs the domain model from the Beanie document

This aligns with the same pattern used in saga_repository.py.

backend/app/db/repositories/execution_repository.py (5)

20-25: LGTM! Clean migration to Pydantic v2 patterns.

The use of model_dump() and model_validate(..., from_attributes=True) is the correct approach for converting between domain models and Beanie documents. The from_attributes=True parameter properly handles attribute access from the document object.


27-35: LGTM! Consistent Pydantic v2 conversion pattern.

The retrieval and conversion logic correctly uses model_validate(..., from_attributes=True) to construct the domain model from the Beanie document.


37-46: LGTM! Proper use of exclude_none for partial updates.

Using model_dump(exclude_none=True) is the correct approach for partial updates, ensuring only explicitly set fields are updated in the database.


67-78: LGTM! Consistent bulk conversion pattern.

The list comprehension with model_validate(..., from_attributes=True) correctly converts multiple Beanie documents to domain models, maintaining consistency with the single-object retrieval methods.


48-65: LGTM. The resource_usage handling correctly uses model_dump() with a ternary operator to handle the optional field. ResourceUsageDomain is defined as a Pydantic BaseModel and supports the model_dump() method.

backend/app/api/routes/health.py (1)

18-18: Type safety improvement is backward compatible.

The migration from string to native datetime is clean and improves type safety. Pydantic v2 automatically serializes the timezone-aware datetime to ISO 8601 format (e.g., "2025-06-01T12:13:14+00:00"), which matches the previous .isoformat() output exactly. No changes needed for API consumers.

Also applies to: 34-34

backend/app/db/repositories/replay_repository.py (4)

18-23: LGTM! Correct migration to Pydantic model_dump().

The switch from asdict(session) to session.model_dump() correctly aligns with the Pydantic BaseModel-based ReplaySessionState.


25-29: LGTM! Proper use of model_validate with from_attributes.

Using model_validate(doc, from_attributes=True) is the correct Pydantic v2 pattern for constructing a model from a Beanie Document's attributes.


71-79: LGTM! Clean partial update handling.

model_dump(exclude_none=True) correctly excludes None values for partial updates, which is the idiomatic Pydantic v2 approach.


91-96: LGTM! Simplified event batching aligns with payload flattening.

The removal of payload-merging logic and direct use of doc.model_dump(exclude=...) correctly reflects the flattened event structure where event-specific fields are now at the document level.

backend/app/events/core/types.py (2)

149-161: LGTM! Clean migration to Pydantic BaseModel.

The conversion from dataclass to BaseModel with ConfigDict(from_attributes=True) is correct. The timestamp field type change from str | None to datetime | None improves type safety and aligns with the PR's goal of tightening typing.


163-172: LGTM! Consistent with ConsumerMetricsSnapshot migration.

ConsumerStatus correctly follows the same Pydantic BaseModel pattern with from_attributes=True configuration.

backend/app/schemas_pydantic/sse.py (3)

31-31: LGTM! Timestamp type tightened to datetime.

Changing from str | None to datetime | None is consistent with the PR's typing improvements. Pydantic will serialize these to ISO 8601 strings in JSON responses, maintaining compatibility with SSE consumers.


77-78: LGTM! Consistent timestamp typing in notification events.

Both timestamp and created_at fields are now properly typed as datetime | None, aligning with the execution event schema changes.

Also applies to: 88-88


101-101: LGTM! Non-optional datetime for RedisNotificationMessage.

created_at is now a required datetime field without a default, which is appropriate for Redis messages that should always have a creation timestamp.

backend/app/domain/sse/models.py (3)

10-20: LGTM! ShutdownStatus migrated to BaseModel.

The model correctly includes from_attributes=True configuration and properly defines all shutdown status fields.


23-33: LGTM! SSEHealthDomain correctly structured.

The health domain model properly aggregates shutdown status and other health metrics with appropriate typing.


36-48: LGTM! SSEExecutionStatusDomain and SSEEventDomain properly defined.

Both models follow the established Pydantic BaseModel pattern with from_attributes=True. The timestamp field correctly uses datetime type.

backend/app/db/docs/event.py (4)

14-30: LGTM! EventDocument correctly updated for flattened event structure.

The extra="allow" configuration properly enables flexible event-specific fields at the document level, and from_attributes=True enables attribute-based model population. The docstring accurately describes the new storage pattern.


42-44: LGTM! Sparse indexes for event-specific fields.

Using sparse=True is correct for execution_id and pod_name since these fields only exist on certain event types. This avoids indexing documents where these fields are absent.


58-68: LGTM! Text search index updated for flattened structure.

The text search index now correctly references execution_id at the top level instead of payload.execution_id.


72-93: LGTM! EventArchiveDocument mirrors EventDocument changes.

The archive document correctly adopts the same extra="allow" pattern and removes the payload field, maintaining consistency with the active events collection.

backend/app/domain/events/__init__.py (2)

1-43: LGTM! Clean reorganization of event module exports.

The imports are well-organized, separating query/filter/result types from typed event classes. This provides a clear public API surface for the events domain.


45-87: LGTM! Comprehensive __all__ with clear categorization.

The __all__ list is properly organized with comments distinguishing "Query/filter/result types" from "Typed events", making the module's public API discoverable and well-documented.

backend/app/domain/execution/models.py (5)

13-19: LGTM! ResourceUsageDomain correctly migrated.

The model properly uses ConfigDict(from_attributes=True) and defines appropriate default values for resource metrics.


22-37: LGTM! DomainExecution with proper Field defaults.

The use of Field(default_factory=lambda: str(uuid4())) for execution_id and Field(default_factory=lambda: datetime.now(timezone.utc)) for timestamps correctly ensures unique IDs and UTC-aware timestamps on instantiation.


40-51: LGTM! ExecutionResultDomain properly structured.

The model correctly combines required fields (execution_id, status, exit_code, stdout, stderr) with optional fields and appropriate defaults.


54-73: LGTM! LanguageInfoDomain and ResourceLimitsDomain properly defined.

Both models follow the established pattern with from_attributes=True configuration and appropriate field types.


76-98: LGTM! Create and Update models properly defined.

DomainExecutionCreate correctly defines required fields for execution creation, and DomainExecutionUpdate uses Optional types for all fields to support partial updates.

backend/app/domain/user/settings_models.py (3)

46-47: Verify the timestamp default behavior.

Using Field(default_factory=lambda: datetime.now(timezone.utc)) means a new timestamp is generated each time the field's default is evaluated during model creation. This is likely the intended behavior for created_at and updated_at fields.

However, ensure that when updating existing settings, these timestamps are explicitly set rather than relying on defaults, as Pydantic will call the default_factory for any missing field during model instantiation.
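A small sketch of that distinction (assuming Pydantic v2; Settings is a hypothetical trimmed model): the factory only runs when the field is absent, so updates must pass the original timestamp through explicitly to preserve it.

```python
from datetime import datetime, timezone

from pydantic import BaseModel, Field

class Settings(BaseModel):
    created_at: datetime = Field(default_factory=lambda: datetime.now(timezone.utc))

fresh = Settings()  # factory supplies "now" for the missing field
# Passing the value explicitly bypasses the factory, preserving the original.
preserved = Settings(created_at=datetime(2026, 1, 1, tzinfo=timezone.utc))
```

In update paths, re-instantiating from a partial dict without created_at would silently rewrite the creation time, which is the pitfall the comment above is warning about.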


72-89: Event model design looks good.

The DomainUserSettingsChangedEvent with extra="ignore" allows forward compatibility when deserializing events that may contain additional fields not defined in the current schema. This is a good practice for event-driven architectures.

The explicit field definitions provide strong typing while maintaining flexibility.


42-44: LGTM: Proper use of Field defaults.

Using Field(default_factory=...) for mutable default values (lists, dicts, and complex objects) is the correct pattern to avoid shared mutable state across instances.

backend/app/domain/admin/overview_models.py (1)

8-8: LGTM: Consistent type migration.

The migration from Event to DomainEvent aligns with the PR's objective to standardize event typing across the codebase. The import and field type are updated consistently.

Also applies to: 34-34

backend/tests/unit/services/pod_monitor/test_monitor.py (1)

11-11: LGTM: Test doubles aligned with domain changes.

The test infrastructure correctly reflects the migration to DomainEvent. The FakeEventRepository now matches the actual EventRepository signature (as shown in the relevant code snippet at backend/app/db/repositories/event_repository.py:38-51).

Also applies to: 53-57

backend/app/services/notification_service.py (1)

803-803: This concern is unfounded; the code change is correct.

RedisNotificationMessage has created_at typed as datetime (line 101 in backend/app/schemas_pydantic/sse.py), and the message is serialized via model_dump_json() (line 81 in backend/app/services/sse/redis_bus.py). Pydantic v2 natively handles datetime serialization to ISO 8601 format, so passing the raw datetime object is the correct approach and aligns with Pydantic best practices.

Likely an incorrect or invalid review comment.

backend/app/db/repositories/sse_repository.py (1)

16-16: LGTM! Timestamp and model validation improvements.

Both changes align with the PR's migration to typed domain models:

  • Line 16: Using native datetime objects instead of ISO strings provides better type safety and allows downstream consumers to work with properly typed temporal data.
  • Line 23: Using model_validate(doc, from_attributes=True) is the correct Pydantic v2 pattern for constructing domain models from Beanie documents, consistent with the pattern used in execution_repository.py.

Also applies to: 23-23

backend/app/services/sse/sse_service.py (1)

61-61: LGTM! Consistent timestamp type migration.

The systematic replacement of .isoformat() calls with native datetime objects across all SSE event constructions (CONNECTED, SUBSCRIBED, ERROR, SHUTDOWN, HEARTBEAT, NOTIFICATION events) ensures type consistency throughout the SSE flow. This aligns with the Pydantic models' expectations and the repository changes that now return datetime objects.

Also applies to: 75-75, 90-90, 135-135, 148-148, 207-207, 220-220, 234-234

backend/app/api/routes/events.py (1)

300-300: LGTM! Safe metadata access.

The conditional access to result.metadata.correlation_id with the null check prevents AttributeError when metadata is not present.

backend/tests/unit/services/sse/test_sse_service.py (2)

72-72: LGTM! Test aligns with repository timestamp change.

The mock now returns a native datetime object, matching the production SSERepository.get_execution_status behavior updated in this PR.


172-172: LGTM! Keyword arguments improve clarity.

Using explicit keyword arguments for ResourceUsageDomain construction enhances readability and prevents errors from positional argument reordering. This aligns with the Pydantic model definition in backend/app/domain/execution/models.py.

backend/app/db/repositories/saved_script_repository.py (1)

9-9: LGTM! Complete Pydantic v2 migration.

The repository now consistently uses Pydantic v2 patterns throughout:

  • model_dump() replaces asdict() for serializing domain models to dictionaries
  • model_validate(doc, from_attributes=True) replaces manual dict unpacking for constructing domain models from Beanie documents
  • model_dump(exclude_none=True) simplifies update logic by automatically filtering None values

This aligns with the broader codebase migration to Pydantic-based domain models and follows the same pattern used in other repositories throughout this PR.

Also applies to: 11-11, 18-18, 33-33, 35-35, 49-49

backend/app/services/admin/admin_events_service.py (3)

26-40: LGTM!

The use of model_dump(mode="json") correctly serializes the Pydantic EventExportRow model with automatic datetime-to-ISO-string conversion. The dict construction with display-friendly keys is appropriate for CSV export.


239-240: LGTM!

Using model_dump(mode="json") for events correctly serializes DomainEvent instances with automatic datetime conversion, making the removed default=str parameter on line 251 unnecessary.


271-277: The defensive metadata check is necessary and correct.

The metadata field is typed as EventMetadata | None with a default value of None across all DomainEvent variants (inherited from BaseEvent). The null check prevents AttributeError when accessing correlation_id on a potentially None metadata object, making this defensive pattern essential rather than optional.

backend/app/domain/events/event_models.py (4)

5-9: LGTM!

Import changes correctly support the migration to Pydantic-based domain events.


86-86: Consistent migration to DomainEvent.

All event-containing result types now consistently use DomainEvent instead of the old Event dataclass, aligning with the PR's goal of tightening typing across the event system.

Also applies to: 97-97, 107-107, 156-156, 167-167


183-196: LGTM!

The conversion of EventExportRow from a dataclass to a Pydantic BaseModel with from_attributes=True correctly supports attribute-based instantiation and automatic serialization via model_dump(mode="json"). The datetime type for timestamp is appropriate since mode="json" handles ISO string conversion.


171-178: Clarify whether events with missing metadata should be filtered out on line 178.

The filter if e.metadata and not e.metadata.service_name.startswith("system-") will exclude events where metadata is None. If metadata is expected to always be present, this guard is redundant. If it can be None, confirm whether such events should be silently excluded or handled differently.

backend/app/services/event_bus.py (3)

20-28: LGTM!

Converting EventBusEvent to a Pydantic BaseModel with datetime instead of str for the timestamp improves type safety and aligns with the PR's goal of tightening typing.


157-164: LGTM!

Creating the event with a datetime timestamp directly is correct. Pydantic will automatically handle serialization to ISO string format when needed (e.g., for JSON).


290-292: LGTM!

Using EventBusEvent.model_validate(event_dict) correctly deserializes the Kafka message, with Pydantic automatically parsing ISO timestamp strings back to datetime objects.

backend/app/domain/admin/replay_updates.py (1)

3-23: LGTM!

The conversion of ReplaySessionUpdate from a dataclass to a Pydantic BaseModel with from_attributes=True is clean and consistent with the broader Pydantic v2 migration across the codebase.

backend/app/db/repositories/admin/admin_user_repository.py (4)

25-28: LGTM!

The use of model_dump() for serialization and model_validate(doc, from_attributes=True) for deserialization correctly implements the Pydantic v2 pattern for Beanie document handling.


30-51: LGTM!

The list comprehension using model_validate(doc, from_attributes=True) correctly converts Beanie documents to domain models, consistent with the repository pattern.


53-55: LGTM!

Correct use of model_validate with proper None handling.


57-70: LGTM!

The update method correctly uses model_dump(exclude_none=True) to only update fields that were set, preserves password hashing logic, and properly returns the updated domain model via model_validate.

backend/app/services/event_service.py (2)

8-15: LGTM!

The updated imports align with the PR's domain event typing strategy, replacing generic Event types with DomainEvent and ArchivedEvent.


179-191: Good defensive access to metadata.

The conditional check event.metadata.user_id if event.metadata else None guards against missing metadata, preventing potential AttributeErrors.

backend/app/db/repositories/notification_repository.py (3)

24-27: LGTM!

Correctly migrated to Pydantic v2: model_dump() for serialization and model_validate(from_attributes=True) for domain model construction from Beanie documents.


29-38: Proper partial update pattern.

Using model_dump(exclude_none=True) ensures only provided fields are updated, preventing None values from overwriting existing data.


66-91: Consistent model conversion in list operations.

The list comprehension correctly applies model_validate(from_attributes=True) to each document, maintaining consistency with the repository's conversion pattern.

backend/app/db/repositories/user_repository.py (2)

13-20: LGTM!

The repository correctly adopts Pydantic v2 patterns for both user retrieval and creation, using model_dump() for document construction and model_validate(from_attributes=True) for domain model conversion.


53-62: Proper update handling with Pydantic v2.

The update flow correctly uses model_dump(exclude_none=True) to extract only provided fields and updates the updated_at timestamp before persisting.

backend/app/services/kafka_event_service.py (3)

105-116: LGTM!

The Kafka event construction properly mirrors the domain event structure while using the Avro metadata for Kafka-specific publishing.


243-256: Consistent domain event construction in publish_base_event.

The method correctly extracts payload by excluding base fields and reconstructs a typed domain event via domain_event_adapter, maintaining consistency with the primary publish_event flow.


81-97: No actionable issue found. The code explicitly excludes base fields (event_id, event_type, event_version, timestamp, aggregate_id, metadata) when extracting the payload (line 244), preventing any field conflicts when spreading **payload into event_data. The event classes do not override base fields, and the discriminated union properly validates the combined structure.

Likely an incorrect or invalid review comment.

backend/app/db/repositories/admin/admin_events_repository.py (2)

47-62: LGTM!

The browse_events method correctly uses domain_event_adapter.validate_python(doc, from_attributes=True) to convert Beanie documents to typed domain events, aligning with the PR's domain event typing strategy.


187-196: LGTM!

The replay session persistence and retrieval correctly use model_dump() and model_validate(from_attributes=True), maintaining consistency with Pydantic v2 patterns.

backend/app/domain/replay/models.py (2)

91-117: PrivateAttr progress callback approach is clean.


119-161: Pydantic BaseModel migration + UTC defaults look good.

backend/app/domain/saved_script/models.py (1)

8-39: Migration to BaseModel + Field factories looks good.

backend/app/db/repositories/user_settings_repository.py (1)

16-28: Snapshot serialize/deserialize via model_validate / model_dump is aligned with the PR goals.

backend/app/services/user_settings_service.py (1)

68-86: Rebuild-from-snapshot+typed-events flow looks consistent.

backend/app/domain/user/user_models.py (1)

35-151: BaseModel conversion is straightforward and consistent.

backend/app/domain/notification/models.py (1)

16-126: Pydantic migration + default factories look good (no shared-mutable defaults).

backend/app/db/repositories/event_repository.py (2)

54-67: DomainEvent adapter usage across reads/writes is consistent and simplifies typing.

Also applies to: 68-73, 74-144, 256-304, 339-367


145-155: Remove this comment; EventDocument does not declare an execution_id field.

EventDocument stores execution_id dynamically via extra="allow" configuration and only defines it as a sparse index, not as a field. Since it's not a declared field, the raw dict clause {"execution_id": execution_id} is the correct and only approach to query it.

Likely an incorrect or invalid review comment.

backend/app/domain/events/typed.py (8)

13-24: LGTM!

EventMetadata is well-structured with appropriate defaults and optional fields. The correlation_id default factory ensures unique IDs, and from_attributes=True enables ORM compatibility.


27-38: LGTM!

BaseEvent correctly uses Pydantic v2 patterns with appropriate default factories for UUIDs and timezone-aware datetimes. The 30-day TTL default is reasonable for event retention.
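The default-factory pattern praised here can be illustrated with a simplified sketch (field names mirror the review but are assumptions about the real class):

```python
import uuid
from datetime import datetime, timezone
from pydantic import BaseModel, Field

# Sketch of the BaseEvent pattern: default_factory runs once per instance,
# so every event gets a fresh UUID and a timezone-aware timestamp.
class BaseEvent(BaseModel):
    event_id: str = Field(default_factory=lambda: str(uuid.uuid4()))
    timestamp: datetime = Field(default_factory=lambda: datetime.now(timezone.utc))
    ttl_days: int = 30  # retention default noted in the review

a, b = BaseEvent(), BaseEvent()
assert a.event_id != b.event_id        # fresh UUID per event
assert a.timestamp.tzinfo is not None  # timezone-aware, never naive
```

A plain default (default=uuid.uuid4()) would be evaluated once at class definition and shared by every instance; default_factory avoids that trap.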


44-126: LGTM!

The execution event classes are well-designed with:

  • Proper Literal discriminators for each event type
  • Appropriate optional vs required fields
  • Consistent use of ResourceUsageDomain for resource tracking
  • Sensible defaults for output fields (stdout, stderr)

131-186: LGTM!

Pod lifecycle events are consistently structured with appropriate defaults. The container_statuses field as a string (line 149) works for simple serialization, though if richer status data is needed later, consider making it a structured type.


191-204: LGTM!

Result events correctly capture storage outcomes with appropriate optional fields.


209-214: LGTM!

UserSettingsUpdatedEvent correctly separates the subject user_id (whose settings changed) from metadata.user_id (who triggered the change). The changed_fields list is a good pattern for audit trails.


219-236: LGTM!

ArchivedEvent intentionally extends BaseModel directly rather than BaseEvent, preserving original event values without applying defaults. The archive-specific fields (deleted_at, deleted_by, deletion_reason) provide good audit context.


1-10: LGTM!

Imports are well-organized and all appear to be used. The Pydantic v2 imports (ConfigDict, Discriminator, TypeAdapter) are the correct modern API.


@cubic-dev-ai cubic-dev-ai bot left a comment


4 issues found across 41 files (changes from recent commits).

Prompt for AI agents (all issues)

Check if these issues are valid — if so, understand the root cause of each and fix them.


<file name="backend/app/services/event_bus.py">

<violation number="1" location="backend/app/services/event_bus.py:27">
P1: Changing `timestamp` from `str` to `datetime` breaks JSON serialization in `publish()`. The existing `json.dumps(vars(event))` at line 139 cannot serialize `datetime` objects. Replace `vars(event)` with `event.model_dump(mode='json')` which properly converts datetime to ISO string, or use `event.model_dump_json()` directly.</violation>
</file>

<file name="backend/app/domain/events/event_models.py">

<violation number="1" location="backend/app/domain/events/event_models.py:178">
P2: Events with `metadata=None` are silently filtered out when `include_system_events=False`. Consider explicitly handling the None case - perhaps events without metadata should be included since they're not definitively system events.</violation>
</file>

<file name="backend/app/domain/saga/models.py">

<violation number="1" location="backend/app/domain/saga/models.py:66">
P2: `computed_field` is imported but unused. The `@property` decorator on `has_more` won't include this field in serialization (`model_dump()`). If `has_more` needs to be serialized (as the original dataclass behavior suggests), use `@computed_field` instead of `@property`.</violation>
</file>

<file name="backend/app/db/repositories/event_repository.py">

<violation number="1" location="backend/app/db/repositories/event_repository.py:334">
P2: The refactored code no longer excludes `id` and `revision_id` when creating the archived document. The old code explicitly did `doc.model_dump(exclude={"id", "revision_id"})`, ensuring a new MongoDB `_id` would be generated on insert. The new `model_validate(doc, from_attributes=True)` copies the original document's `id`, causing the archive to be inserted with the same `_id` as the source document. While this works (different collection), it's a behavioral change that may not be intentional.</violation>
</file>

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.


@cubic-dev-ai cubic-dev-ai bot left a comment


4 issues found across 32 files (changes from recent commits).

Prompt for AI agents (all issues)

Check if these issues are valid — if so, understand the root cause of each and fix them.


<file name="backend/tests/integration/dlq/test_dlq_discard.py">

<violation number="1" location="backend/tests/integration/dlq/test_dlq_discard.py:151">
P2: `stats_before.by_status` stores counts with string keys (e.g., "discarded"), but this lookup now passes the enum object instead of its string value, so it never finds the existing count and the test can pass even if stats do not change.</violation>

<violation number="2" location="backend/tests/integration/dlq/test_dlq_discard.py:160">
P2: `stats_after.by_status` also uses string keys, so looking up with the enum object never reflects the updated discard count.</violation>
</file>

<file name="backend/app/events/core/dispatcher.py">

<violation number="1" location="backend/app/events/core/dispatcher.py:171">
P2: Return type annotation mismatch: the method signature declares `dict[str, dict[str, int]]` but now returns `dict[EventType, dict[str, int]]`. Update the return type annotation to match the actual return type.</violation>
</file>

<file name="backend/tests/conftest.py">

<violation number="1" location="backend/tests/conftest.py:138">
P1: The fixture now closes the Redis client in its finally block, but the comment explicitly says "Don't close here - Dishka's RedisProvider handles cleanup when scope exits". The `RedisProvider` in `backend/app/core/providers.py` already calls `await client.aclose()` in its teardown. This will cause a double-close scenario which can lead to errors or unexpected behavior. Either remove the `aclose()` call (keeping the original behavior), or update/remove the misleading comment if manual cleanup is now intentionally required.</violation>
</file>

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.


@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 6

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (6)
backend/app/api/routes/admin/events.py (1)

196-214: Replace asdict(session) with session.model_dump() and remove the asdict import—ReplaySessionState is a Pydantic model, not a dataclass.

The current code calls asdict() on ReplaySessionState, which is a Pydantic BaseModel (not a dataclass). This will crash at runtime with TypeError. Use session.model_dump() instead, which is the standard Pydantic method used throughout the codebase for similar conversions. The asdict import from dataclasses can be removed since it's no longer needed in this file.

Proposed fix
-from dataclasses import asdict
 from datetime import datetime
 from typing import Annotated

@@ -209,7 +208,7 @@ async def get_replay_status(session_id: str, service: FromDishka[AdminEventsService]) -> EventReplayStatusResponse:
         return EventReplayStatusResponse(
             **{
-                **asdict(session),
+                **session.model_dump(),
                 "status": session.status,
                 "estimated_completion": estimated_completion,
                 "execution_results": execution_results,
backend/tests/integration/test_notifications_routes.py (1)

239-239: Fix enum-to-string comparison bug.

When Pydantic v2 deserializes enum fields, accessing them returns enum members, not string values. Comparing updated_subscription.channel == "in_app" will fail because updated_subscription.channel is NotificationChannel.IN_APP (an enum member), not the string "in_app".

🐛 Proposed fix to use enum members
-        assert updated_subscription.channel == "in_app"
+        assert updated_subscription.channel == NotificationChannel.IN_APP

-        assert updated_subscription.channel == "webhook"
+        assert updated_subscription.channel == NotificationChannel.WEBHOOK

-        assert updated_subscription.channel == "slack"
+        assert updated_subscription.channel == NotificationChannel.SLACK

Also applies to: 277-277, 304-304

backend/tests/unit/services/pod_monitor/test_monitor.py (1)

161-166: Fix mypy “non-overlapping equality” by avoiding narrowing on .state properties.
After assert pm.state == MonitorState.RUNNING, mypy narrows the property's type to that literal for the rest of the function, so the later comparison against MonitorState.STOPPED is flagged as non-overlapping. Snapshot the property into a local variable before asserting (or add a targeted ignore).

Proposed fix
@@
     await pm.__aenter__()
-    assert pm.state == MonitorState.RUNNING
+    state = pm.state
+    assert state == MonitorState.RUNNING
 
     await pm.aclose()
     assert pm.state == MonitorState.STOPPED
@@
     async with create_pod_monitor(cfg, service, _test_logger) as monitor:
-        assert monitor.state == MonitorState.RUNNING
+        state = monitor.state
+        assert state == MonitorState.RUNNING
 
     assert monitor.state == MonitorState.STOPPED
@@
     async with create_pod_monitor(
             cfg, service, _test_logger, k8s_clients=mock_k8s_clients
     ) as monitor:
-        assert monitor.state == MonitorState.RUNNING
+        state = monitor.state
+        assert state == MonitorState.RUNNING
         assert monitor._clients is mock_k8s_clients
         assert monitor._v1 is mock_v1
 
     assert monitor.state == MonitorState.STOPPED

Also applies to: 559-563, 583-591

backend/app/services/rate_limit_service.py (1)

258-278: Address inconsistent algorithm label values in metrics.

The algorithm label now uses enum objects (lines 264, 270, 275), but other code paths use string literals like "disabled", "bypassed", and "no_limit" (lines 210, 234, 247). This inconsistency could cause issues:

  1. If RateLimitAlgorithm enum values are uppercase (e.g., "SLIDING_WINDOW"), they won't match the lowercase string literals.
  2. Metrics aggregation and dashboards may fail to correlate related measurements.
🔧 Recommended fix: standardize algorithm label values

Either define enum members for all algorithm states:

# In app/domain/rate_limit.py
class RateLimitAlgorithm(StrEnum):
    SLIDING_WINDOW = "sliding_window"
    TOKEN_BUCKET = "token_bucket"
    DISABLED = "disabled"
    BYPASSED = "bypassed"
    NO_LIMIT = "no_limit"

Or consistently use .value or str() when passing enums to metrics to ensure predictable string representations.

backend/tests/integration/test_replay_routes.py (2)

187-187: Use enum member instead of string literal.

This line uses the string literal "paused" while the rest of the file consistently uses enum members. This is inconsistent with the PR's enum standardization goals and may cause comparison failures if pause_result.status is an enum member.

🔧 Proposed fix
-            if pause_result.status == "paused":
+            if pause_result.status == ReplayStatus.PAUSED:

364-364: Replace string literals with enum members.

This assertion uses string literals ["created", "pending"] instead of enum members, which is inconsistent with the rest of the file and the PR's enum standardization effort.

Note that "pending" does not appear in the ReplayStatus enum based on the values used elsewhere in this file (CREATED, RUNNING, COMPLETED, FAILED, CANCELLED). Verify whether "pending" is a valid status or if this should be ReplayStatus.CREATED.

🔧 Proposed fix (if "pending" is not a valid status)
-        assert replay_response.status in ["created", "pending"]
+        assert replay_response.status == ReplayStatus.CREATED

Or if both are valid:

-        assert replay_response.status in ["created", "pending"]
+        assert replay_response.status in [ReplayStatus.CREATED, ReplayStatus.PENDING]
🤖 Fix all issues with AI agents
In @backend/app/db/repositories/admin/admin_events_repository.py:
- Line 253: EventDocument currently relies on undeclared fields in model_extra
for execution_id; add an explicit field declaration to the EventDocument model:
add execution_id: str | None = None to the class (analogous to aggregate_id) so
code can reference event.execution_id directly instead of (event.model_extra or
{}).get(...), and ensure any Pydantic config (e.g., extra="allow") remains
unchanged.

In @backend/app/dlq/manager.py:
- Around line 491-514: discard_message_manually currently returns bool but can
raise exceptions and has a TOCTOU race between reading doc.status and performing
side effects; update it to perform an atomic conditional DB update on
DLQMessageDocument (e.g., conditional update where event_id matches and status
NOT IN {DISCARDED, RETRIED, maybe SCHEDULED}) to set a transient state or mark
as DISCARD_IN_PROGRESS, then reload/convert via _doc_to_message and call
_discard_message; also either wrap the whole flow in try/except to log errors
and return False on failure (preserving the bool contract) or explicitly
document that exceptions propagate and remove the implied False-on-error
behavior—reference functions/classes: discard_message_manually,
DLQMessageDocument, DLQMessageStatus, _doc_to_message, _discard_message, and
_monitor_dlq.
- Around line 149-153: The tracing span is recording the unprefixed
self.dlq_topic while the consumer actually subscribes to topic_name =
f"{self.settings.KAFKA_TOPIC_PREFIX}{self.dlq_topic}", causing mismatch; change
the code that sets EventAttributes.KAFKA_TOPIC (where the span is created for
consumed messages) to use msg.topic() (the actual consumed topic) instead of
self.dlq_topic so traces reflect the real topic name. Ensure any other usages in
the message handling path that set Kafka topic attributes also use msg.topic()
for consistency.

In @backend/app/events/core/dispatcher.py:
- Around line 169-171: The return type annotation of get_metrics is incorrect:
it currently states dict[str, dict[str, int]] but the method returns
self._event_metrics which uses EventType keys; update the annotation on
get_metrics to dict[EventType, dict[str, int]] (import or reference the
EventType enum used by self._event_metrics) so the signature matches the actual
return value and type checkers no longer flag a mismatch.

In @backend/tests/conftest.py:
- Around line 133-138: Update the outdated comment above the redis client
fixture to state that the explicit await client.aclose() is intentional
defensive redundancy even though Dishka's RedisProvider also cleans up on scope
exit; reference the redis.Redis client obtained via await scope.get(redis.Redis)
and the use of client.aclose() so readers understand both cleanup paths are
present and idempotent, and optionally apply the same explicit cleanup pattern
to the db fixture (which currently relies solely on Dishka) for consistency.

In @backend/tests/integration/result_processor/test_result_processor.py:
- Line 120: The test is comparing a raw MongoDB field string to an enum member
(doc.get("status") vs ExecutionStatus.COMPLETED) which will fail; update the
assertion to either compare the string value to the enum's value (e.g.,
doc.get("status") == ExecutionStatus.COMPLETED.value) or deserialize the MongoDB
document into your Pydantic/domain model (e.g., build the model from doc and
assert model.status == ExecutionStatus.COMPLETED) so the types match; change the
assertion around doc.get("status") accordingly.
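The DLQ fix above asks for a single atomic compare-and-set instead of read-then-write; in MongoDB that collapses into one find_one_and_update with a status filter. This stdlib-only analogue (all names hypothetical) only sketches the desired state machine and the bool contract, not real atomicity:

```python
# In-memory analogue of the suggested atomic discard: the status check and
# the transition happen in one step, so no caller can observe a stale status
# between the check and the write (the real fix pushes this into the DB).
TERMINAL = {"discarded", "retried"}

def discard_manually(store: dict[str, dict], event_id: str) -> bool:
    doc = store.get(event_id)
    if doc is None or doc["status"] in TERMINAL:
        return False               # no-op on missing/terminal; keep bool contract
    doc["status"] = "discarded"
    return True

store = {"e1": {"status": "pending"}, "e2": {"status": "retried"}}
assert discard_manually(store, "e1") is True
assert discard_manually(store, "e1") is False   # already terminal
assert discard_manually(store, "e2") is False   # terminal from the start
assert discard_manually(store, "missing") is False
```

With Motor/Beanie the equivalent would be a conditional update whose filter excludes terminal statuses, so the database enforces the transition exactly once.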
🧹 Nitpick comments (11)
backend/tests/e2e/test_execution_routes.py (1)

94-94: Optional: Simplify enum membership check.

The list() call is unnecessary since Python enums are iterable and support the in operator directly.

♻️ Simplified assertion
-        assert execution_result.status in list(ExecutionStatusEnum)
+        assert execution_result.status in ExecutionStatusEnum
backend/tests/integration/events/test_consumer_group_monitor_real.py (1)

28-28: Good change; consider using is for consistency.

The update correctly compares to the enum member instead of its string value, aligning with the PR's goal of enum usage consistency.

However, for enum comparisons, prefer is over == (enum members are singletons). The rest of the file consistently uses is for enum comparisons (lines 26, 47, 52, 58, 64, 69, 75, 81, 90).

♻️ Suggested refactor for consistency
-    assert summary["group_id"] == gid and summary["health"] == ConsumerGroupHealth.UNHEALTHY
+    assert summary["group_id"] == gid and summary["health"] is ConsumerGroupHealth.UNHEALTHY
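The singleton property behind this suggestion is easy to demonstrate (enum names here are illustrative):

```python
from enum import Enum

class Health(Enum):
    HEALTHY = "healthy"
    UNHEALTHY = "unhealthy"

status = Health.UNHEALTHY
# Enum members are singletons, so identity and equality agree for members;
# `is` additionally refuses to match look-alike values of other types.
assert status is Health.UNHEALTHY
assert status == Health.UNHEALTHY
assert status is not Health.HEALTHY
```

Since both operators behave the same for genuine members, the preference for `is` is purely a consistency and intent-signaling choice.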
backend/app/services/pod_monitor/monitor.py (2)

199-203: Be explicit about metric label types (string vs enum).
If the metrics layer expects str labels, pass .value (or otherwise ensure StringEnum is a str subtype) to avoid surprising exporter behavior / attribute cardinality changes.

Proposed fix (if metrics want plain strings)
@@
-                        self._metrics.record_pod_monitor_watch_error(ErrorType.RESOURCE_VERSION_EXPIRED)
+                        self._metrics.record_pod_monitor_watch_error(ErrorType.RESOURCE_VERSION_EXPIRED.value)
@@
-                        self._metrics.record_pod_monitor_watch_error(ErrorType.API_ERROR)
+                        self._metrics.record_pod_monitor_watch_error(ErrorType.API_ERROR.value)
@@
-                self._metrics.record_pod_monitor_watch_error(ErrorType.UNEXPECTED)
+                self._metrics.record_pod_monitor_watch_error(ErrorType.UNEXPECTED.value)
@@
-            self._metrics.record_pod_monitor_watch_error(ErrorType.PROCESSING_ERROR)
+            self._metrics.record_pod_monitor_watch_error(ErrorType.PROCESSING_ERROR.value)
@@
-            self._metrics.record_pod_monitor_event_processing_duration(duration, event.event_type)
+            self._metrics.record_pod_monitor_event_processing_duration(duration, event.event_type.value)
@@
-            self._metrics.record_pod_monitor_watch_error(ErrorType.PROCESSING_ERROR)
+            self._metrics.record_pod_monitor_watch_error(ErrorType.PROCESSING_ERROR.value)
@@
-            self._metrics.record_pod_monitor_event_published(event.event_type, phase)
+            self._metrics.record_pod_monitor_event_published(str(event.event_type), phase)

Also applies to: 276-277, 320-325, 339-340


445-457: get_status()["state"] should stay trivially JSON-serializable.
Returning the enum instance is fine if it behaves like a string everywhere this dict is consumed; otherwise consider returning .value.

backend/app/db/repositories/admin/admin_events_repository.py (1)

263-263: Simplify redundant conditional.

The conditional exec_doc.status if exec_doc.status else None is redundant. If exec_doc.status is None or any falsy value, it already evaluates to that value.

♻️ Proposed simplification
-                            "status": exec_doc.status if exec_doc.status else None,
+                            "status": exec_doc.status,

Or, if the intent is to explicitly convert empty strings to None:

-                            "status": exec_doc.status if exec_doc.status else None,
+                            "status": exec_doc.status or None,
backend/app/db/repositories/event_repository.py (3)

39-52: Prefer immutable pattern over mutating model_dump() result.

Lines 40-41 mutate the dictionary returned by model_dump() with setdefault(). While this works (Pydantic v2 returns a new dict), mutating return values is a code smell and could lead to subtle bugs if the implementation changes.

♻️ Proposed fix
-        data = event.model_dump(exclude_none=True)
-        data.setdefault("stored_at", datetime.now(timezone.utc))
-        doc = EventDocument(**data)
+        data = event.model_dump(exclude_none=True)
+        doc = EventDocument(stored_at=datetime.now(timezone.utc), **data)

Or use dict merging for clarity:

-        data = event.model_dump(exclude_none=True)
-        data.setdefault("stored_at", datetime.now(timezone.utc))
-        doc = EventDocument(**data)
+        data = {"stored_at": datetime.now(timezone.utc), **event.model_dump(exclude_none=True)}
+        doc = EventDocument(**data)

54-66: Same mutation issue in batch operation.

Lines 60-61 have the same dictionary mutation pattern as store_event. Apply the same refactoring for consistency.

♻️ Proposed fix
         docs = []
         for event in events:
-            data = event.model_dump(exclude_none=True)
-            data.setdefault("stored_at", now)
-            docs.append(EventDocument(**data))
+            data = {"stored_at": now, **event.model_dump(exclude_none=True)}
+            docs.append(EventDocument(**data))

100-100: Remove redundant list() conversion.

The list() call is unnecessary since event_types is already typed as list[EventType] | None. This redundant conversion adds no value and could mask type issues.

♻️ Proposed fix
-            In(EventDocument.event_type, list(event_types)) if event_types else None,
+            In(EventDocument.event_type, event_types) if event_types else None,
backend/app/dlq/manager.py (3)

220-223: Ensure event_type is a string-like value for metrics attributes.

You now pass message.event_type directly into DLQ metrics (Line 222, 353, 372). If event_type is ever a non-str Enum (not str/StrEnum), OpenTelemetry metrics attribute encoding can break or produce unexpected labels.

Recommendation: either guarantee DLQMessage.event_type is str/StringEnum at the model boundary, or normalize once (e.g., .value) before recording metrics.

Also applies to: 352-354, 370-373


231-241: OpenTelemetry span attributes must be primitives; verify enum types.

Using EventAttributes.* keys is good, but please confirm both self.dlq_topic and dlq_message.event_type are str-subclasses (not plain Enums), since span attributes must be primitives / sequences of primitives.


447-460: Log status serialization: confirm it’s JSON-safe in your logging pipeline.

extra={"status": doc.status} (Line 455) is fine if DLQMessageStatus is str-like; if not, structured logging JSON encoders often fail on Enums. Consider doc.status.value if you see serialization issues.

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 67f92b3 and f06e4af.

📒 Files selected for processing (32)
  • backend/app/api/routes/admin/events.py
  • backend/app/db/repositories/admin/admin_events_repository.py
  • backend/app/db/repositories/event_repository.py
  • backend/app/db/repositories/notification_repository.py
  • backend/app/dlq/manager.py
  • backend/app/events/consumer_group_monitor.py
  • backend/app/events/core/dispatcher.py
  • backend/app/events/core/producer.py
  • backend/app/services/execution_service.py
  • backend/app/services/kafka_event_service.py
  • backend/app/services/notification_service.py
  • backend/app/services/pod_monitor/monitor.py
  • backend/app/services/rate_limit_service.py
  • backend/app/services/result_processor/processor.py
  • backend/app/services/saga/saga_orchestrator.py
  • backend/scripts/create_topics.py
  • backend/tests/conftest.py
  • backend/tests/e2e/test_execution_routes.py
  • backend/tests/integration/dlq/test_dlq_discard.py
  • backend/tests/integration/dlq/test_dlq_retry.py
  • backend/tests/integration/events/test_consumer_group_monitor_real.py
  • backend/tests/integration/result_processor/test_result_processor.py
  • backend/tests/integration/services/admin/test_admin_user_service.py
  • backend/tests/integration/test_events_routes.py
  • backend/tests/integration/test_notifications_routes.py
  • backend/tests/integration/test_replay_routes.py
  • backend/tests/integration/test_saga_routes.py
  • backend/tests/unit/core/test_security.py
  • backend/tests/unit/events/test_event_dispatcher.py
  • backend/tests/unit/services/pod_monitor/test_event_mapper.py
  • backend/tests/unit/services/pod_monitor/test_monitor.py
  • backend/tests/unit/services/sse/test_sse_service.py
🚧 Files skipped from review as they are similar to previous changes (1)
  • backend/app/services/notification_service.py
🧰 Additional context used
🧬 Code graph analysis (12)
backend/tests/integration/events/test_consumer_group_monitor_real.py (1)
backend/app/events/consumer_group_monitor.py (1)
  • ConsumerGroupHealth (15-21)
backend/app/dlq/manager.py (4)
backend/app/core/metrics/dlq.py (3)
  • record_dlq_message_received (49-50)
  • record_dlq_message_retried (52-55)
  • record_dlq_message_discarded (57-60)
backend/app/core/tracing/models.py (1)
  • EventAttributes (9-27)
backend/app/dlq/models.py (1)
  • DLQMessageStatus (10-16)
backend/app/db/repositories/dlq_repository.py (1)
  • _doc_to_message (26-31)
backend/tests/conftest.py (2)
backend/tests/unit/services/pod_monitor/test_monitor.py (1)
  • aclose (73-74)
backend/app/core/lifecycle.py (1)
  • aclose (33-42)
backend/app/services/saga/saga_orchestrator.py (4)
backend/app/events/core/producer.py (1)
  • state (56-57)
backend/app/services/notification_service.py (1)
  • state (163-164)
backend/app/services/pod_monitor/monitor.py (1)
  • state (140-142)
backend/app/events/core/consumer.py (1)
  • state (202-203)
backend/tests/unit/services/pod_monitor/test_monitor.py (6)
backend/tests/unit/conftest.py (1)
  • app (68-69)
backend/app/db/repositories/event_repository.py (1)
  • store_event (39-52)
backend/app/events/event_store.py (1)
  • store_event (46-72)
backend/app/events/core/producer.py (1)
  • state (56-57)
backend/app/services/pod_monitor/monitor.py (2)
  • state (140-142)
  • MonitorState (42-48)
backend/app/events/core/consumer.py (1)
  • state (202-203)
backend/tests/unit/core/test_security.py (1)
backend/app/core/security.py (1)
  • SecurityService (23-122)
backend/tests/integration/dlq/test_dlq_discard.py (1)
backend/app/dlq/models.py (1)
  • DLQMessageStatus (10-16)
backend/tests/integration/dlq/test_dlq_retry.py (1)
backend/app/dlq/models.py (1)
  • DLQMessageStatus (10-16)
backend/app/events/core/dispatcher.py (2)
backend/app/events/core/producer.py (1)
  • metrics (60-61)
backend/app/events/core/consumer.py (1)
  • metrics (206-207)
backend/app/services/pod_monitor/monitor.py (1)
backend/app/core/metrics/kubernetes.py (3)
  • record_pod_monitor_watch_error (146-147)
  • record_pod_monitor_event_processing_duration (137-138)
  • record_pod_monitor_event_published (140-141)
backend/app/services/kafka_event_service.py (3)
backend/app/db/repositories/event_repository.py (1)
  • store_event (39-52)
backend/app/events/event_store.py (1)
  • store_event (46-72)
backend/app/infrastructure/kafka/mappings.py (1)
  • get_event_class_for_type (72-138)
backend/tests/unit/services/sse/test_sse_service.py (1)
backend/app/domain/execution/models.py (1)
  • ResourceUsageDomain (13-19)
🪛 GitHub Actions: MyPy Type Checking
backend/tests/unit/services/pod_monitor/test_monitor.py

[error] 165-165: Non-overlapping equality check (left operand type: "Literal[MonitorState.RUNNING]", right operand type: "Literal[MonitorState.STOPPED]")


[error] 562-562: Non-overlapping equality check (left operand type: "Literal[MonitorState.RUNNING]", right operand type: "Literal[MonitorState.STOPPED]")


[error] 590-590: Non-overlapping equality check (left operand type: "Literal[MonitorState.RUNNING]", right operand type: "Literal[MonitorState.STOPPED]")

backend/app/api/routes/admin/events.py

[error] 209-209: No overload variant of "asdict" matches argument type "ReplaySessionState"

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: cubic · AI code reviewer
  • GitHub Check: Integration Tests
  • GitHub Check: E2E Tests
🔇 Additional comments (43)
backend/tests/integration/dlq/test_dlq_retry.py (1)

125-137: Good alignment: use enum member keys for by_status.

Switching to stats_*.by_status.get(DLQMessageStatus.RETRIED, 0) is consistent with enum-typed status usage and avoids .value drift. Only thing to sanity-check: DLQRepository.get_dlq_stats() should return a by_status mapping keyed by DLQMessageStatus (vs str) after any (de)serialization steps.

backend/tests/integration/dlq/test_dlq_discard.py (1)

149-161: Good alignment: use enum member keys for by_status.

Using DLQMessageStatus.DISCARDED as the lookup key matches the direction of the PR. Please just confirm get_dlq_stats().by_status is actually keyed by DLQMessageStatus in the returned stats model (not str keys coming from persistence/JSON).

backend/tests/integration/services/admin/test_admin_user_service.py (1)

21-21: No changes needed. The change from UserRole.USER.value to UserRole.USER is correct and safe. Since UserRole is a StringEnum (which inherits from StrEnum), enum members are strings at the Python level. Motor correctly serializes them to the database as string values. This usage is consistent throughout the codebase in unit tests, Pydantic schemas, and services.

Likely an incorrect or invalid review comment.

backend/app/services/kafka_event_service.py (6)

15-15: LGTM!

The import of domain_event_adapter aligns with the PR's objective to use a discriminated union adapter for creating typed domain events.


86-97: Well-structured domain event creation via discriminated union adapter.

The use of domain_event_adapter.validate_python() ensures type-safe event creation based on the event_type discriminator. The approach correctly separates domain event storage from Kafka event construction.

Note that **payload is spread last, so any key in payload matching a base field name (e.g., event_id) would override it. This appears intentional for event-specific field flattening, but callers should ensure payload doesn't accidentally include base field names.
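The discriminated-union adapter pattern described here can be sketched as follows (event classes and field names are hypothetical, not the repository's actual `domain_event_adapter` definition):

```python
from typing import Annotated, Literal, Union

from pydantic import BaseModel, Field, TypeAdapter


class ExecutionRequested(BaseModel):
    event_type: Literal["execution_requested"]
    event_id: str
    execution_id: str


class ExecutionCompleted(BaseModel):
    event_type: Literal["execution_completed"]
    event_id: str
    execution_id: str
    exit_code: int


# The discriminator field selects the concrete class during validation.
DomainEvent = Annotated[
    Union[ExecutionRequested, ExecutionCompleted],
    Field(discriminator="event_type"),
]
domain_event_adapter: TypeAdapter[DomainEvent] = TypeAdapter(DomainEvent)

event = domain_event_adapter.validate_python({
    "event_type": "execution_completed",
    "event_id": "evt-1",
    "execution_id": "exec-1",
    "exit_code": 0,
})
assert isinstance(event, ExecutionCompleted)
```

The `**payload`-last caveat applies here too: if the spread dict carried an `event_id` key, it would silently win over any base value merged before it.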


104-116: LGTM!

Good consistency: kafka_event_data correctly sources event_id from domain_event.event_id, ensuring the stored domain event and published Kafka event share the same identifier. The separation of avro_metadata for Kafka and domain_metadata for storage is appropriate.


118-123: LGTM!

The correlation_id is correctly sourced from domain_metadata, which inherits it from the enriched avro_metadata. The empty string fallback handles the None case appropriately for header values.


241-256: LGTM!

The domain event creation pattern in publish_base_event is consistent with publish_event. The base_fields set correctly filters out non-payload fields, and the discriminated union adapter ensures proper typed event instantiation.


232-232: EventType is StringEnum (StrEnum-based), confirming str() removal is correct.

The removal of str() casting for event.event_type at lines 232, 260, and 288 is valid. EventType extends StringEnum, a custom wrapper around Python's StrEnum, which means enum values are inherently strings. This is compatible with:

  • Line 232: OpenTelemetry span.set_attribute() accepts enum values
  • Line 260: headers dict typed as Dict[str, str]
  • Line 288: Logging extra dict expectations

No type issues arise from this change.

backend/app/services/result_processor/processor.py (1)

303-308: No issues found - the change is correct and safe.

The ProcessingState enum is a StringEnum (inheriting from Python's StrEnum), which means enum members ARE actual str instances. Returning self._state (the enum object) instead of self._state.value is functionally equivalent because the enum member itself is a string. The StringEnum class already handles string serialization correctly through its __str__ and __repr__ methods, and all current usages—logging in workers and string method calls in tests—work without issues.

backend/tests/integration/test_notifications_routes.py (4)

56-56: Verify enum member comparison consistency.

The assertions now compare against list(NotificationChannel) and list(NotificationStatus), which contain enum members. This works correctly when the Pydantic NotificationListResponse model deserializes JSON strings to enum members.

However, note the inconsistency with lines 239, 277, 304 where channel comparisons use string literals. Ensure the comparison approach is uniform across the test suite.

Also applies to: 59-59


69-82: Enum usage is correct.

Using enum members directly in the status list and URL parameters aligns with the PR's goal of enum usage consistency. The StrEnum will serialize correctly in the URL, and the assertion properly validates filtered results.


105-105: LGTM: Enum member in query parameter.

Using NotificationStatus.DELIVERED directly in the f-string works correctly for StrEnum, which will use its string value in the URL.


204-204: Enum member comparison is correct.

The pattern subscription.channel in list(NotificationChannel) properly validates that the Pydantic-parsed channel is a valid enum member.

backend/app/db/repositories/notification_repository.py (4)

25-27: LGTM: Pydantic migration pattern is correct.

The conversion from dict-based operations to model_dump()/model_validate() follows Pydantic v2 best practices:

  • model_dump() serializes domain models to dicts for MongoDB operations
  • model_validate(..., from_attributes=True) deserializes documents to typed domain models
  • exclude_none=True prevents overwriting existing fields with None during updates

Also applies to: 35-35, 44-44


91-91: LGTM: Consistent model_validate usage in list operations.

All list comprehensions properly convert MongoDB documents to DomainNotification instances using model_validate(doc, from_attributes=True).

Also applies to: 131-131, 144-144


162-162: LGTM: Subscription methods migrated correctly.

The subscription CRUD operations properly use:

  • model_dump(exclude_none=True) for building update payloads
  • model_validate(..., from_attributes=True) for converting documents to DomainNotificationSubscription

The pattern maintains the existing updated_at timestamp handling while leveraging Pydantic's serialization.

Also applies to: 168-168, 173-173, 181-181, 188-188


200-200: LGTM: Simplified enum logging.

Logging list(roles) directly is cleaner than manually extracting .value attributes. For StrEnum, the string representation will display correctly in logs.

backend/tests/unit/services/sse/test_sse_service.py (3)

68-73: LGTM: Timestamp standardization in mock.

The mock now returns a datetime object instead of an ISO string, aligning with the PR's timestamp standardization effort. This is appropriate for test mocking.


172-172: LGTM: Explicit keyword arguments for Pydantic model.

Using keyword arguments for ResourceUsageDomain is best practice with Pydantic models and improves readability.


154-154: EventType is correctly implemented as StrEnum for direct string comparisons.

EventType extends the custom StringEnum class, which is a StrEnum subclass from Python's standard library. This allows the JSON-deserialized string values from _decode() to be compared directly with EventType enum members. The assertions at lines 154 and 189 are correct.

backend/tests/unit/services/pod_monitor/test_monitor.py (1)

11-12: DomainEvent migration in test doubles is consistent.
FakeEventRepository now matches the repo/service contract using DomainEvent and event.event_id.

Also applies to: 53-57

backend/scripts/create_topics.py (1)

48-55: No changes needed — the code is correct as-is.

The KafkaTopic enum is a StringEnum (not a plain Enum), and the custom StringEnum class explicitly overrides __format__ to return the string value directly. When you use f"{topic}", it returns the enum's .value (e.g., "execution_events"), not a representation like ClassName.MEMBER. Additionally, get_topic_configs() is keyed by KafkaTopic enum objects, and the lookup correctly uses the same enum object as the key. Both the topic naming and config lookup are already consistent and safe.

Likely an incorrect or invalid review comment.
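The `__format__` override mentioned here matters because f-string output of mixed-in enums has shifted across Python versions; a sketch of the idea (not the repository's actual `StringEnum` implementation):

```python
from enum import Enum


class StringEnum(str, Enum):
    """Str-backed enum whose f-string output is always the raw value.

    Without the override, format() on a str+Enum mixin can yield
    'ClassName.MEMBER'-style output on some Python versions.
    """

    def __format__(self, spec: str) -> str:
        return format(self.value, spec)


class KafkaTopic(StringEnum):
    EXECUTION_EVENTS = "execution_events"


# f-string interpolation produces the topic name, version-independently.
assert f"{KafkaTopic.EXECUTION_EVENTS}" == "execution_events"
```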

backend/app/services/rate_limit_service.py (3)

300-307: No action needed. status.algorithm is a StringEnum (which extends Python's StrEnum), and since StrEnum is a string subclass, it is natively compatible with OpenTelemetry span attributes. The code correctly passes the enum value directly to span.set_attribute() without conversion, and no type checking issues will occur.


155-165: No action needed. The enum-based labels are fully compatible with the metrics library.

The labels use StringEnum, a custom class inheriting from StrEnum. Since StringEnum is a proper string type (inheriting from str), it works seamlessly with OpenTelemetry's attribute handling without any type conversion or compatibility issues.


26-36: The enum serialization implementation is correct and requires no changes.

Both EndpointGroup and RateLimitAlgorithm inherit from StringEnum (a custom StrEnum subclass in app.core.utils), which inherits from str. This allows json.dumps() to serialize enum objects directly as their string values without raising a TypeError. The deserialization in _rule_from_dict() correctly reconstructs enums from string values using the enum constructors. The round-trip serialization cycle (enum → JSON string → enum) works correctly, and integration tests confirm this behavior is functioning as intended.

backend/app/db/repositories/admin/admin_events_repository.py (4)

60-60: LGTM!

The use of domain_event_adapter.validate_python with from_attributes=True correctly converts Beanie documents to typed DomainEvent instances.


167-167: LGTM!

The change from doc.timestamp.isoformat() to doc.timestamp aligns with the PR objective to standardize timestamps to datetime objects across the codebase.


178-185: LGTM!

The signature update to accept DomainEvent and the use of event.model_dump() for archival document construction correctly aligns with the Pydantic migration.


188-188: LGTM!

The replay session persistence correctly uses Pydantic model methods:

  • model_dump() for creating documents
  • model_validate(..., from_attributes=True) for converting documents to domain models
  • exclude_none=True for partial updates

Also applies to: 196-196, 199-199

backend/app/db/repositories/event_repository.py (3)

149-152: LGTM!

The plain dict query {"execution_id": execution_id} at line 150 correctly queries the top-level execution_id field after payload flattening. The Or condition appropriately handles both the flattened field and the aggregate_id fallback.


333-337: LGTM!

The use of model_validate(..., from_attributes=True).model_copy(update=archive_fields) correctly converts documents and adds archival metadata without mutating the original.


360-360: LGTM!

Correctly omits from_attributes=True when validating aggregation pipeline results, which are already plain dicts rather than Beanie documents.

backend/app/dlq/manager.py (1)

321-338: The code is type-safe as-is. inject_trace_context() has an explicit return type of dict[str, str], and OpenTelemetry's text-format propagators inject only US-ASCII strings. No defensive encoding is needed.

Likely an incorrect or invalid review comment.

backend/app/events/consumer_group_monitor.py (1)

419-419: No changes needed. The code at line 419 correctly returns the ConsumerGroupHealth enum directly.

StringEnum extends Python's StrEnum, which means enum instances inherit from str and are natively serializable to JSON without special handling. The test suite confirms this is the intended behavior (assert summary["health"] == ConsumerGroupHealth.UNHEALTHY). The timestamp conversion on line 426 is for datetime formatting, not enum value extraction, so there is no inconsistency—they are different data types requiring different transformations.

Likely an incorrect or invalid review comment.

backend/tests/integration/test_replay_routes.py (1)

236-254: The code change is correct and requires no action. ReplayStatus extends StringEnum, which in turn extends Python's StrEnum. The StrEnum class automatically serializes to its string value when used in string contexts like f-strings, so the enum at line 243 will correctly serialize to "created", "running", etc. without needing the .value attribute. The removal of .value aligns with proper StrEnum usage.

backend/tests/unit/services/pod_monitor/test_event_mapper.py (1)

6-6: LGTM: Enum-based comparisons properly updated.

The test correctly migrated from string-based comparisons (e.event_type.value == "pod_scheduled") to enum-based comparisons (e.event_type == EventType.POD_SCHEDULED). This aligns with the event mapper returning EventType enum members and improves type safety.

Also applies to: 53-106, 216-216

backend/tests/integration/test_events_routes.py (1)

301-301: Same enum serialization concern for event publishing payloads.

Similar to the query endpoint, these publish requests send EventType.SYSTEM_ERROR directly in the JSON payload. Verify that the enum serializes correctly to a string value when sent over HTTP.

Also applies to: 320-320

backend/app/services/saga/saga_orchestrator.py (1)

418-421: LGTM - Consistent enum usage in structured logging.

The change to log the enum object directly instead of its .value is consistent with the PR's goal of standardizing enum usage. Python's logging framework will properly handle the enum via its string representation.

backend/tests/unit/events/test_event_dispatcher.py (1)

59-60: LGTM - Test correctly updated to match API change.

The test assertions now use EventType enum members as dictionary keys, correctly matching the updated EventDispatcher.get_metrics() implementation that returns enum-keyed dictionaries.

backend/app/events/core/dispatcher.py (1)

63-63: LGTM - Enum usage in logging.

Using the enum directly in log messages is safe and improves readability. Python will automatically convert the enum to its string representation.

backend/app/events/core/producer.py (1)

128-147: Original review comment is incorrect: ProducerState is a StringEnum, not a standard Enum, so JSON serialization is not an issue.

The StringEnum implementation (which extends Python's StrEnum) is explicitly designed to behave as a string in all contexts. Instances inherit from str and serialize directly as string values with json.dumps(). Test code confirms this works—assertions like st["state"] == "running" and calls to string methods like .lower() all pass, demonstrating the enum functions as a native string.

All actual usage in the codebase (workers, tests) treats the status dict values as strings without any serialization errors.

Likely an incorrect or invalid review comment.

backend/tests/integration/test_saga_routes.py (1)

58-58: LGTM! Enum usage is now consistent.

The migration from .value to direct enum member usage improves type safety and aligns with the broader Pydantic migration. All changes are consistent:

  • Query parameters now pass enum members directly (httpx correctly serializes StrEnum members)
  • Assertions compare against enum members rather than string values
  • Response validation expects enum members (handled by Pydantic deserialization)

Also applies to: 94-94, 185-185, 219-219, 241-241


@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 5

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (3)
backend/tests/unit/services/pod_monitor/test_event_mapper.py (1)

1-239: Fix missing type annotations on inline class attributes.

The MyPy failure originates from this file. The Cond classes defined inline in test_pending_running_and_succeeded_mapping() and test_scheduled_requires_condition() lack type annotations on instance attributes (self.type and self.status), violating the disallow_incomplete_defs = true setting.

Add type annotations to instance attributes:

class Cond:
    type: str
    status: str
    
    def __init__(self, t: str, s: str) -> None:
        self.type = t
        self.status = s
backend/tests/unit/services/idempotency/test_middleware.py (1)

118-122: Remove empty test class shells.

These empty test classes serve no purpose. According to the AI summary, idempotent_handler and IdempotentConsumerWrapper were removed from the import surface. If these components no longer exist or are only tested at integration level, remove these placeholder classes.

🧹 Proposed fix
-class TestIdempotentHandlerDecorator:
-    pass
-
-class TestIdempotentConsumerWrapper:
-    pass
backend/app/api/routes/admin/events.py (1)

206-213: Remove redundant status field override.

The response construction spreads session.model_dump(), which already includes a status field, then explicitly overrides it with session.status on line 209. This is unnecessary and creates code clutter.

While ReplaySessionState.model_dump() returns the status as a StringEnum object (and EventReplayStatusResponse expects str), Pydantic's validation handles this seamlessly since StringEnum inherits from str. The explicit override has no functional effect and should be removed for clarity:

         return EventReplayStatusResponse(
-            **{
-                **session.model_dump(),
-                "status": session.status,
-                "estimated_completion": estimated_completion,
-                "execution_results": execution_results,
-            }
+            **session.model_dump(),
+            estimated_completion=estimated_completion,
+            execution_results=execution_results,
         )
🤖 Fix all issues with AI agents
In
@backend/tests/integration/services/user_settings/test_user_settings_service.py:
- Line 11: Reorder imports in the test module so third-party imports (e.g.,
AsyncContainer from dishka and pytest) come before local app.* imports and after
standard library imports; specifically ensure the standard library imports
(datetime, timezone) remain first, then import pytest and from dishka import
AsyncContainer, and only then import app.domain.enums.Theme,
app.domain.user.settings_models (DomainEditorSettings,
DomainNotificationSettings, DomainUserSettingsUpdate) and
app.services.user_settings_service.UserSettingsService so import ordering
follows PEP 8 conventions.

In @backend/tests/integration/test_admin_routes.py:
- Around line 64-67: Remove the pointless call to original_response.json() in
the test: either delete the line if you don't need the parsed body, or capture
it into a variable (e.g., original_data = original_response.json()) if you
intend to validate or compare the original settings later; locate the call made
after the test_admin.get("/api/v1/admin/settings/") response and update
accordingly.

In @backend/tests/load/plot_report.py:
- Around line 115-118: The list comprehension computing successes uses zip(...,
strict=False) unnecessarily; remove the explicit strict=False since default
behavior suffices. Update the line defining successes (currently successes = [t
- e for t, e in zip(total, errors, strict=False)]) to call zip(total, errors)
instead, keeping the rest of the comprehension unchanged.

In @backend/tests/unit/services/pod_monitor/test_event_mapper.py:
- Line 131: The expression `_ctx(p)` is unused and its result is discarded;
remove this dead call or assign it to a variable if the PodContext is meant to
be used (e.g., store into a variable like `ctx = _ctx(p)`), ensuring any
subsequent assertions or uses reference that variable; specifically update the
test in test_event_mapper.py to either delete the `_ctx(p)` line or replace it
with an assignment to `ctx` (and update references to use `ctx`) so the
PodContext is not silently created and thrown away.
🧹 Nitpick comments (7)
backend/app/api/routes/replay.py (1)

66-66: Avoid duplicate model_dump() calls for better performance.

The expression s.model_dump() is called twice in the same line, which is wasteful. Pydantic's model_dump() can be expensive for complex models with nested structures.

⚡ Proposed optimization
-        SessionSummary.model_validate({**s.model_dump(), **s.model_dump()["config"]})
+        SessionSummary.model_validate({**(dump := s.model_dump()), **dump["config"]})

Or for better readability:

-    return [
-        SessionSummary.model_validate({**s.model_dump(), **s.model_dump()["config"]})
-        for s in service.list_sessions(status=status, limit=limit)
-    ]
+    sessions = []
+    for s in service.list_sessions(status=status, limit=limit):
+        dump = s.model_dump()
+        sessions.append(SessionSummary.model_validate({**dump, **dump["config"]}))
+    return sessions

Additionally, verify that this change resolves the MyPy type checking failure reported in the pipeline. The dict spreading pattern may be contributing to type inference issues. If the error persists, consider adding explicit type hints or using a more structured approach to flatten the config fields.

backend/tests/e2e/test_resource_cleaner_orphan.py (1)

40-47: Simplify the dry-run assertion and avoid .get(...) if the return type is a TypedDict (possible mypy error source).

If cleanup_orphaned_resources() returns a TypedDict (or otherwise tightly typed mapping), res.get("configmaps", []) can trip --strict. Also name in ... is clearer.

Proposed diff
@@
-        # Force as orphaned by using a large cutoff
+        # Treat even just-created resources as eligible by using a zero-hour age threshold
         await cleaner.cleanup_orphaned_resources(namespace=ns, max_age_hours=0, dry_run=True)
@@
             # If cleaner is non-deterministic across runs, re-invoke to reflect current state
             res = await cleaner.cleanup_orphaned_resources(namespace=ns, max_age_hours=0, dry_run=True)
-            assert any(name == cm for cm in res.get("configmaps", []))
+            assert name in res["configmaps"]

Also worth sanity-checking e2e flakiness: with max_age_hours=0, this relies on local datetime.now() vs apiserver creation_timestamp ordering (clock skew can make the resource “not old enough” forever).

backend/tests/integration/services/user_settings/test_user_settings_service.py (1)

1-11: Consider following PEP 8 import ordering.

The dishka import (third-party library) should come before the app.* imports (local application imports) per PEP 8 conventions.

📋 Suggested import ordering
 from datetime import datetime, timezone
 
 import pytest
+from dishka import AsyncContainer
+
 from app.domain.enums import Theme
 from app.domain.user.settings_models import (
     DomainEditorSettings,
     DomainNotificationSettings,
     DomainUserSettingsUpdate,
 )
 from app.services.user_settings_service import UserSettingsService
-from dishka import AsyncContainer
backend/tests/e2e/test_execution_routes.py (1)

116-116: Consider removing or clarifying unused expression.

The expression exec_response.json()["execution_id"] is evaluated but the result is discarded. If this is intended as an implicit validation that the key exists, consider making it explicit:

assert "execution_id" in exec_response.json()

If it's not needed, consider removing it entirely.

♻️ Proposed cleanup

For Line 116 (test_execute_with_error):

-    exec_response.json()["execution_id"]
+    # Execution accepted - error will be processed asynchronously

For Line 272 (test_execution_with_timeout):

-    exec_response.json()["execution_id"]
+    # Execution accepted - will run until timeout

Also applies to: 272-272

backend/tests/integration/services/saga/test_saga_service.py (1)

1-7: Import ordering deviates from convention.

Moving dishka.AsyncContainer (a third-party import) after local app imports contradicts PEP 8 and typical isort conventions, which place third-party imports before local imports. Consider keeping it grouped with pytest:

 from datetime import datetime, timezone
 
 import pytest
+from dishka import AsyncContainer
 from app.domain.enums.user import UserRole
 from app.schemas_pydantic.user import User
 from app.services.saga.saga_service import SagaService
-from dishka import AsyncContainer

The MyPy pipeline failure reported at line 1 appears unrelated to this file's changes—verify the actual MyPy output to identify the root cause.

backend/app/domain/saga/models.py (1)

150-160: Consider removing from_attributes=True from input DTOs.

DomainResourceAllocationCreate is an input/creation DTO, not mapped from ORM objects. The from_attributes=True config is unnecessary here and could be removed to clarify intent:

 class DomainResourceAllocationCreate(BaseModel):
     """Data for creating a resource allocation."""
 
-    model_config = ConfigDict(from_attributes=True)
-
     execution_id: str

This is a minor consistency nit—the code works correctly either way.

backend/tests/integration/notifications/test_notification_sse.py (1)

28-35: Consider capturing the notification for stronger assertions.

While removing the unused variable assignment is correct, the test could be more robust by verifying that the notification_id received via SSE matches the one from the created notification object.

📋 Optional enhancement for test robustness
-    await svc.create_notification(
+    created = await svc.create_notification(
         user_id=user_id,
         subject="Hello",
         body="World",
         tags=["test"],
         severity=NotificationSeverity.MEDIUM,
         channel=NotificationChannel.IN_APP,
     )

Then add an assertion after line 46:

assert msg.notification_id == str(created.notification_id)

This would verify the end-to-end flow integrity between notification creation and SSE delivery.

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between f06e4af and 2970ad7.

📒 Files selected for processing (93)
  • backend/app/api/routes/admin/events.py
  • backend/app/api/routes/replay.py
  • backend/app/db/docs/event.py
  • backend/app/domain/events/event_models.py
  • backend/app/domain/saga/models.py
  • backend/tests/e2e/conftest.py
  • backend/tests/e2e/test_execution_routes.py
  • backend/tests/e2e/test_resource_cleaner_k8s.py
  • backend/tests/e2e/test_resource_cleaner_orphan.py
  • backend/tests/helpers/cleanup.py
  • backend/tests/helpers/k8s_fakes.py
  • backend/tests/helpers/kafka.py
  • backend/tests/integration/app/test_main_app.py
  • backend/tests/integration/conftest.py
  • backend/tests/integration/core/test_container.py
  • backend/tests/integration/db/repositories/test_admin_settings_repository.py
  • backend/tests/integration/db/repositories/test_saved_script_repository.py
  • backend/tests/integration/dlq/test_dlq_discard.py
  • backend/tests/integration/dlq/test_dlq_manager.py
  • backend/tests/integration/events/test_consume_roundtrip.py
  • backend/tests/integration/events/test_consumer_lifecycle.py
  • backend/tests/integration/events/test_dlq_handler.py
  • backend/tests/integration/events/test_event_dispatcher.py
  • backend/tests/integration/events/test_event_store.py
  • backend/tests/integration/events/test_schema_registry_roundtrip.py
  • backend/tests/integration/idempotency/test_consumer_idempotent.py
  • backend/tests/integration/idempotency/test_decorator_idempotent.py
  • backend/tests/integration/idempotency/test_idempotency.py
  • backend/tests/integration/idempotency/test_idempotent_handler.py
  • backend/tests/integration/notifications/test_notification_sse.py
  • backend/tests/integration/result_processor/test_result_processor.py
  • backend/tests/integration/services/admin/test_admin_user_service.py
  • backend/tests/integration/services/coordinator/test_execution_coordinator.py
  • backend/tests/integration/services/events/test_event_bus.py
  • backend/tests/integration/services/events/test_kafka_event_service.py
  • backend/tests/integration/services/execution/test_execution_service.py
  • backend/tests/integration/services/idempotency/test_redis_repository.py
  • backend/tests/integration/services/notifications/test_notification_service.py
  • backend/tests/integration/services/rate_limit/test_rate_limit_service.py
  • backend/tests/integration/services/replay/test_replay_service.py
  • backend/tests/integration/services/saga/test_saga_service.py
  • backend/tests/integration/services/saved_script/test_saved_script_service.py
  • backend/tests/integration/services/sse/test_partitioned_event_router.py
  • backend/tests/integration/services/sse/test_redis_bus.py
  • backend/tests/integration/services/user_settings/test_user_settings_service.py
  • backend/tests/integration/test_admin_routes.py
  • backend/tests/integration/test_alertmanager.py
  • backend/tests/integration/test_auth_routes.py
  • backend/tests/integration/test_dlq_routes.py
  • backend/tests/integration/test_user_settings_routes.py
  • backend/tests/load/cli.py
  • backend/tests/load/config.py
  • backend/tests/load/monkey_runner.py
  • backend/tests/load/plot_report.py
  • backend/tests/load/strategies.py
  • backend/tests/load/user_runner.py
  • backend/tests/unit/conftest.py
  • backend/tests/unit/core/metrics/test_base_metrics.py
  • backend/tests/unit/core/metrics/test_connections_and_coordinator_metrics.py
  • backend/tests/unit/core/metrics/test_database_and_dlq_metrics.py
  • backend/tests/unit/core/metrics/test_execution_and_events_metrics.py
  • backend/tests/unit/core/metrics/test_health_and_rate_limit_metrics.py
  • backend/tests/unit/core/metrics/test_kubernetes_and_notifications_metrics.py
  • backend/tests/unit/core/metrics/test_metrics_classes.py
  • backend/tests/unit/core/metrics/test_metrics_context.py
  • backend/tests/unit/core/metrics/test_replay_and_security_metrics.py
  • backend/tests/unit/core/test_adaptive_sampling.py
  • backend/tests/unit/core/test_csrf.py
  • backend/tests/unit/core/test_security.py
  • backend/tests/unit/core/test_utils.py
  • backend/tests/unit/events/core/test_consumer_config.py
  • backend/tests/unit/events/test_event_dispatcher.py
  • backend/tests/unit/events/test_mappings_and_types.py
  • backend/tests/unit/events/test_schema_registry_manager.py
  • backend/tests/unit/schemas_pydantic/test_events_schemas.py
  • backend/tests/unit/schemas_pydantic/test_execution_schemas.py
  • backend/tests/unit/schemas_pydantic/test_notification_schemas.py
  • backend/tests/unit/services/coordinator/test_queue_manager.py
  • backend/tests/unit/services/coordinator/test_resource_manager.py
  • backend/tests/unit/services/idempotency/__init__.py
  • backend/tests/unit/services/idempotency/test_idempotency_manager.py
  • backend/tests/unit/services/idempotency/test_middleware.py
  • backend/tests/unit/services/pod_monitor/test_config_and_init.py
  • backend/tests/unit/services/pod_monitor/test_event_mapper.py
  • backend/tests/unit/services/pod_monitor/test_monitor.py
  • backend/tests/unit/services/result_processor/__init__.py
  • backend/tests/unit/services/saga/test_execution_saga_steps.py
  • backend/tests/unit/services/saga/test_saga_comprehensive.py
  • backend/tests/unit/services/saga/test_saga_orchestrator_unit.py
  • backend/tests/unit/services/sse/test_shutdown_manager.py
  • backend/tests/unit/services/sse/test_sse_service.py
  • backend/tests/unit/services/sse/test_sse_shutdown_manager.py
  • backend/tests/unit/services/test_pod_builder.py
💤 Files with no reviewable changes (20)
  • backend/tests/unit/services/idempotency/__init__.py
  • backend/tests/unit/core/test_adaptive_sampling.py
  • backend/tests/e2e/test_resource_cleaner_k8s.py
  • backend/tests/load/config.py
  • backend/tests/integration/services/sse/test_redis_bus.py
  • backend/tests/unit/core/metrics/test_health_and_rate_limit_metrics.py
  • backend/tests/unit/schemas_pydantic/test_notification_schemas.py
  • backend/tests/unit/conftest.py
  • backend/tests/unit/services/result_processor/__init__.py
  • backend/tests/unit/core/metrics/test_connections_and_coordinator_metrics.py
  • backend/tests/helpers/cleanup.py
  • backend/tests/unit/events/core/test_consumer_config.py
  • backend/tests/unit/schemas_pydantic/test_execution_schemas.py
  • backend/tests/unit/events/test_schema_registry_manager.py
  • backend/tests/unit/services/pod_monitor/test_config_and_init.py
  • backend/tests/unit/core/metrics/test_database_and_dlq_metrics.py
  • backend/tests/unit/core/metrics/test_metrics_context.py
  • backend/tests/unit/services/coordinator/test_resource_manager.py
  • backend/tests/unit/core/metrics/test_base_metrics.py
  • backend/tests/load/strategies.py
✅ Files skipped from review due to trivial changes (21)
  • backend/tests/unit/services/sse/test_sse_shutdown_manager.py
  • backend/tests/unit/services/sse/test_shutdown_manager.py
  • backend/tests/integration/test_dlq_routes.py
  • backend/tests/helpers/kafka.py
  • backend/tests/integration/events/test_consume_roundtrip.py
  • backend/tests/integration/services/events/test_kafka_event_service.py
  • backend/tests/unit/core/metrics/test_kubernetes_and_notifications_metrics.py
  • backend/tests/integration/db/repositories/test_saved_script_repository.py
  • backend/tests/integration/events/test_dlq_handler.py
  • backend/tests/unit/core/metrics/test_replay_and_security_metrics.py
  • backend/tests/helpers/k8s_fakes.py
  • backend/tests/integration/test_auth_routes.py
  • backend/tests/unit/events/test_mappings_and_types.py
  • backend/tests/integration/test_user_settings_routes.py
  • backend/tests/integration/events/test_consumer_lifecycle.py
  • backend/tests/integration/db/repositories/test_admin_settings_repository.py
  • backend/tests/unit/services/idempotency/test_idempotency_manager.py
  • backend/tests/integration/services/coordinator/test_execution_coordinator.py
  • backend/tests/integration/services/execution/test_execution_service.py
  • backend/tests/unit/services/saga/test_saga_comprehensive.py
  • backend/tests/integration/test_alertmanager.py
🚧 Files skipped from review as they are similar to previous changes (5)
  • backend/tests/unit/events/test_event_dispatcher.py
  • backend/tests/unit/services/sse/test_sse_service.py
  • backend/tests/integration/result_processor/test_result_processor.py
  • backend/tests/integration/dlq/test_dlq_discard.py
  • backend/tests/unit/core/test_security.py
🧰 Additional context used
🧬 Code graph analysis (18)
backend/tests/unit/services/pod_monitor/test_event_mapper.py (3)
backend/tests/helpers/k8s_fakes.py (6)
  • ContainerStatus (59-64)
  • FakeApi (141-146)
  • Pod (83-105)
  • State (47-56)
  • Terminated (34-38)
  • Waiting (41-44)
backend/app/services/pod_monitor/event_mapper.py (1)
  • PodContext (31-38)
backend/app/infrastructure/kafka/events/metadata.py (1)
  • AvroEventMetadata (9-31)
backend/tests/unit/core/metrics/test_execution_and_events_metrics.py (2)
backend/tests/unit/conftest.py (1)
  • app (62-63)
backend/app/core/metrics/execution.py (1)
  • ExecutionMetrics (5-108)
backend/app/api/routes/replay.py (2)
backend/app/schemas_pydantic/replay.py (1)
  • SessionSummary (38-67)
backend/tests/integration/services/sse/test_redis_bus.py (1)
  • model_dump (26-27)
backend/tests/unit/services/test_pod_builder.py (1)
backend/tests/unit/conftest.py (1)
  • client (57-58)
backend/tests/integration/notifications/test_notification_sse.py (1)
backend/app/services/notification_service.py (1)
  • create_notification (258-332)
backend/tests/unit/services/saga/test_saga_orchestrator_unit.py (4)
backend/app/events/schema/schema_registry.py (1)
  • SchemaRegistryManager (53-229)
backend/tests/unit/services/pod_monitor/test_monitor.py (2)
  • produce (69-72)
  • produce (416-419)
backend/tests/unit/services/saga/test_execution_saga_steps.py (1)
  • produce (119-121)
backend/app/events/core/producer.py (1)
  • produce (175-206)
backend/app/api/routes/admin/events.py (1)
backend/tests/integration/services/sse/test_redis_bus.py (1)
  • model_dump (26-27)
backend/tests/unit/core/metrics/test_metrics_classes.py (1)
backend/tests/unit/conftest.py (1)
  • app (62-63)
backend/tests/integration/services/notifications/test_notification_service.py (2)
backend/app/domain/notification/models.py (1)
  • DomainNotificationCreate (78-93)
backend/app/services/coordinator/queue_manager.py (1)
  • user_id (34-35)
backend/tests/integration/services/saved_script/test_saved_script_service.py (1)
backend/app/domain/saved_script/models.py (1)
  • DomainSavedScriptUpdate (31-39)
backend/app/domain/events/event_models.py (4)
backend/tests/unit/conftest.py (1)
  • app (62-63)
backend/app/core/utils.py (1)
  • StringEnum (6-31)
backend/app/schemas_pydantic/admin_events.py (1)
  • EventFilter (20-30)
backend/tests/unit/services/idempotency/test_middleware.py (1)
  • event (30-34)
backend/tests/unit/schemas_pydantic/test_events_schemas.py (1)
backend/app/schemas_pydantic/events.py (1)
  • EventFilterRequest (65-87)
backend/tests/integration/services/rate_limit/test_rate_limit_service.py (1)
backend/app/domain/rate_limit/rate_limit_models.py (1)
  • RateLimitConfig (54-127)
backend/app/db/docs/event.py (1)
backend/app/domain/events/typed.py (1)
  • EventMetadata (13-24)
backend/tests/unit/services/idempotency/test_middleware.py (1)
backend/app/services/idempotency/middleware.py (1)
  • IdempotentEventHandler (14-90)
backend/tests/integration/app/test_main_app.py (1)
backend/app/db/docs/event.py (2)
  • Settings (31-68)
  • Settings (94-99)
backend/tests/unit/services/pod_monitor/test_monitor.py (3)
backend/app/db/repositories/event_repository.py (1)
  • store_event (39-52)
backend/app/events/core/producer.py (1)
  • state (56-57)
backend/app/services/pod_monitor/monitor.py (2)
  • state (140-142)
  • MonitorState (42-48)
backend/tests/e2e/test_resource_cleaner_orphan.py (1)
backend/app/services/result_processor/resource_cleaner.py (2)
  • ResourceCleaner (18-286)
  • cleanup_orphaned_resources (149-173)
🪛 GitHub Actions: MyPy Type Checking
backend/tests/unit/services/pod_monitor/test_event_mapper.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/unit/core/metrics/test_execution_and_events_metrics.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/services/events/test_event_bus.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/core/test_container.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/services/sse/test_partitioned_event_router.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/idempotency/test_consumer_idempotent.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/load/cli.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/test_admin_routes.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/unit/core/test_csrf.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/dlq/test_dlq_manager.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/unit/services/saga/test_execution_saga_steps.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/services/saga/test_saga_service.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/events/test_schema_registry_roundtrip.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/app/api/routes/replay.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/unit/services/test_pod_builder.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/services/replay/test_replay_service.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/notifications/test_notification_sse.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/events/test_event_dispatcher.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/idempotency/test_idempotent_handler.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/load/plot_report.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/unit/services/saga/test_saga_orchestrator_unit.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/app/api/routes/admin/events.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/unit/core/metrics/test_metrics_classes.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/services/user_settings/test_user_settings_service.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/conftest.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/services/notifications/test_notification_service.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/services/saved_script/test_saved_script_service.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/app/domain/saga/models.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/app/domain/events/event_models.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/events/test_event_store.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/load/monkey_runner.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/e2e/conftest.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/unit/schemas_pydantic/test_events_schemas.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/services/rate_limit/test_rate_limit_service.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/unit/services/coordinator/test_queue_manager.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/services/idempotency/test_redis_repository.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/app/db/docs/event.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/e2e/test_execution_routes.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/unit/services/idempotency/test_middleware.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/app/test_main_app.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/unit/services/pod_monitor/test_monitor.py

[error] 166-166: Mypy: Non-overlapping equality check (left operand type: Literal[MonitorState.RUNNING], right operand type: Literal[MonitorState.STOPPED]).


[error] 563-563: Mypy: Non-overlapping equality check (left operand type: Literal[MonitorState.RUNNING], right operand type: Literal[MonitorState.STOPPED]).


[error] 591-591: Mypy: Non-overlapping equality check (left operand type: Literal[MonitorState.RUNNING], right operand type: Literal[MonitorState.STOPPED]).


[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

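The narrowing behind these three errors can be reproduced with a plain Enum; a minimal sketch (the MonitorState members here are illustrative, not the repo's actual definition):

```python
from enum import Enum


class MonitorState(Enum):
    RUNNING = "running"
    STOPPED = "stopped"


def is_stopped_after_start(state: MonitorState) -> bool:
    if state is not MonitorState.RUNNING:
        return False
    # mypy has now narrowed `state` to Literal[MonitorState.RUNNING], so a direct
    # `state == MonitorState.STOPPED` here is a non-overlapping equality check.
    # Widening the annotation makes the comparison legal again:
    current: MonitorState = state
    return current == MonitorState.STOPPED


print(is_stopped_after_start(MonitorState.RUNNING))  # False
```

In test code the usual fix is to re-read the state attribute after the transition under test rather than compare a value mypy has already narrowed.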
backend/tests/unit/core/test_utils.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/e2e/test_resource_cleaner_orphan.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/idempotency/test_idempotency.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/services/admin/test_admin_user_service.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/load/user_runner.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

backend/tests/integration/idempotency/test_decorator_idempotent.py

[error] 1-1: Mypy/pipeline failure: Command 'uv run mypy --config-file pyproject.toml --strict .' exited with code 1.

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
  • GitHub Check: E2E Tests
  • GitHub Check: Integration Tests
🔇 Additional comments (71)
backend/tests/unit/core/test_csrf.py (2)

32-33: LGTM! Improved exception specificity.

Using CSRFValidationError instead of a generic Exception makes the test more precise and maintainable. This correctly validates that the security service raises the appropriate domain exception when CSRF validation fails for authenticated requests.


3-3: No issues found with the import or exception definition.

The import of CSRFValidationError from app.domain.user.exceptions is correctly implemented. The exception class is properly defined with full type hints and is correctly exported from the module. The test usage with pytest.raises(CSRFValidationError) is appropriate.

backend/tests/load/user_runner.py (1)

5-5: LGTM! Import organization improved.

Consolidating the time import at the top with other standard library imports (line 5, alongside asyncio and random) follows PEP 8 best practices and removes duplication. The import is properly placed and the file structure is clean.

backend/tests/unit/core/metrics/test_execution_and_events_metrics.py (1)

3-3: Import reordering is correct and properly ordered.

Both imports are valid, properly alphabetized (EventMetrics before ExecutionMetrics), and actively used throughout the test file. No functional changes—this is purely cosmetic.

Regarding the pipeline MyPy failure: this file only contains import reordering and cannot be the source of the type error. The failure likely originates from other files with substantive type changes. To identify the actual issue, run MyPy to see the specific error output rather than assuming the source.

backend/tests/integration/services/notifications/test_notification_service.py (3)

6-6: LGTM: Import reorganization improves structure.

The relocation of the AsyncContainer import aligns with import organization best practices.


17-24: LGTM: Multi-line formatting enhances readability.

The reformatted instantiation improves code readability and follows Python best practices (trailing comma for multi-line calls). All required fields are correctly provided and match the DomainNotificationCreate model definition.


1-1: Verify MyPy failure is addressed.

The pipeline shows a MyPy type checking failure. While this file contains only formatting changes (unlikely to cause type errors), ensure the repository-wide MyPy errors are resolved before merging.

backend/tests/integration/idempotency/test_decorator_idempotent.py (2)

7-8: LGTM: Import reorganization is cosmetic.

The relocation of the AsyncContainer import is a stylistic change with no impact on functionality.


1-1: MyPy type checking verification inconclusive.

The original review flagged a MyPy failure, but the sandbox environment cannot execute the type checker to confirm or identify specific errors. Manual verification is required by running MyPy locally to determine if type issues actually exist in this file.

backend/tests/integration/events/test_schema_registry_roundtrip.py (1)

6-7: Import reordering looks good.

The import organization is correct and follows Python conventions. Recent commits have already addressed MyPy type checking, and the file has proper type annotations throughout.

backend/tests/unit/core/metrics/test_metrics_classes.py (2)

14-14: LGTM! Import correctly added for enum usage.

The import of ExecutionStatus is properly placed and necessary for the test on line 65, aligning with the PR's objective to standardize enum usage.


65-65: No action needed. The usage of ExecutionStatus.QUEUED on line 65 is correct and properly typed. The enum member exists, the method signature matches the call, and the code aligns with the PR's enum standardization goal of passing enum members directly without .value.
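
The `.value`-free style relies on the enum being a str subclass; a sketch of the assumed StringEnum shape (the real class lives in app/core/utils.py):

```python
from enum import Enum


class StringEnum(str, Enum):
    """Assumed shape of the repo's StringEnum: members are str subclasses."""


class ExecutionStatus(StringEnum):
    QUEUED = "queued"
    RUNNING = "running"


# Members compare equal to their raw string values, so call sites can pass
# the member directly instead of casting with `.value`.
assert ExecutionStatus.QUEUED == "queued"
print(ExecutionStatus.QUEUED.value)  # queued
```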

backend/tests/unit/core/test_utils.py (1)

2-2: The import of Request from starlette.requests on line 2 is necessary and properly used. It's referenced as the return type in the make_request() function on line 16 and instantiated on line 25. The function has complete type annotations that align with the strict MyPy configuration. No type-related issues are apparent in this file.

Likely an incorrect or invalid review comment.

backend/tests/unit/services/saga/test_saga_orchestrator_unit.py (3)

15-15: Imports are correctly present and used.

The imports on lines 15 and 23 are both necessary and properly utilized in the test code. Note that the AI summary incorrectly states "Removed an unused import: Field from pydantic," but Field is clearly imported on line 23 and actively used on lines 40 and 43 for configuring Pydantic model fields in _FakeEvent.

Also applies to: 23-23


67-69: LGTM: Formatting improvement.

The method signature reformatting improves readability without altering the interface or behavior. The signature correctly matches the parent UnifiedProducer.produce method.


1-170: Manual MyPy verification required—repository access unavailable.

Unable to verify the MyPy type checking failure due to repository clone failure. The original review comment requests investigation of a reported MyPy exit code 1, but without access to the codebase, configuration files, and broader migration context, the specific errors cannot be identified.

To proceed, manually run MyPy on the backend directory with strict mode enabled:

cd backend && mypy --config-file pyproject.toml --strict .

Then identify whether errors originate from this test file or from broader Pydantic v2 migration changes in related domain/event models and repositories.

backend/tests/load/cli.py (3)

21-22: LGTM: Improved readability.

The f-string split improves code readability without changing functionality.


50-53: LGTM: Improved readability.

The f-string split improves code readability without changing functionality.


93-93: No issues found. The generate_plots field is explicitly defined in LoadConfig (backend/tests/load/config.py, line 17) as generate_plots: bool = False. The direct assignment cfg.generate_plots = bool(args.plots) is valid and will pass MyPy type checking since the field is properly declared on the Pydantic BaseSettings model.

backend/tests/e2e/test_resource_cleaner_orphan.py (1)

2-8: Imports cleanup looks good and follows the repo's standard pattern. No issues found.

The kubernetes imports in lines 6–7 match the pattern used consistently throughout the codebase (pod_builder.py, worker.py, monitor.py, etc.). ResourceDict is a simple type alias (dict[str, list[str]]), so typed access via .get() has no typing issues. No mypy strict configuration was found in the repo, so the speculative concern about untyped kubernetes imports under --strict does not apply here.

backend/tests/load/monkey_runner.py (3)

7-8: LGTM: Import additions are correct.

The time module import is properly added to support the new deadline-based execution bounds.


87-87: LGTM: Time-bounded loop is correctly implemented.

The deadline check properly bounds the swarm execution duration. Using time.time() is acceptable here as it's non-blocking.


70-70: The deadline calculation is correct and properly typed. The duration_seconds field in LoadConfig is explicitly annotated as int, and time.time() + max(1, cfg.duration_seconds) correctly produces a float. No type issues or MyPy failures would occur with this code. If you want to improve readability, you may optionally add an explicit type annotation for deadline: float = ..., but this is not required.
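
The deadline pattern described above, as a self-contained sketch (the function name and the 0.05 s sleep are illustrative):

```python
import time


def run_swarm(duration_seconds: int) -> int:
    """Loop until the deadline; the duration is clamped to at least one second."""
    deadline: float = time.time() + max(1, duration_seconds)
    iterations = 0
    while time.time() < deadline:
        iterations += 1
        time.sleep(0.05)  # stand-in for one swarm action
    return iterations


assert run_swarm(1) >= 1
```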

backend/tests/integration/conftest.py (2)

10-19: Verify fixture type compatibility.

The fixture implementation looks correct, and the documented cleanup strategy (pre-test only to avoid event loop issues) is well-justified.

However, ensure that:

  1. The db fixture (defined elsewhere, likely in a parent conftest.py) returns a Database type
  2. The redis_client fixture returns redis.Redis (or redis.asyncio.Redis)
  3. The cleanup_db_and_redis function signature matches these parameter types

5-6: Code changes look correct and properly typed.

The Database type is correctly defined as a type alias in app/core/database_context.py (line 11: type Database = AsyncDatabase[MongoDocument]) and is properly imported and used for type annotations in the fixture. The function signature in _cleanup matches cleanup_db_and_redis which also uses the same Database type parameter.

backend/tests/e2e/conftest.py (2)

10-18: Verify type compatibility between fixture parameters and cleanup function.

The fixture correctly uses pytest_asyncio.fixture for async cleanup, and the logic (pre-test cleanup only, no post-test) is well-documented. However, ensure the type annotations Database and redis.Redis match what cleanup_db_and_redis expects.

This verification depends on the MyPy script above. If the cleanup function expects different types (e.g., redis.asyncio.Redis explicitly, or a different Database type), the annotations here must be adjusted accordingly.


6-7: No issues found with the import or function usage.

The imported cleanup_db_and_redis function signature in backend/tests/helpers/cleanup.py matches perfectly with how it's called in the fixture:

  • Function: async def cleanup_db_and_redis(db: Database, redis_client: redis.Redis) -> None:
  • Call: await cleanup_db_and_redis(db, redis_client)

Both files consistently use import redis.asyncio as redis, so the redis.Redis type annotation refers to the same type in both locations. The async/await usage is correct, and the import path resolves properly.

backend/tests/unit/schemas_pydantic/test_events_schemas.py (2)

3-3: LGTM: Import ordering follows conventions.

The reordering of imports to place SortOrder before EventFilterRequest follows alphabetical module ordering conventions and is logically sound given that EventFilterRequest depends on SortOrder.


1-1: Investigate the MyPy type checking failure.

The pipeline reports that MyPy exited with code 1, but the detailed error output is not provided. Given that this PR involves a major migration to Pydantic v2 with extensive type changes, it's critical to resolve all type checking errors before merging.

Please provide the full MyPy error output to identify which files or type annotations are causing the failure.

backend/tests/integration/services/admin/test_admin_user_service.py (1)

20-20: Likely an incorrect or invalid review comment.

backend/tests/e2e/test_execution_routes.py (3)

93-93: No issues found. The ExecutionResult schema has status correctly typed as ExecutionStatus (imported as ExecutionStatusEnum in the test), which is a StringEnum. Pydantic automatically deserializes the JSON response to enum members, so the assertion comparing against list(ExecutionStatusEnum) works correctly.
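
The enum coercion this assertion depends on can be sketched independently of the repo's schemas (model and member names here are assumptions):

```python
from enum import Enum

from pydantic import BaseModel


class ExecutionStatus(str, Enum):
    QUEUED = "queued"
    COMPLETED = "completed"


class ExecutionResult(BaseModel):
    status: ExecutionStatus


# model_validate coerces the raw JSON string into the enum member, so tests
# can compare the field against members of the enum directly.
result = ExecutionResult.model_validate({"status": "queued"})
assert result.status is ExecutionStatus.QUEUED
assert result.status in list(ExecutionStatus)
```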


11-11: No issues found.

The AsyncClient import from httpx is correct and fully consistent with the fixture definitions in backend/tests/conftest.py, which yield httpx.AsyncClient objects. All test methods properly annotate their parameters with the imported type.


1-1: Manual verification is required to confirm the MyPy failure. Without access to the actual error output, the type annotations in this file appear correct: ExecutionResult.status is properly typed as ExecutionStatus, matching the enum imported and used in the test assertions.

backend/tests/unit/services/pod_monitor/test_event_mapper.py (7)

5-5: LGTM!

Import of EventType enum aligns with the PR objective to standardize enum usage across the codebase.


30-37: LGTM!

Multiline formatting improves readability of PodContext construction.


40-52: LGTM!

Well-structured test data setup with a complete JSON payload for executor output, enabling proper end-to-end testing of the log parsing path.


66-90: LGTM!

Event type assertions properly migrated to use EventType enum comparisons. The debug output at lines 74-75 is helpful for troubleshooting test failures.


97-113: LGTM!

Pod construction reformatted for readability, and event type assertions correctly use EventType enums.


120-122: LGTM!

Assertion correctly uses EventType.EXECUTION_COMPLETED enum, with helpful comment documenting the expected behavior for DELETED events.


232-232: LGTM!

Assertion correctly uses EventType.EXECUTION_COMPLETED enum.

backend/tests/integration/services/events/test_event_bus.py (1)

1-3: The code is properly typed and all imports are valid. No MyPy failures are evident in this file or its direct dependencies:

  • EventBusEvent and EventBusManager are properly typed in event_bus.py
  • AsyncContainer from dishka is a valid, typed import used consistently across the test suite
  • The eventually helper has proper TypeVar and Awaitable type hints
  • The test function has complete type annotations

The import reordering at line 3 is a benign, non-functional change. If MyPy failures exist in the PR, they originate from other modules, not this file.

Likely an incorrect or invalid review comment.

backend/tests/integration/services/idempotency/test_redis_repository.py (3)

13-13: LGTM! Import reorganization improves structure.

The DuplicateKeyError import is properly placed and correctly used in the duplicate insert test (line 106).


143-152: LGTM! Refactoring improves test readability.

Extracting the statuses tuple to a named variable and using it in the loop makes the test intent clearer while preserving the original behavior.


1-1: Verify MyPy failure by running type checking locally.

This file appears structurally sound—the pymongo.errors.DuplicateKeyError import is used at line 106, json is used at line 117, and IdempotencyRecord is properly defined using pydantic.dataclasses. However, without access to the actual MyPy error output, the root cause cannot be determined. The failure could stem from pymongo type stub issues or errors elsewhere in the codebase from the extensive type migrations in this PR.

Run the following command locally to identify the specific type errors:

cd backend && python3 -m mypy --config-file ../pyproject.toml --strict .
backend/tests/integration/services/replay/test_replay_service.py (1)

5-6: Potential mypy hotspot: svc: ReplayService = await scope.get(ReplayService) if AsyncContainer.get isn’t typed generically

Import change itself is fine, but if mypy is failing with an “incompatible types in assignment” around scope.get(...), you may need a small typing adapter (e.g., Protocol for get() or a typed helper that cast()s the result).
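
One shape the suggested typing adapter could take, using cast() over a container whose get() is untyped (the container here is a stand-in, not dishka's API):

```python
import asyncio
from typing import TypeVar, cast

T = TypeVar("T")


class UntypedScope:
    """Stand-in for a DI container whose get() returns plain object."""

    def __init__(self) -> None:
        self._registry: dict[type, object] = {str: "replay-service"}

    async def get(self, key: type) -> object:
        return self._registry[key]


async def get_typed(scope: UntypedScope, cls: type[T]) -> T:
    # cast() narrows the result for mypy; runtime behaviour is unchanged.
    return cast(T, await scope.get(cls))


svc: str = asyncio.run(get_typed(UntypedScope(), str))
print(svc)  # replay-service
```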

backend/tests/integration/core/test_container.py (1)

4-4: Import is correct; dishka==1.6.0 exports AsyncContainer but may lack type stubs

AsyncContainer is properly part of the public API in dishka==1.6.0. However, the package does not appear to include py.typed in published releases, so mypy-strict support is not guaranteed. If your CI mypy failure is related to this import, verify that your environment has type stubs for dishka or consider disabling strict type checking for dishka imports.

backend/tests/integration/idempotency/test_consumer_idempotent.py (1)

15-16: The import is safe; no Dishka typing concerns apply given mypy configuration.

The import move itself is behavior-neutral and the scope: AsyncContainer fixture is properly typed throughout the codebase. More importantly, the mypy configuration in backend/pyproject.toml explicitly disables import-untyped errors via disable_error_code = ["import-untyped", "import-not-found"], so even if Dishka lacked complete stubs, it would not cause a mypy-strict failure. All usage patterns (await scope.get(Type)) are correctly annotated at call sites. No typing shim is needed.

Likely an incorrect or invalid review comment.
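
The cited configuration, as a pyproject fragment of the shape described (the surrounding options are assumptions; the disabled error codes are quoted from the comment above):

```toml
[tool.mypy]
strict = true
# Keep strict mode but silence missing-stub noise from third-party
# packages such as dishka.
disable_error_code = ["import-untyped", "import-not-found"]
```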

backend/tests/integration/services/sse/test_partitioned_event_router.py (1)

13-14: Import is correct and properly used—no issues.

The addition of from tests.helpers import make_execution_requested_event at line 13 is correct and properly utilized at line 45. The import is cleanly exported from tests.helpers.__init__.py with no type-checking issues. The MyPy errors present in the file relate to missing stubs for the redis library (line 5), which are unrelated to this import change.

backend/tests/unit/services/coordinator/test_queue_manager.py (1)

6-7: LGTM.

The import addition is correct and is properly used by the ev() helper function at line 15. The import resolves from backend/tests/helpers/__init__.py which exports make_execution_requested_event as a factory function for test events.

backend/tests/integration/services/saved_script/test_saved_script_service.py (3)

22-24: LGTM: Formatting improvement.

The multi-line formatting of the update_saved_script call improves readability without changing any logic.


1-4: Verify the import ordering aligns with project conventions.

The AsyncContainer import from dishka (third-party) is now placed after the local app.* imports. Confirm this matches the project's import sorting configuration, as it deviates from standard PEP 8 ordering (stdlib → third-party → local).


1-32: The code in this file is properly typed and includes all necessary type annotations. The recent commit message ("ruff and mypy fixes") indicates type checking issues have already been addressed. No type errors are present in the test file, imports, or related domain models.

backend/app/domain/saga/models.py (2)

10-28: LGTM!

Clean migration to Pydantic BaseModel with appropriate use of Field(default_factory=...) for mutable defaults (lists, dicts) and timestamps. The ConfigDict(from_attributes=True) enables seamless conversion from ORM objects.


56-70: LGTM!

The has_more logic is correctly implemented as a read-only @property, which is the idiomatic Pydantic approach for computed fields that shouldn't be serialized by default.
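
The pattern praised here, Field(default_factory=...) for mutable defaults plus a read-only @property, sketched on a hypothetical paging model:

```python
from pydantic import BaseModel, ConfigDict, Field


class PageResult(BaseModel):
    model_config = ConfigDict(from_attributes=True)  # allows ORM-object conversion

    items: list[str] = Field(default_factory=list)  # safe mutable default
    total: int = 0
    offset: int = 0
    limit: int = 10

    @property
    def has_more(self) -> bool:
        # Computed and read-only; excluded from model_dump() by default.
        return self.offset + self.limit < self.total


page = PageResult(total=25)
assert page.has_more is True
assert "has_more" not in page.model_dump()
```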

backend/tests/unit/services/saga/test_execution_saga_steps.py (3)

206-207: LGTM!

Direct attribute access cps.publish_commands is True is cleaner and more readable than getattr(cps, "publish_commands") is True when the attribute is guaranteed to exist on the typed object.


46-71: LGTM!

The _FakeAllocRepo correctly implements the repository interface and instantiates DomainResourceAllocation with the new Pydantic model signature. The test properly exercises allocation creation and release paths.


1-21: Verify the actual MyPy error source.

All imports in the test file are resolvable: make_execution_requested_event is properly exported from backend/tests/helpers/, and all saga, domain, and infrastructure classes exist with correct type annotations. If MyPy is still reporting errors, run mypy --strict backend/tests/unit/services/saga/test_execution_saga_steps.py to identify whether the issue originates from type mismatches in the imported modules rather than this file's import statements.

backend/tests/integration/notifications/test_notification_sse.py (2)

8-9: LGTM! Clean import organization.

The addition of AsyncContainer for the type annotation on line 16 and the removal of unused imports align well with the type-focused improvements in this PR.


1-1: The import and test fixture are type-safe; the MyPy failure is likely elsewhere.

The uuid4 import from the standard library is properly typed, and the scope parameter is correctly annotated as AsyncContainer in the test function signature. The fixture is defined in conftest.py (line 120-122) with proper type annotations: async def scope(app_container: AsyncContainer) -> AsyncGenerator[AsyncContainer, None].

If the pipeline MyPy failure persists, the issue is not in this file. Check other files in the backend or verify that all type stub dependencies (listed in pyproject.toml dev dependencies) are installed correctly.

backend/tests/integration/services/rate_limit/test_rate_limit_service.py (3)

16-16: LGTM!

The AsyncContainer import from dishka is correctly placed and properly used throughout the test functions for dependency injection.


157-161: LGTM!

The multi-line formatting of RateLimitConfig constructions improves readability and maintains consistency across the test file, especially for the rule with multiple parameters (lines 173-183).

Also applies to: 173-183


1-1: Manual verification of MyPy failure required through CI/CD pipeline.

Code inspection shows proper type annotations throughout the test file and correct usage of RateLimitConfig, RateLimitRule, and AsyncContainer. However, without executing MyPy directly, the specific type errors cannot be confirmed or dismissed. Run MyPy in the CI/CD pipeline to identify the exact errors preventing merge.

backend/tests/integration/idempotency/test_idempotent_handler.py (1)

7-8: The import change is correct and follows established patterns.

The reordering of from dishka import AsyncContainer is syntactically correct and matches the import pattern used consistently throughout the codebase (34+ test files). The import follows PEP 8 standards for grouping third-party imports.

If a MyPy failure was reported in the pipeline, it is likely unrelated to this import change or caused by a pre-existing type information issue with the dishka dependency (which lacks type stubs).

backend/tests/unit/services/idempotency/test_middleware.py (3)

4-5: LGTM: Import additions are correct.

The pytest import was missing (it's used throughout the file), and the IdempotencyStatus import aligns with the PR's migration to domain-based types.


37-39: LGTM: Formatting improvements.

Multi-line parameter lists improve readability with no functional changes.

Also applies to: 50-52, 85-91


1-1: No MyPy failure detected in this code. The test file has proper type annotations throughout (AsyncMock, MagicMock, return type None, IdempotentEventHandler), and all imports are correctly structured. IdempotencyResult is properly defined with typed fields, and the middleware module uses appropriate type hints including Callable, Awaitable, and union types. Without specific MyPy error messages or evidence of an actual pipeline failure, this concern cannot be substantiated.

Likely an incorrect or invalid review comment.

backend/tests/unit/services/pod_monitor/test_monitor.py (1)

11-11: LGTM: DomainEvent migration implemented correctly.

The migration from Event to DomainEvent in the fake repository aligns with the PR's domain model refactoring objectives and maintains consistency with the updated EventRepository interface.

Also applies to: 54-56

backend/app/db/docs/event.py (1)

1-99: LGTM! Event storage restructuring is well-designed.

The migration from payload-wrapped to flattened event storage is implemented correctly:

  • Sparse indexes (Lines 42-43) appropriately handle event-specific fields that only exist on certain event types.
  • extra="allow" configuration (Lines 29, 92) enables flexible storage of event-specific fields without defining them explicitly.
  • Index updates correctly reference top-level fields (execution_id instead of payload.execution_id).
  • EventMetadata import properly sourced from the centralized domain types.

The structure aligns well with Pydantic v2 patterns and the broader DomainEvent migration.

backend/app/domain/events/event_models.py (1)

172-181: LGTM! Good defensive programming with metadata check.

The migration to DomainEvent is implemented correctly across all result classes. The guard on Line 179 (if e.metadata and not e.metadata.service_name.startswith("system-")) is a good defensive check that prevents potential AttributeError if metadata is None.

The overall migration from dataclasses to Pydantic BaseModel (Lines 53-56, 184-187) aligns well with the PR's standardization objectives.

backend/tests/integration/events/test_event_store.py (1)

1-153: LGTM! Import reordering only.

This change only reorders imports (Line 10) without affecting any test logic or functionality. The tests remain comprehensive and well-structured.

backend/tests/integration/events/test_event_dispatcher.py (1)

14-14: No issues with the AsyncContainer import from dishka 1.6.0.

The import reordering is a cosmetic change that doesn't affect type safety. dishka 1.6.0 includes in-source type hints and is compatible with mypy. If there's a mypy pipeline failure, it is likely unrelated to this file or the import reordering—verify the full mypy output from the pipeline to identify the actual source of the error.

@cubic-dev-ai cubic-dev-ai bot left a comment


4 issues found across 10 files (changes from recent commits).

Prompt for AI agents (all issues)

Check if these issues are valid — if so, understand the root cause of each and fix them.


<file name="backend/app/domain/events/typed.py">

<violation number="1" location="backend/app/domain/events/typed.py:280">
P2: Inconsistent timestamp type: `sent_at` should use `datetime` instead of `str` to match the PR's stated goal of "unified timestamps to datetime" and align with other timestamp fields like `cancelled_at`, `stored_at`, and `deleted_at`.</violation>

<violation number="2" location="backend/app/domain/events/typed.py:288">
P2: Inconsistent timestamp type: `delivered_at` should use `datetime` instead of `str` to match the PR's stated goal of "unified timestamps to datetime" and align with other timestamp fields.</violation>

<violation number="3" location="backend/app/domain/events/typed.py:304">
P2: Inconsistent timestamp type: `read_at` should use `datetime` instead of `str` to match the PR's stated goal of "unified timestamps to datetime" and align with other timestamp fields.</violation>

<violation number="4" location="backend/app/domain/events/typed.py:311">
P2: Inconsistent timestamp type: `clicked_at` should use `datetime` instead of `str` to match the PR's stated goal of "unified timestamps to datetime" and align with other timestamp fields.</violation>
</file>

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 6

🤖 Fix all issues with AI agents
In @backend/app/db/repositories/admin/admin_events_repository.py:
- Around line 178-185: archive_event currently passes event.model_dump() which
can include metadata=None and fails Pydantic validation because
EventArchiveDocument.metadata is required; update async def archive_event so it
guards against missing metadata (check if event.metadata is None) and either
raise a clear error (e.g., ValueError) or populate a default/normalized metadata
value before building EventArchiveDocument, then construct the
EventArchiveDocument without a None metadata (or use
event.model_dump(exclude_none=True) after ensuring metadata is present) and
proceed with insert().
- Around line 59-62: The loop mapping DB docs to domain events can crash the
whole response if domain_event_adapter.validate_python raises a ValidationError;
update the mapping in the browse path to process each doc in a try/except,
catching the ValidationError (or generic Exception if ValidationError type isn't
imported), log a warning with the document id and error, and skip/continue so a
partial list is returned to build EventBrowseResult(events=..., total=...,
skip=..., limit=...); apply the same defensive pattern to the
single-event/detail path that calls domain_event_adapter.validate_python (lines
referenced near 69-70), returning either a lightweight fallback object or a
controlled error response for that single item while ensuring the endpoint does
not crash for all events.

In @backend/app/domain/events/typed.py:
- Around line 29-41: BaseEvent.metadata is optional but EventDocument.metadata
and EventArchiveDocument.metadata are required, causing Pydantic validation
failures when archiving events with metadata=None; update the persistence models
to match domain semantics by making EventDocument.metadata and
EventArchiveDocument.metadata optional (EventMetadata | None), and ensure
ArchivedEvent's metadata type aligns as well, plus adjust archive_event() to
handle None metadata when constructing those documents.
- Around line 15-41: The timestamp fields on the notification events are
incorrectly typed as str; update NotificationSentEvent.sent_at,
NotificationDeliveredEvent.delivered_at, NotificationReadEvent.read_at, and
NotificationClickedEvent.clicked_at to be datetime and give them timezone-aware
defaults consistent with BaseEvent (use Field(default_factory=lambda:
datetime.now(timezone.utc)) or equivalent), ensuring you import datetime and
timezone if needed and match the existing BaseEvent timestamp pattern.

In @backend/app/infrastructure/kafka/events/notification.py:
- Around line 66-70: The Kafka-layer NotificationPreferencesUpdatedEvent
duplicates the domain event with an inconsistent required changed_fields; either
consolidate by importing and reusing the domain
NotificationPreferencesUpdatedEvent type instead of redefining it here, or make
the Kafka definition match the domain by giving
NotificationPreferencesUpdatedEvent.changed_fields a default (use
Field(default_factory=list)) so both layers have the same optional default and
avoid runtime errors and maintenance drift.

In @backend/tests/unit/domain/events/test_event_schema_coverage.py:
- Around line 165-170: The test_event_type_count_sanity currently enforces a
hard upper bound on len(EventType) (<= 100) which can cause churn as event types
grow; remove or relax that upper bound by either deleting the upper-bound
assertion or replacing it with a configurable guard (e.g., use a MAX_EVENT_TYPES
constant or an environment-driven limit) and update the assertion to only
enforce the lower bound (assert count >= 50) or assert count <= MAX_EVENT_TYPES
with MAX_EVENT_TYPES clearly documented so future growth is intentional; locate
the test_event_type_count_sanity function and modify the assertions accordingly.
🧹 Nitpick comments (4)
unlike.md (1)

1-90: Fix markdown formatting issues.

The documentation content is valuable for tracking technical debt, but multiple markdown linting issues affect readability:

  • Unordered list items should have 0 spaces of indentation (currently 2)
  • Spaces inside emphasis markers on lines 65, 67
📝 Proposed markdown formatting fixes
   Architecture & Design
 
-  4. Optional aggregate_id on BaseEvent (app/infrastructure/kafka/events/base.py:20)
-  aggregate_id: str | None = None
-  - For proper event sourcing, aggregate_id should be required for domain events
-  - Currently requires manual setting everywhere (easy to forget, as test helper showed)
+4. Optional aggregate_id on BaseEvent (app/infrastructure/kafka/events/base.py:20)
+aggregate_id: str | None = None
+- For proper event sourcing, aggregate_id should be required for domain events
+- Currently requires manual setting everywhere (easy to forget, as test helper showed)

Apply similar fixes to all list items throughout the file.

backend/tests/unit/domain/events/test_event_schema_coverage.py (2)

21-52: Tests are valuable, but make adapter-coverage assertion resilient (avoid parsing exception strings).
Relying on str(e).lower() substring checks is likely to break on Pydantic upgrades. Prefer catching pydantic.ValidationError and checking e.errors() for discriminator/union-tag error types.

Also applies to: 89-105
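A sketch of the suggested assertion style, assuming Pydantic v2: inspect the structured `errors()` payload rather than substring-matching the exception message (the model here is hypothetical):

```python
from pydantic import BaseModel, ValidationError

class Probe(BaseModel):
    count: int

try:
    Probe.model_validate({"count": "not-a-number"})
    errors = []
except ValidationError as exc:
    # errors() returns structured dicts with stable "type" and "loc" keys,
    # which survive Pydantic message-wording changes
    errors = exc.errors()
```

Asserting on `errors[0]["loc"]` or `errors[0]["type"]` keeps the test meaningful without depending on human-readable message text.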


107-140: Orphan class scans should be recursive (not just direct __subclasses__()).
If you ever introduce intermediate base classes, these tests will miss them. Consider a small _all_subclasses(cls) helper and use it for both DomainBaseEvent and KafkaBaseEvent scans.
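The suggested recursive helper can be sketched as follows (pure Python, no assumptions beyond the standard `__subclasses__()` behavior):

```python
def all_subclasses(cls: type) -> set[type]:
    """Return direct and transitive subclasses; __subclasses__() is direct-only."""
    result: set[type] = set()
    for sub in cls.__subclasses__():
        result.add(sub)
        result |= all_subclasses(sub)
    return result

# demonstration hierarchy with an intermediate base class
class Base: ...
class Intermediate(Base): ...
class Leaf(Intermediate): ...
```

`Base.__subclasses__()` would miss `Leaf`, while `all_subclasses(Base)` finds it, which is exactly the gap an orphan-class scan needs to close.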

backend/app/domain/events/__init__.py (1)

1-89: Watch import-cost/circular-dep risk from re-exporting every concrete event at package import time.
If app.domain.events is imported in hot paths, consider keeping it lean (export only DomainEvent, BaseEvent, domain_event_adapter, plus query/result models) and importing concrete event classes directly from app.domain.events.typed where needed, or adding a lazy import pattern.

Also applies to: 91-178

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 171074c and b02f92d.

📒 Files selected for processing (10)
  • backend/app/db/docs/event.py
  • backend/app/db/repositories/admin/admin_events_repository.py
  • backend/app/domain/events/__init__.py
  • backend/app/domain/events/typed.py
  • backend/app/events/event_store.py
  • backend/app/infrastructure/kafka/events/__init__.py
  • backend/app/infrastructure/kafka/events/notification.py
  • backend/app/infrastructure/kafka/events/user.py
  • backend/tests/unit/domain/events/test_event_schema_coverage.py
  • unlike.md
🧰 Additional context used
🧬 Code graph analysis (8)
backend/app/infrastructure/kafka/events/user.py (3)
backend/app/domain/events/typed.py (2)
  • UserLoginEvent (225-230)
  • BaseEvent (29-40)
backend/app/domain/enums/kafka.py (1)
  • KafkaTopic (7-53)
backend/app/domain/enums/auth.py (1)
  • LoginMethod (4-10)
backend/app/db/docs/event.py (1)
backend/app/domain/events/typed.py (1)
  • EventMetadata (15-26)
backend/app/events/event_store.py (2)
backend/app/db/docs/event.py (1)
  • EventDocument (13-71)
backend/app/events/schema/schema_registry.py (1)
  • deserialize_json (178-193)
backend/app/infrastructure/kafka/events/notification.py (2)
backend/app/domain/events/typed.py (2)
  • NotificationPreferencesUpdatedEvent (315-318)
  • BaseEvent (29-40)
backend/app/domain/enums/kafka.py (1)
  • KafkaTopic (7-53)
backend/app/domain/events/__init__.py (5)
backend/app/domain/events/event_models.py (7)
  • EventBrowseResult (95-101)
  • EventDetail (105-110)
  • EventExportRow (184-197)
  • EventFilter (53-66)
  • EventProjection (141-150)
  • EventQuery (70-80)
  • EventSummary (44-50)
backend/app/schemas_pydantic/admin_events.py (1)
  • EventFilter (20-30)
backend/app/schemas_pydantic/events.py (2)
  • EventProjection (241-271)
  • EventQuery (183-208)
backend/app/domain/events/typed.py (8)
  • AllocateResourcesCommandEvent (407-411)
  • ArchivedEvent (520-536)
  • AuthFailedEvent (466-470)
  • BaseEvent (29-40)
  • CreatePodCommandEvent (380-395)
  • DeletePodCommandEvent (398-404)
  • EventMetadata (15-26)
  • ResourceLimitExceededEvent (476-481)
backend/app/infrastructure/kafka/events/base.py (1)
  • BaseEvent (13-37)
backend/app/infrastructure/kafka/events/__init__.py (3)
backend/app/domain/events/typed.py (2)
  • NotificationPreferencesUpdatedEvent (315-318)
  • UserLoginEvent (225-230)
backend/app/infrastructure/kafka/events/notification.py (1)
  • NotificationPreferencesUpdatedEvent (66-70)
backend/app/infrastructure/kafka/events/user.py (1)
  • UserLoginEvent (40-46)
backend/tests/unit/domain/events/test_event_schema_coverage.py (2)
backend/app/domain/events/typed.py (1)
  • BaseEvent (29-40)
backend/app/events/schema/schema_registry.py (1)
  • _get_event_type_to_class_mapping (41-50)
backend/app/domain/events/typed.py (4)
backend/app/domain/enums/auth.py (1)
  • LoginMethod (4-10)
backend/app/domain/enums/common.py (1)
  • Environment (27-33)
backend/app/domain/enums/storage.py (1)
  • StorageType (16-22)
backend/app/domain/execution/models.py (1)
  • ResourceUsageDomain (13-19)
🪛 LanguageTool
unlike.md

[style] ~1-~1: Consider a different adjective to strengthen your wording.
Context: ... on everything I've observed, here's my deep analysis of codebase issues: Archite...

(DEEP_PROFOUND)

🪛 markdownlint-cli2 (0.18.1)
unlike.md

Lines 7-8, 13-15, 19-20, 28-30, 33-34, 39, 41-42, 47-49, 55-56, 59-60, 68-69, 74-75, 79-80, 85-86, 89-90: Unordered list indentation (Expected: 0; Actual: 2) (MD007, ul-indent)

Lines 65, 67: Spaces inside emphasis markers (MD037, no-space-in-emphasis)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: cubic · AI code reviewer
  • GitHub Check: Integration Tests
  • GitHub Check: E2E Tests
🔇 Additional comments (15)
backend/app/infrastructure/kafka/events/__init__.py (1)

1-137: LGTM!

The export updates for NotificationPreferencesUpdatedEvent and UserLoginEvent follow the existing module pattern and are properly organized.

backend/app/events/event_store.py (9)

46-72: LGTM!

The refactoring to use event.model_dump(exclude_none=True) simplifies event storage and aligns with the PR's goal of flattening event data to top-level fields.
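The `model_dump(exclude_none=True)` behavior can be illustrated with a minimal sketch (Pydantic v2 assumed; the model is hypothetical, not the PR's actual event class):

```python
from typing import Optional
from pydantic import BaseModel

class StoredEvent(BaseModel):
    event_id: str
    execution_id: Optional[str] = None
    error: Optional[str] = None

# unset optional fields are dropped, so sparse event-specific fields
# are stored only when present — matching the flattened-document design
doc = StoredEvent(event_id="e1", execution_id="x1").model_dump(exclude_none=True)
```

This is what lets sparse Mongo indexes on fields like `execution_id` stay small: documents without the field simply omit it.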


74-109: LGTM!

Batch storage correctly applies the same model_dump(exclude_none=True) pattern for consistency.


111-121: LGTM!

Deserialization using doc.model_dump() is consistent with the new event structure.


123-147: LGTM!

Deserialization pattern correctly applied in list comprehension.


149-164: LGTM!

The query correctly searches for execution_id at the top level (instead of payload.execution_id), aligning with the flattened event structure. The aggregate_id fallback ensures backward compatibility.


166-186: LGTM!

Deserialization correctly applied.


188-207: LGTM!

Deserialization correctly applied.


209-220: LGTM!

Deserialization correctly applied.


222-251: LGTM!

Deserialization correctly applied in the replay flow.

backend/app/infrastructure/kafka/events/user.py (1)

40-47: Verify the architectural intent behind domain vs. infrastructure event separation.

UserLoginEvent is defined in both backend/app/domain/events/typed.py (line 225) and backend/app/infrastructure/kafka/events/user.py (line 40), but they are not identical—they extend different base classes and serve different purposes:

  • Domain version: Extends Pydantic BaseEvent, used for MongoDB persistence and event querying via domain_event_adapter
  • Infrastructure version: Extends AvroBase, includes topic: ClassVar[KafkaTopic] for Kafka serialization

While architectural separation (domain vs. infrastructure) appears intentional, the core event fields (user_id, login_method, ip_address, user_agent) are duplicated. Consider whether this duplication could be reduced through shared field definitions or composition without coupling the domain and infrastructure layers.

UserLoggedInEvent (lines 49-55) has identical fields to UserLoginEvent but distinct event_type values—confirm this dual-event pattern is necessary for your business logic.

Likely an incorrect or invalid review comment.

backend/app/db/repositories/admin/admin_events_repository.py (2)

187-206: ReplaySession model_dump/model_validate migration looks consistent.
Using model_dump() for persistence and model_validate(..., from_attributes=True) for hydration is a clean, consistent move.

Also applies to: 271-272


163-176: CSV datetime serialization is properly handled.

The timestamp field is serialized using Pydantic's model_dump(mode="json"), which automatically converts datetime objects to ISO 8601 format (e.g., 2024-01-15T10:30:45.123456+00:00). This approach is consistent and correct; the code does not use Python's default str(datetime) formatting.

backend/app/db/docs/event.py (1)

74-96: Archive docs: extra="allow" matches the “store event-specific fields at top-level” approach.
This aligns well with EventArchiveDocument(**event.model_dump(), ...) usage (once metadata-nullability is resolved).

backend/app/domain/events/typed.py (1)

541-610: Discriminated union + TypeAdapter approach is solid; verify discriminator behavior on enums for Pydantic 2.9.2.
The overall DomainEvent = Annotated[..., Discriminator("event_type")] + TypeAdapter(DomainEvent) design is good—just ensure the discriminator accepts EventType values as stored/serialized (enum vs string) in all ingestion paths.
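The shape of that design can be sketched with toy event classes (Pydantic >= 2.5 assumed for the `Discriminator` import; event names and fields are illustrative):

```python
from typing import Annotated, Literal, Union
from pydantic import BaseModel, Discriminator, TypeAdapter

class CreatedEvent(BaseModel):
    event_type: Literal["created"] = "created"
    entity_id: str

class DeletedEvent(BaseModel):
    event_type: Literal["deleted"] = "deleted"
    entity_id: str

# the discriminator routes raw dicts to the right concrete class
AnyEvent = Annotated[Union[CreatedEvent, DeletedEvent], Discriminator("event_type")]
adapter = TypeAdapter(AnyEvent)

event = adapter.validate_python({"event_type": "deleted", "entity_id": "42"})
```

If the real `event_type` is a str-based enum rather than a plain string, it is worth checking that both the enum member and its string value validate through the adapter, since stored documents may carry either form.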

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 4

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
backend/app/events/event_store.py (1)

54-60: Span attribute likely wrong after flattening: EventAttributes.EXECUTION_ID uses aggregate_id.
Given the move to top-level execution_id, consider preferring event.execution_id when present (fallback to aggregate_id).

Proposed fix
             add_span_attributes(
                 **{
                     str(EventAttributes.EVENT_TYPE): str(event.event_type),
                     str(EventAttributes.EVENT_ID): event.event_id,
-                    str(EventAttributes.EXECUTION_ID): event.aggregate_id or "",
+                    str(EventAttributes.EXECUTION_ID): getattr(event, "execution_id", None) or event.aggregate_id or "",
                 }
             )
🤖 Fix all issues with AI agents
In @backend/app/domain/events/typed.py:
- Around line 264-318: The timestamp fields in notification event models use str
but must be standardized to datetime: update the type annotations for sent_at in
NotificationSentEvent, delivered_at in NotificationDeliveredEvent, read_at in
NotificationReadEvent, and clicked_at in NotificationClickedEvent from str to
datetime (import datetime from the datetime module) and adjust any related
(de)serialization or pydantic validators elsewhere if present to accept/produce
datetime objects instead of ISO strings so these event classes align with the
standardized timestamp convention.

In @backend/app/events/event_store.py:
- Around line 80-84: The EventDocument construction in store_batch is raising a
double-kwarg TypeError because e.model_dump(exclude_none=True) includes
stored_at and ttl_expires_at which you also pass explicitly; update the dump to
exclude those keys (e.g., exclude stored_at and ttl_expires_at from e.model_dump
or remove them from the dict before unpacking) so you only pass stored_at=now
and ttl_expires_at=ttl when creating EventDocument instances.

In @backend/app/infrastructure/kafka/events/user.py:
- Around line 40-47: UserLoginEvent is defined but not mapped or used
(EventType.USER_LOGIN appears in event_store.py) which will break
deserialization; either remove the unused UserLoginEvent class and the
EventType.USER_LOGIN enum reference in event_store.py (and ensure only
UserLoggedInEvent remains), or add the schema registry mapping by adding
EventType.USER_LOGIN: UserLoginEvent to the mappings dictionary in
infrastructure/kafka/mappings.py so the deserializer can instantiate it; also
confirm the PII fields ip_address and user_agent retention settings (7 days in
Kafka, 90 days in event store) meet security/privacy requirements and update
retention or field handling accordingly.

In @unlike.md:
- Around line 1-90: The markdown in unlike.md violates markdownlint rules
(MD007/MD037); fix by converting the scratch notes into proper Markdown
structure (use top-level headings, consistent list indentation, blank lines
around fenced/code blocks and around headings, and correct emphasis spacing) or
add unlike.md to the markdownlint ignore list in CI config; reference the
offending rules MD007 and MD037 when updating lint config or commit message so
CI stops failing.
🧹 Nitpick comments (4)
backend/app/events/event_store.py (1)

117-117: Filter document-specific fields before deserializing events.

doc.model_dump() passes EventDocument's internal fields (stored_at, ttl_expires_at, execution_id, _id) to deserialize_json, which passes them to model_validate(). While event models silently ignore these extra fields (no extra="forbid"), it's cleaner to exclude them explicitly:

event = self.schema_registry.deserialize_json(
    doc.model_dump(exclude={"stored_at", "ttl_expires_at", "execution_id", "_id"})
)

This prevents unnecessary fields from being processed and makes the intent clearer.

Applies to lines 117, 143, 160, 182, 203, 216, 240.

backend/app/infrastructure/kafka/events/notification.py (1)

66-70: Align changed_fields default with domain typed event (avoid required field drift).

In backend/app/domain/events/typed.py, NotificationPreferencesUpdatedEvent.changed_fields uses Field(default_factory=list), making it optional with an empty default. The Kafka version here requires the field. Adding a matching default improves consistency and simplifies instantiation in tests and producers.

Proposed change
 class NotificationPreferencesUpdatedEvent(BaseEvent):
     event_type: Literal[EventType.NOTIFICATION_PREFERENCES_UPDATED] = EventType.NOTIFICATION_PREFERENCES_UPDATED
     topic: ClassVar[KafkaTopic] = KafkaTopic.NOTIFICATION_EVENTS
     user_id: str
-    changed_fields: list[str]
+    changed_fields: list[str] = []
backend/tests/unit/domain/events/test_event_schema_coverage.py (2)

21-51: Suggest simplifying union extraction logic.

The function get_domain_event_classes() has complex nested logic to extract union types with multiple fallbacks (lines 26-34). While defensive, this complexity may indicate fragility in how the DomainEvent union is defined or accessed.

Consider:

  1. Documenting why the nested extraction is necessary (Python version differences?)
  2. Testing this logic across Python 3.10+ to ensure it works consistently
  3. Potentially simplifying by relying solely on the DomainBaseEvent.__subclasses__() fallback if the union extraction proves unreliable
📝 Consider adding documentation
 def get_domain_event_classes() -> dict[EventType, type]:
-    """Extract EventType -> class mapping from DomainEvent union."""
+    """Extract EventType -> class mapping from DomainEvent union.
+    
+    Uses multiple extraction strategies due to Python version differences:
+    1. Try to extract from DomainEvent union args (Python 3.10+ syntax)
+    2. Fall back to iterating all DomainBaseEvent subclasses
+    
+    This ensures compatibility across Python versions and handles edge cases.
+    """
     mapping: dict[EventType, type] = {}

171-175: Update hardcoded EventType count range if needed.

The test expects 50-100 EventTypes. Based on the PR adding 53+ event schemas, this range appears reasonable now, but may need adjustment as the system grows.

💡 Consider making the range configurable
+# Expected EventType count range (update as system grows)
+MIN_EVENT_TYPES = 50
+MAX_EVENT_TYPES = 100
+
 def test_event_type_count_sanity(self) -> None:
     """Sanity check: we should have a reasonable number of event types."""
     count = len(EventType)
-    assert count >= 50, f"Expected at least 50 EventTypes, got {count}"
-    assert count <= 100, f"Expected at most 100 EventTypes, got {count} - is this intentional?"
+    assert count >= MIN_EVENT_TYPES, f"Expected at least {MIN_EVENT_TYPES} EventTypes, got {count}"
+    assert count <= MAX_EVENT_TYPES, f"Expected at most {MAX_EVENT_TYPES} EventTypes, got {count} - is this intentional?"
📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 171074c and 1d58600.

📒 Files selected for processing (10)
  • backend/app/db/docs/event.py
  • backend/app/db/repositories/admin/admin_events_repository.py
  • backend/app/domain/events/__init__.py
  • backend/app/domain/events/typed.py
  • backend/app/events/event_store.py
  • backend/app/infrastructure/kafka/events/__init__.py
  • backend/app/infrastructure/kafka/events/notification.py
  • backend/app/infrastructure/kafka/events/user.py
  • backend/tests/unit/domain/events/test_event_schema_coverage.py
  • unlike.md
🧰 Additional context used
🧬 Code graph analysis (7)
backend/tests/unit/domain/events/test_event_schema_coverage.py (2)
backend/app/domain/events/typed.py (1)
  • BaseEvent (29-40)
backend/app/events/schema/schema_registry.py (1)
  • _get_event_type_to_class_mapping (41-50)
backend/app/infrastructure/kafka/events/user.py (3)
backend/app/domain/events/typed.py (2)
  • UserLoginEvent (225-230)
  • BaseEvent (29-40)
backend/app/domain/enums/kafka.py (1)
  • KafkaTopic (7-53)
backend/app/domain/enums/auth.py (1)
  • LoginMethod (4-10)
backend/app/infrastructure/kafka/events/notification.py (2)
backend/app/domain/events/typed.py (2)
  • NotificationPreferencesUpdatedEvent (315-318)
  • BaseEvent (29-40)
backend/app/domain/enums/kafka.py (1)
  • KafkaTopic (7-53)
backend/app/db/docs/event.py (2)
backend/app/domain/events/typed.py (1)
  • EventMetadata (15-26)
backend/app/services/coordinator/queue_manager.py (1)
  • execution_id (30-31)
backend/app/events/event_store.py (3)
backend/app/db/docs/event.py (1)
  • EventDocument (13-71)
backend/app/events/schema/schema_registry.py (1)
  • deserialize_json (178-193)
backend/app/domain/events/event_models.py (1)
  • SortDirection (20-22)
backend/app/infrastructure/kafka/events/__init__.py (3)
backend/app/domain/events/typed.py (2)
  • NotificationPreferencesUpdatedEvent (315-318)
  • UserLoginEvent (225-230)
backend/app/infrastructure/kafka/events/notification.py (1)
  • NotificationPreferencesUpdatedEvent (66-70)
backend/app/infrastructure/kafka/events/user.py (1)
  • UserLoginEvent (40-46)
backend/app/domain/events/typed.py (4)
backend/app/domain/enums/auth.py (1)
  • LoginMethod (4-10)
backend/app/domain/enums/common.py (1)
  • Environment (27-33)
backend/app/domain/enums/storage.py (1)
  • StorageType (16-22)
backend/app/domain/execution/models.py (1)
  • ResourceUsageDomain (13-19)
🪛 LanguageTool
unlike.md

[style] ~1-~1: Consider a different adjective to strengthen your wording.
Context: ... on everything I've observed, here's my deep analysis of codebase issues: Archite...

(DEEP_PROFOUND)

🪛 markdownlint-cli2 (0.18.1)
unlike.md

7-8, 13-15, 19-20, 28-30, 33-34, 39, 41-42, 47-49, 55-56, 59-60, 68-69, 74-75, 79-80, 85-86, 89-90: Unordered list indentation; Expected: 0; Actual: 2

(MD007, ul-indent)


65, 67: Spaces inside emphasis markers

(MD037, no-space-in-emphasis)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
  • GitHub Check: E2E Tests
  • GitHub Check: Integration Tests
🔇 Additional comments (32)
backend/app/infrastructure/kafka/events/__init__.py (1)

14-22: Exports look consistent (imports + __all__).
Adding UserLoginEvent and NotificationPreferencesUpdatedEvent to both the import surface and __all__ is coherent.

Also applies to: 59-67, 92-107

backend/app/events/event_store.py (2)

155-155: $or query for execution retrieval: good for backward-compat, but confirm index coverage.
This keeps older aggregate_id == execution_id data discoverable while supporting the new top-level execution_id. Just make sure Mongo has indexes that keep this query from devolving into scans under load.
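A minimal sketch of the backward-compatible lookup described above; the function and field names here are illustrative assumptions, not the repository's actual API.

```python
# Hypothetical sketch: match both the new top-level execution_id field
# and legacy documents that stored the id in aggregate_id.
def build_execution_events_query(execution_id: str) -> dict:
    return {
        "$or": [
            {"execution_id": execution_id},
            {"aggregate_id": execution_id},
        ]
    }

# MongoDB can satisfy an $or by merging per-branch index scans, so each
# branch should be backed by its own index (execution_id sparse, since
# only execution events carry that field).
query = build_execution_events_query("exec-123")
```

If either branch lacks an index, the planner falls back to a collection scan for that branch, which is the degradation the comment warns about.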


49-52: No fix needed—BaseEvent does not include stored_at or ttl_expires_at fields.

The code is safe as-is. event.model_dump(exclude_none=True) only returns BaseEvent's own fields (event_id, event_type, event_version, timestamp, aggregate_id, metadata), so explicitly passing stored_at and ttl_expires_at as kwargs does not create duplicate arguments.

Likely an incorrect or invalid review comment.

backend/app/db/repositories/admin/admin_events_repository.py (10)

60-60: LGTM! Correctly uses domain_event_adapter for event deserialization.

The use of domain_event_adapter.validate_python(d, from_attributes=True) properly deserializes EventDocument instances into typed DomainEvent variants, leveraging Pydantic's discriminated union based on event_type.


69-69: LGTM! Consistent domain_event_adapter usage.

The adapter correctly deserializes the EventDocument into a typed DomainEvent for event detail retrieval.


178-185: LGTM! Archive flow correctly uses DomainEvent and model_dump.

The signature change from Event to DomainEvent and the use of event.model_dump() align with the Pydantic migration. The archive document creation correctly unpacks all event fields and adds deletion metadata.


188-190: LGTM! Replay session creation uses Pydantic serialization.

The use of session.model_dump() correctly serializes the ReplaySessionState for document insertion.


196-196: LGTM! Replay session retrieval uses Pydantic deserialization.

The use of ReplaySessionState.model_validate(doc, from_attributes=True) correctly deserializes the document into a domain model.


199-199: LGTM! Update logic uses Pydantic serialization with exclude_none.

The use of updates.model_dump(exclude_none=True) correctly serializes only non-None fields for partial updates.


271-271: LGTM! Session conversion uses Pydantic deserialization.

The use of ReplaySessionState.model_validate(doc, from_attributes=True) correctly converts the document to the domain model.


251-251: No action required—execution_id is properly defined as a top-level field on all execution-related events.

Verification confirms all Execution*Event classes in typed.py (ExecutionRequestedEvent, ExecutionAcceptedEvent, ExecutionQueuedEvent, ExecutionStartedEvent, ExecutionRunningEvent, ExecutionCompletedEvent, ExecutionFailedEvent, ExecutionTimeoutEvent, ExecutionCancelledEvent, and others) define execution_id as a top-level field. The code change at line 251 correctly accesses event.execution_id, and no payload.execution_id references remain in the codebase.


167-167: EventExportRow.timestamp expects datetime objects — verified correct.

EventExportRow is defined with timestamp: datetime in event_models.py. EventDocument also stores timestamp as datetime, so passing doc.timestamp directly is appropriate. CSV serialization uses Pydantic's model_dump(mode="json"), which automatically converts datetime to ISO string format. No changes needed.


259-259: ExecutionDocument.status field is properly defined as ExecutionStatus enum.

The status field exists in ExecutionDocument (backend/app/db/docs/execution.py, line 31) and is typed as ExecutionStatus, a StringEnum with defined values. The field defaults to ExecutionStatus.QUEUED, so the conditional check at line 259 (if exec_doc.status else None) is redundant since the field always has a value.

backend/tests/unit/domain/events/test_event_schema_coverage.py (2)

89-105: LGTM! Adapter coverage test correctly validates discriminator recognition.

The test properly distinguishes between validation errors (acceptable - means type IS recognized) and discriminator errors (failure - means type NOT in union). The error string checks ("no match", "unable to extract") are appropriate for Pydantic's discriminated union error messages.


177-186: LGTM! Naming convention test ensures consistency.

The test correctly enforces lowercase snake_case for EventType values and prohibits spaces/hyphens, which aligns with Python naming conventions and ensures consistency across the codebase.

backend/app/db/docs/event.py (5)

10-10: LGTM! EventMetadata import aligns with typed event system.

Importing EventMetadata from app.domain.events.typed consolidates metadata definitions in the new typed event module, ensuring consistency across domain and storage layers.


14-32: LGTM! EventDocument schema correctly flattens event data.

The removal of the payload field and use of extra="allow" enables event-specific fields to be stored at the top level, simplifying queries. The addition of event_type, event_version, and execution_id as explicit fields provides better type safety and indexing support.
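The flattening can be illustrated with plain dicts; this is a sketch of the shape change only, with illustrative field names, not the repository's migration code.

```python
# Sketch: move event-specific fields from the old payload wrapper to the
# document's top level, which is what extra="allow" enables in storage.
def flatten_event(doc: dict) -> dict:
    flat = {k: v for k, v in doc.items() if k != "payload"}
    flat.update(doc.get("payload") or {})
    return flat

legacy = {
    "event_type": "execution_completed",
    "payload": {"execution_id": "abc", "exit_code": 0},
}
flat = flatten_event(legacy)
```

After flattening, queries can target `execution_id` directly instead of the dotted `payload.execution_id` path.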


44-46: LGTM! Sparse indexes on execution_id and pod_name are appropriate.

Since these fields only exist on specific event types (execution and pod events), sparse indexes correctly avoid indexing null values, optimizing storage and query performance.


60-70: LGTM! Text search index updated to include execution_id.

Adding execution_id to the text search index enables full-text search on execution identifiers, which is useful for admin/debugging scenarios.


75-95: The archiving process handles migration correctly via extra="allow".

EventArchiveDocument uses extra="allow" (same as EventDocument), which provides backward compatibility—any additional fields on archived events are stored and retrieved without error. The archiving mechanism (lines 334-337 in event_repository.py) correctly copies all fields from EventDocument to EventArchiveDocument, so no migration is needed.

However, there is an unrelated architectural concern: the query builders in query_builders.py still reference payload.duration_seconds (lines 117, 120), but the current event storage uses a flattened structure (via extra="allow") rather than a payload wrapper. This query pattern mismatch should be addressed separately to ensure analytics and dashboard queries work correctly.

backend/app/domain/events/typed.py (11)

15-26: LGTM! EventMetadata provides comprehensive event context.

The metadata model includes all necessary fields for event tracking (service info, correlation, user context, environment) with sensible defaults for correlation_id and environment.


29-40: LGTM! BaseEvent establishes consistent event structure.

The base model defines common fields across all events with appropriate defaults (UUIDs for IDs, UTC timestamps, 30-day TTL). The use of from_attributes=True enables compatibility with ORM documents.


46-127: LGTM! Execution event models are comprehensive and well-typed.

The execution lifecycle events (Requested, Accepted, Queued, Started, Running, Completed, Failed, Timeout, Cancelled) cover all necessary states with appropriate fields (execution_id, queue metrics, pod info, exit codes, resource usage, stdout/stderr).


133-187: LGTM! Pod event models cover the full lifecycle.

The pod events (Created, Scheduled, Running, Succeeded, Failed, Terminated, Deleted) properly model Kubernetes pod states with relevant fields (pod_name, node_name, exit codes, statuses, reasons).


193-205: LGTM! Result event models handle storage outcomes.

ResultStoredEvent and ResultFailedEvent appropriately model result persistence with storage metadata (type, path, size) and error information.


211-258: LGTM! User event models cover authentication and lifecycle.

The user events (Settings, Registered, Login, LoggedIn, LoggedOut, Updated, Deleted) appropriately model user actions with relevant context (login methods, changed fields, reasons).


324-418: LGTM! Saga and command event models are well-structured.

The saga events (Started, Completed, Failed, Cancelled, Compensating, Compensated) and command events (CreatePod, DeletePod, AllocateResources, ReleaseResources) appropriately model orchestration workflows with execution context, step tracking, and resource specifications.


424-444: LGTM! Script event models capture sharing and lifecycle.

The script events (Saved, Deleted, Shared) appropriately model script management with user context, metadata, and permissions.


450-514: LGTM! Security, resource, and system event models are comprehensive.

The security events (SecurityViolation, RateLimitExceeded, AuthFailed), resource events (ResourceLimitExceeded, QuotaExceeded), and system events (SystemError, ServiceUnhealthy, ServiceRecovered) provide adequate monitoring and alerting coverage with relevant context.


520-536: LGTM! ArchivedEvent wrapper adds deletion metadata.

The ArchivedEvent model appropriately wraps event data with archive-specific fields (deleted_at, deleted_by, deletion_reason) for audit trails.


541-610: LGTM! DomainEvent union enables type-safe event dispatch.

The discriminated union on event_type with the TypeAdapter provides runtime validation and polymorphic deserialization. This is a well-designed type system for event handling.

Verify that all EventType enum values have corresponding event classes in the union.

The test file backend/tests/unit/domain/events/test_event_schema_coverage.py already validates this coverage, so the verification is built into the test suite.
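For readers unfamiliar with the pattern, here is a minimal sketch of a Pydantic v2 discriminated union with a `TypeAdapter`; the two event classes are simplified stand-ins, not the repository's definitions.

```python
from typing import Annotated, Literal, Union

from pydantic import BaseModel, Field, TypeAdapter

class ExecutionCompletedEvent(BaseModel):
    event_type: Literal["execution_completed"]
    execution_id: str
    exit_code: int = 0

class UserLoginEvent(BaseModel):
    event_type: Literal["user_login"]
    user_id: str

# The discriminator routes raw dicts to the right class by event_type.
DomainEvent = Annotated[
    Union[ExecutionCompletedEvent, UserLoginEvent],
    Field(discriminator="event_type"),
]
domain_event_adapter = TypeAdapter(DomainEvent)

event = domain_event_adapter.validate_python(
    {"event_type": "user_login", "user_id": "u1"}
)
```

A dict whose `event_type` matches no member of the union fails fast with a discriminator error, which is exactly what the coverage test exploits.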

backend/app/domain/events/__init__.py (1)

1-178: LGTM! Public API reorganization consolidates typed events with correct imports across codebase.

The restructuring moves all typed events to app.domain.events.typed and updates __all__ with clear categorization. Verification confirms all 6 files importing from app.domain.events use valid exports, with no stale imports of outdated paths. This improves discoverability and aligns with the migration from legacy Event to DomainEvent.


@cubic-dev-ai cubic-dev-ai bot left a comment


2 issues found across 3 files (changes from recent commits).

Prompt for AI agents (all issues)

Check if these issues are valid — if so, understand the root cause of each and fix them.


<file name="mkdocs.yml">

<violation number="1" location="mkdocs.yml:121">
P2: The new navigation item points to architecture/event-system-design.md, but that document does not exist in the repo, so MkDocs will fail link validation.</violation>
</file>

<file name="docs/architecture/event-system-design.md">

<violation number="1" location="docs/architecture/event-system-design.md:78">
P3: The `DomainEvent` example is missing the comma that separates the union expression from `Discriminator("event_type")`, so the documented snippet is invalid Python and can’t be copied as-is.</violation>
</file>

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.


@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 0

🧹 Nitpick comments (1)
docs/architecture/event-system-design.md (1)

36-38: Reduce repetition in sentence starters.

Three consecutive sentences beginning with "They" reduce readability. Consider rewriting one or more to vary the structure.

 Domain events are internal to the bounded context. They carry whatever information the domain needs, including storage-related fields like `stored_at` and `ttl_expires_at`. These events get stored in MongoDB and replayed during event sourcing operations.

 Integration events cross bounded context boundaries. They flow through Kafka to other services or workers. These must be serializable to Avro, which means they can't contain arbitrary Python objects. Finally, they carry routing information like the `topic` ClassVar.
📜 Review details


📥 Commits

Reviewing files that changed from the base of the PR and between 1d58600 and 7d88571.

📒 Files selected for processing (2)
  • docs/architecture/event-system-design.md
  • mkdocs.yml
✅ Files skipped from review due to trivial changes (1)
  • mkdocs.yml
🧰 Additional context used
🪛 LanguageTool
docs/architecture/event-system-design.md

[style] ~38-~38: Three successive sentences begin with the same word. Consider rewording the sentence or use a thesaurus to find a synonym.
Context: ...can't contain arbitrary Python objects. They carry routing information like the `top...

(ENGLISH_WORD_REPEAT_BEGINNING_RULE)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: cubic · AI code reviewer
  • GitHub Check: Integration Tests
  • GitHub Check: E2E Tests
🔇 Additional comments (2)
docs/architecture/event-system-design.md (2)

1-165: Comprehensive and well-structured architecture documentation.

The documentation clearly explains the three-layer event system design, articulates the rationale for separating domain and Kafka events through DDD concepts, and provides concrete examples of discriminated unions and schema coverage. The mermaid diagrams aid understanding, and the workflow guidance for adding new event types will be valuable for contributors. The table of key files and related docs provides good navigation.


112-112: The documentation build system is already properly configured. The --8<-- syntax at lines 112 and 132 is handled by pymdownx.snippets (which is configured in mkdocs.yml), not by the include-markdown plugin. The includes will render correctly as configured.

Likely an incorrect or invalid review comment.

+ timestamp fixes (all are datetime now)
+ openapi spec generation fix

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 0

🧹 Nitpick comments (5)
backend/app/services/event_bus.py (1)

23-23: Consider whether from_attributes=True is needed.

The from_attributes configuration is typically used when constructing models from ORM objects or other attribute-bearing instances. Since EventBusEvent appears to be constructed only via explicit keyword arguments (line 159), this configuration may be unnecessary.

backend/app/domain/events/typed.py (2)

29-41: Consider clarifying the semantic difference between timestamp and stored_at.

Both fields default to datetime.now(timezone.utc). If timestamp represents when the event occurred and stored_at represents when it was persisted, the current defaults may obscure this distinction when events are created but stored later.


225-239: Clarify the distinction between UserLoginEvent and UserLoggedInEvent.

These two events have identical fields but different event types. If USER_LOGIN represents the login attempt and USER_LOGGED_IN represents successful completion, consider adding a docstring to clarify this semantic difference.

backend/app/infrastructure/kafka/events/notification.py (1)

67-71: Consider adding a default for changed_fields to match the domain model.

The domain's NotificationPreferencesUpdatedEvent (in typed.py) defines changed_fields: list[str] = Field(default_factory=list), but this Kafka event class requires the field. This inconsistency could cause validation errors when constructing events. Consider whether the field should be required (remove default from domain) or optional (add default here).

♻️ Suggested fix to add default
 class NotificationPreferencesUpdatedEvent(BaseEvent):
     event_type: Literal[EventType.NOTIFICATION_PREFERENCES_UPDATED] = EventType.NOTIFICATION_PREFERENCES_UPDATED
     topic: ClassVar[KafkaTopic] = KafkaTopic.NOTIFICATION_EVENTS
     user_id: str
-    changed_fields: list[str]
+    changed_fields: list[str] = []
deploy.sh (1)

341-359: Make OpenAPI output writing more robust (create parent dir; avoid python -c quoting fragility).

Right now > "../$OUTPUT" will fail if docs/reference/ (or a custom output dir) doesn’t exist, and the multiline python -c "..." is easy to break with quoting changes.

Proposed diff
 cmd_openapi() {
     print_header "Generating OpenAPI Spec"
 
     local OUTPUT="${1:-docs/reference/openapi.json}"
 
     cd backend
     print_info "Extracting schema from FastAPI app..."
 
-    uv run python -c "
+    mkdir -p "../$(dirname "$OUTPUT")"
+
+    uv run python - > "../$OUTPUT" <<'PY'
 import json
 from app.main import create_app
 app = create_app()
 schema = app.openapi()
 print(json.dumps(schema, indent=2))
-" > "../$OUTPUT"
+PY
 
     cd ..
     print_success "OpenAPI spec written to $OUTPUT"
 }
📜 Review details


📥 Commits

Reviewing files that changed from the base of the PR and between 7d88571 and 0f961b5.

📒 Files selected for processing (10)
  • backend/app/api/routes/events.py
  • backend/app/domain/events/event_models.py
  • backend/app/domain/events/typed.py
  • backend/app/infrastructure/kafka/events/notification.py
  • backend/app/infrastructure/kafka/mappings.py
  • backend/app/services/admin/admin_events_service.py
  • backend/app/services/event_bus.py
  • backend/app/services/event_service.py
  • backend/app/services/saga/execution_saga.py
  • deploy.sh
🚧 Files skipped from review as they are similar to previous changes (2)
  • backend/app/api/routes/events.py
  • backend/app/services/admin/admin_events_service.py
🧰 Additional context used
🧬 Code graph analysis (4)
backend/app/infrastructure/kafka/mappings.py (2)
backend/app/domain/events/typed.py (3)
  • ExecutionAcceptedEvent (63-68)
  • NotificationPreferencesUpdatedEvent (315-318)
  • UserLoginEvent (225-230)
backend/app/infrastructure/kafka/events/notification.py (1)
  • NotificationPreferencesUpdatedEvent (67-71)
backend/app/services/saga/execution_saga.py (2)
backend/app/services/coordinator/queue_manager.py (1)
  • user_id (34-35)
backend/tests/unit/services/idempotency/test_middleware.py (1)
  • event (30-34)
backend/app/infrastructure/kafka/events/notification.py (2)
backend/app/domain/events/typed.py (3)
  • NotificationClickedEvent (307-312)
  • BaseEvent (29-40)
  • NotificationPreferencesUpdatedEvent (315-318)
backend/app/domain/enums/kafka.py (1)
  • KafkaTopic (7-53)
backend/app/domain/events/typed.py (1)
backend/app/domain/enums/auth.py (1)
  • LoginMethod (4-10)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: cubic · AI code reviewer
  • GitHub Check: E2E Tests
  • GitHub Check: Integration Tests
🔇 Additional comments (21)
backend/app/services/event_bus.py (3)

12-12: LGTM! Clean migration to Pydantic.

The conversion of EventBusEvent from a dataclass to a Pydantic BaseModel is well-executed. The timestamp field correctly uses datetime instead of str, which provides better type safety and eliminates manual ISO string handling.

Also applies to: 20-29


139-139: Excellent use of Pydantic serialization and timezone-aware timestamps.

The serialization using model_dump_json() (line 139) follows Pydantic v2 best practices, and creating timestamps with datetime.now(timezone.utc) (line 162) ensures consistent timezone-aware datetime handling.

Also applies to: 162-162


290-292: Deserialization correctly uses Pydantic validation.

The use of model_validate() properly handles the conversion of the timestamp ISO string back to a datetime object. The round-trip serialization (datetime → ISO string → datetime) via Pydantic is handled transparently and correctly.
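A minimal sketch of that round trip, assuming Pydantic v2; the model here is a simplified stand-in for `EventBusEvent`.

```python
from datetime import datetime, timezone

from pydantic import BaseModel

class BusEvent(BaseModel):
    channel: str
    timestamp: datetime

original = BusEvent(
    channel="executions", timestamp=datetime.now(timezone.utc)
)
raw = original.model_dump_json()              # datetime -> ISO-8601 string
restored = BusEvent.model_validate_json(raw)  # ISO string -> datetime
```

Because both sides are timezone-aware, the restored value compares equal to the original instant with no manual ISO formatting anywhere.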

backend/app/services/event_service.py (4)

7-15: LGTM!

The updated imports correctly reference the new DomainEvent and ArchivedEvent types from the migrated domain events module, aligning with the Pydantic-based event system.


79-82: LGTM!

The direct access to e.metadata.user_id is safe since metadata is a required field on BaseEvent. The null check on line 80 correctly handles the case where user_id is None.


153-161: Verify filtering behavior for events with null user_id.

Events where e.metadata.user_id is None will be filtered out by this comparison. If there are system-generated events without a user_id that should be visible to the requesting user, they would be excluded.


179-191: LGTM!

The return type annotations have been correctly updated to use DomainEvent and ArchivedEvent from the new typed event system, maintaining consistency with the repository layer changes.

Also applies to: 217-227, 232-242

backend/app/domain/events/event_models.py (4)

53-66: LGTM!

The migration of EventFilter from dataclass to Pydantic BaseModel with ConfigDict(from_attributes=True) enables ORM-style instantiation and provides built-in validation. The field definitions are clean and appropriately typed.


83-91: LGTM!

The result types (EventListResult, EventBrowseResult, EventDetail, EventReplayInfo, ExecutionEventsResult) now correctly use DomainEvent for their event fields, providing proper type discrimination and validation through the Pydantic-based union type.

Also applies to: 94-101, 104-110, 153-161, 164-181


172-181: LGTM!

The get_filtered_events method correctly accesses e.metadata.service_name without null checks since both metadata and service_name are required fields in the new domain model.


184-197: LGTM!

The EventExportRow model has been properly migrated to Pydantic BaseModel with ConfigDict(from_attributes=True). The timestamp field is now correctly typed as datetime, aligning with the PR's standardization of timestamp handling.

backend/app/domain/events/typed.py (4)

15-27: LGTM!

The EventMetadata model is well-designed with appropriate defaults. The correlation_id auto-generates a UUID, and optional fields like user_id, ip_address, and user_agent correctly default to None.


46-128: LGTM!

The execution lifecycle events are comprehensive and well-structured, covering the full execution flow from request through completion/failure/timeout/cancellation. Each event captures the appropriate context for its lifecycle stage.


520-537: LGTM!

The ArchivedEvent model correctly wraps deletion metadata while preserving the original event's type information via event_type: EventType. Keeping it separate from the DomainEvent union is appropriate since archived events have a different lifecycle.


541-610: LGTM!

The DomainEvent discriminated union with Discriminator("event_type") provides excellent type safety for event deserialization. The domain_event_adapter TypeAdapter enables polymorphic validation of raw dictionaries to the correct typed event class.

backend/app/services/saga/execution_saga.py (2)

164-192: LGTM!

The CreatePodCommandEvent construction correctly propagates execution context from the triggering event. The user_id fallback to "system" is appropriate for saga orchestration events.


179-184: The code change is safe. Kafka's BaseEvent (line 21 of backend/app/infrastructure/kafka/events/base.py) defines metadata: AvroEventMetadata as a required field with no default value, guaranteeing it is never None. The guard check if event.metadata in the original code was unnecessary. The new pattern event.metadata.user_id or "system" correctly assumes metadata exists and falls back when user_id is None (as defined in AvroEventMetadata at line 15 of backend/app/infrastructure/kafka/events/metadata.py).

backend/app/infrastructure/kafka/mappings.py (2)

7-17: LGTM!

The new imports for ExecutionAcceptedEvent, NotificationPreferencesUpdatedEvent, and UserLoginEvent are correctly added from their respective Kafka event modules.

Also applies to: 18-26, 63-71


77-111: LGTM!

The event_map is correctly extended with mappings for the new event types (EXECUTION_ACCEPTED, USER_LOGIN, NOTIFICATION_PREFERENCES_UPDATED). These align with the event classes defined in the Kafka events modules and the domain EventType enum.
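The shape of such a mapping can be sketched as follows; the enum values and classes are illustrative stand-ins, not the repository's actual definitions.

```python
# Hypothetical sketch of an EventType -> Kafka event class mapping.
from enum import Enum

class EventType(str, Enum):
    EXECUTION_ACCEPTED = "execution_accepted"
    USER_LOGIN = "user_login"
    NOTIFICATION_PREFERENCES_UPDATED = "notification_preferences_updated"

class ExecutionAcceptedEvent: ...
class UserLoginEvent: ...
class NotificationPreferencesUpdatedEvent: ...

event_map: dict[EventType, type] = {
    EventType.EXECUTION_ACCEPTED: ExecutionAcceptedEvent,
    EventType.USER_LOGIN: UserLoginEvent,
    EventType.NOTIFICATION_PREFERENCES_UPDATED: NotificationPreferencesUpdatedEvent,
}
```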

backend/app/infrastructure/kafka/events/notification.py (1)

1-1: LGTM!

The timestamp fields (sent_at, delivered_at, read_at, clicked_at) have been correctly migrated from str to datetime, aligning with the PR's standardization of timestamp handling across the codebase.

Also applies to: 22-28, 31-37, 50-55, 58-64

deploy.sh (1)

349-355: Factory pattern for OpenAPI is safe—no blocking initialization in create_app().

The switch from a global app to app = create_app() is a good change. Dishka uses lazy initialization: the database, Redis, and Kafka providers only create connections when explicitly requested, not during create_app(). The lifespan handlers (which do async initialization) are not invoked when generating the schema—only the synchronous app setup runs. The only potential failure point is if Settings() cannot load SECRET_KEY from environment or .env file, but that's a deployment configuration issue rather than a code problem. No refactoring needed.

+ fixed schema mismatch
+ removed defensive try-catch for redis client - DI lib handles cleanup

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 0

🧹 Nitpick comments (8)
backend/tests/unit/domain/events/test_event_schema_coverage.py (4)

17-17: Consider exposing a public function for the Kafka event mapping.

Importing the private function _get_event_type_to_class_mapping couples this test to internal implementation details. If the schema registry is refactored, this test may break unexpectedly. Consider exposing a public API for this mapping or documenting that this private function is intentionally used by tests.


26-43: Redundant check and fragile union introspection logic.

Two issues with this helper:

  1. Lines 32-34: The expression get_args(inner) or [inner] can never be falsy (it returns [inner] at minimum), so lines 33-34 are unreachable dead code.

  2. Lines 41-43: __subclasses__() only retrieves direct subclasses. If your class hierarchy has intermediate base classes (e.g., DomainBaseEvent → CategoryEvent → ConcreteEvent), those concrete events won't be found.

Proposed fix
-            # Python 3.10+ union syntax
-            event_classes = get_args(inner) or [inner]
-            if not event_classes:
-                event_classes = list(union_types[:-1])  # Exclude Discriminator
+            # Python 3.10+ union syntax
+            event_classes = get_args(inner) or list(union_types[:-1]) or [inner]

For the subclass traversal, consider a recursive approach:

def _get_all_subclasses(cls: type) -> list[type]:
    """Recursively get all subclasses."""
    result = []
    for subclass in cls.__subclasses__():
        result.append(subclass)
        result.extend(_get_all_subclasses(subclass))
    return result
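A quick self-contained check of why the recursive traversal matters:

```python
def get_all_subclasses(cls: type) -> list[type]:
    """Recursively collect all subclasses, not just direct children."""
    result: list[type] = []
    for subclass in cls.__subclasses__():
        result.append(subclass)
        result.extend(get_all_subclasses(subclass))
    return result


# Illustrative hierarchy with an intermediate base class.
class DomainBase: ...
class CategoryEvent(DomainBase): ...
class ConcreteEvent(CategoryEvent): ...


assert ConcreteEvent not in DomainBase.__subclasses__()  # direct lookup misses it
assert ConcreteEvent in get_all_subclasses(DomainBase)   # recursion finds it
```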

98-104: String matching on exception messages is fragile.

Parsing error message strings to detect discriminator failures could break if Pydantic changes its error wording in future versions. Consider checking the exception type or structured error details instead.

More robust alternative
from pydantic import ValidationError

for et in EventType:
    try:
        domain_event_adapter.validate_python({"event_type": et})
    except ValidationError as e:
        # Check if the error is a discriminator mismatch vs missing fields
        for error in e.errors():
            if error.get("type") == "union_tag_invalid":
                errors.append(f"  - {et.value}: not in DomainEvent union")
                break
    except Exception as e:
        errors.append(f"  - {et.value}: unexpected error: {e}")

171-186: Minor: Magic number and incomplete snake_case validation.

  1. Line 174: The threshold 50 is a magic number that may need updates. Consider deriving it from a known baseline or adding a comment explaining why 50.

  2. Lines 181-184: The snake_case check only validates lowercase and absence of spaces/hyphens, but doesn't catch issues like double underscores (foo__bar) or leading/trailing underscores. A regex like ^[a-z][a-z0-9]*(_[a-z0-9]+)*$ would be more thorough.

Optional: stricter snake_case validation
import re
SNAKE_CASE_PATTERN = re.compile(r"^[a-z][a-z0-9]*(_[a-z0-9]+)*$")

def test_all_event_types_are_lowercase_snake_case(self) -> None:
    violations: list[str] = []
    for et in EventType:
        if not SNAKE_CASE_PATTERN.match(et.value):
            violations.append(f"  - {et.name}: '{et.value}' is not valid snake_case")
    assert not violations, "EventType naming violations:\n" + "\n".join(violations)
backend/app/services/sse/redis_bus.py (2)

88-88: Same priming pattern - ensure consistency.

This line mirrors the subscription priming added to open_subscription (line 77). The same considerations apply:

  • Consider adding a comment explaining the priming step for consistency
  • Verify the 1-second timeout is appropriate

If this pattern is repeated elsewhere or grows more complex, consider extracting a shared helper method:

async def _prime_subscription(self, pubsub: redis.client.PubSub, channel: str) -> None:
    """Consume the initial subscription confirmation message."""
    await pubsub.subscribe(channel)
    await pubsub.get_message(timeout=1.0)

However, given the simplicity of the current implementation, this refactoring can be deferred.


77-77: Add documentation for the subscription priming step and align the timeout value with message polling.

The get_message call on line 77 (and line 88) consumes the Redis subscription confirmation message to establish a clean subscription state. This is valid but needs clarification:

  1. The call uses the default ignore_subscribe_messages=False, which intentionally returns subscription messages (unlike line 25's message polling which passes ignore_subscribe_messages=True to skip them).
  2. The 1.0-second timeout differs from the 0.5-second timeout used during normal message polling (line 25). While 1.0 seconds is reasonable for a one-time setup operation, the inconsistency and lack of justification make it unclear.

Consider adding a brief explanatory comment:

Suggested change
 await pubsub.subscribe(channel)
+# Consume subscription confirmation message to establish a clean subscription state
 await pubsub.get_message(timeout=1.0)

Optionally, extract the timeout to a class constant for maintainability and to document the intentional difference from message polling operations.

backend/app/infrastructure/kafka/events/notification.py (1)

69-73: New event class looks good, consider requiring changed_fields.

The NotificationPreferencesUpdatedEvent structure is consistent with other notification events and matches the domain event definition. The use of Field(default_factory=list) is the correct Pydantic pattern for mutable defaults.

Consider whether changed_fields should be required rather than defaulting to an empty list. If notification preferences were updated, there should logically be at least one changed field. An empty list might hide bugs in producer code that forgets to populate this field.

💡 Optional: Make changed_fields required

If changed_fields should always be populated when preferences are updated:

-    changed_fields: list[str] = Field(default_factory=list)
+    changed_fields: list[str]

Alternatively, if empty list is valid but you want to validate it's intentional, you could add a validator:

+    @field_validator('changed_fields')
+    @classmethod
+    def validate_changed_fields(cls, v: list[str]) -> list[str]:
+        if not v:
+            logger.warning("NotificationPreferencesUpdatedEvent created with no changed_fields")
+        return v
backend/tests/integration/test_sse_routes.py (1)

126-149: Consider consistent datetime usage across notification tests.

Line 134 uses a hardcoded past date datetime(2024, 6, 15, 12, 0, 0, tzinfo=timezone.utc), while line 167 uses datetime.now(timezone.utc). For consistency and to avoid potential brittleness, consider using datetime.now(timezone.utc) in both tests.

♻️ Proposed change for consistency
         notification = RedisNotificationMessage(
             notification_id="notif-123",
             severity=NotificationSeverity.HIGH,
             status=NotificationStatus.PENDING,
             tags=["urgent", "system"],
             subject="Test Alert",
             body="This is a test notification",
             action_url="https://example.com/action",
-            created_at=datetime(2024, 6, 15, 12, 0, 0, tzinfo=timezone.utc),
+            created_at=datetime.now(timezone.utc),
         )

Otherwise, the notification pub/sub tests correctly verify message structure, field values, and user-specific channel isolation with proper enum member usage.

Also applies to: 159-169

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 0f961b5 and 2097c16.

📒 Files selected for processing (7)
  • backend/app/infrastructure/kafka/events/notification.py
  • backend/app/services/sse/redis_bus.py
  • backend/tests/conftest.py
  • backend/tests/integration/test_admin_routes.py
  • backend/tests/integration/test_sse_routes.py
  • backend/tests/unit/domain/events/test_event_schema_coverage.py
  • backend/tests/unit/services/pod_monitor/test_event_mapper.py
💤 Files with no reviewable changes (1)
  • backend/tests/integration/test_admin_routes.py
🚧 Files skipped from review as they are similar to previous changes (2)
  • backend/tests/unit/services/pod_monitor/test_event_mapper.py
  • backend/tests/conftest.py
🧰 Additional context used
🧬 Code graph analysis (3)
backend/tests/integration/test_sse_routes.py (4)
backend/app/domain/enums/sse.py (2)
  • SSEControlEvent (4-12)
  • SSENotificationEvent (15-21)
backend/app/domain/events/typed.py (1)
  • PodCreatedEvent (133-137)
backend/app/schemas_pydantic/sse.py (3)
  • RedisNotificationMessage (91-101)
  • RedisSSEMessage (58-63)
  • SSEHealthResponse (115-125)
backend/app/services/sse/redis_bus.py (5)
  • get (23-35)
  • open_subscription (73-78)
  • publish_event (65-71)
  • open_notification_subscription (84-89)
  • publish_notification (80-82)
backend/app/services/sse/redis_bus.py (1)
backend/tests/integration/services/sse/test_redis_bus.py (2)
  • pubsub (71-72)
  • get_message (40-45)
backend/app/infrastructure/kafka/events/notification.py (1)
backend/app/domain/events/typed.py (3)
  • NotificationClickedEvent (307-312)
  • BaseEvent (29-40)
  • NotificationPreferencesUpdatedEvent (315-318)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
  • GitHub Check: Integration Tests
  • GitHub Check: E2E Tests
🔇 Additional comments (10)
backend/tests/unit/domain/events/test_event_schema_coverage.py (3)

62-87: LGTM!

Clear set-difference logic with actionable error messages that will help developers quickly identify which event types need implementation.


107-145: Orphan detection logic is sound but has same subclass limitation.

Both orphan tests use __subclasses__() which only finds direct subclasses. If your event hierarchy uses intermediate base classes, nested subclasses won't be checked. Consider the recursive subclass helper mentioned earlier.

Otherwise, the validation logic correctly identifies classes with missing or invalid event_type defaults.


147-165: LGTM!

This consistency check ensures domain and Kafka event classes maintain aligned naming, which is valuable for maintainability. The implementation correctly handles partial mappings.

backend/app/infrastructure/kafka/events/notification.py (2)

1-5: LGTM! Necessary imports for datetime types and Pydantic Field.

The added imports support the timestamp field migrations to datetime and the Field(default_factory=list) pattern in the new event class.


30-65: No code changes needed—datetime types and serialization are already properly configured.

The notification events already use datetime types for timestamp fields (sent_at, delivered_at, read_at, clicked_at). Pydantic v2.9.2 with mode="json" automatically serializes all datetime fields to ISO strings when events are sent via Kafka, and model_validate() deserializes them back to datetime objects. This is consistent across both infrastructure and domain event definitions, and all consumers already receive properly deserialized datetime objects.

Likely an incorrect or invalid review comment.
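The round-trip described here — datetime on both ends, an ISO string on the wire — is equivalent to this stdlib-only sketch of what `model_dump(mode="json")` / `model_validate` do for a timestamp field:

```python
from datetime import datetime, timezone

sent_at = datetime(2026, 1, 11, 22, 13, tzinfo=timezone.utc)

wire = sent_at.isoformat()               # producer side: ISO string for Kafka
restored = datetime.fromisoformat(wire)  # consumer side: back to a datetime

assert wire == "2026-01-11T22:13:00+00:00"
assert restored == sent_at
```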

backend/tests/integration/test_sse_routes.py (5)

27-40: LGTM! Authentication enforcement is well-tested.

The test class correctly verifies that all SSE endpoints (notification stream, execution stream, and health) require authentication by asserting 401 responses for unauthenticated requests.


43-54: LGTM! Proper use of Pydantic model validation.

The health check test correctly uses SSEHealthResponse.model_validate() to parse the response, aligning with the PR's migration to Pydantic models. The assertions validate expected health states and non-negative connection counts.


61-112: LGTM! Redis pub/sub execution tests are well-structured.

The tests correctly verify:

  • RedisSSEMessage wrapper structure with proper event_type enum member comparison (not .value)
  • Execution-specific channel isolation to ensure events are delivered only to subscribed channels
  • Typed message retrieval using subscription.get(RedisSSEMessage)

Note: These tests depend on the PodCreatedEvent import compatibility flagged in an earlier comment.


187-269: LGTM! SSE stream event tests are comprehensive and well-designed.

The tests effectively verify:

  • Notification and execution stream control events (CONNECTED, SUBSCRIBED) with correct field validation
  • Proper enum member usage (SSENotificationEvent.CONNECTED, SSEControlEvent.SUBSCRIBED)
  • connection_id uniqueness across concurrent streams using asyncio.gather
  • Timestamp presence in all control events

The manual loop breaks (lines 198, 227) are appropriate for integration tests where control events are yielded immediately upon stream creation.


14-14: No action required. The import is correct.

The test properly imports PodCreatedEvent from the infrastructure Kafka layer, which extends the infrastructure BaseEvent (from app.infrastructure.kafka.events.base). The publish_event method in redis_bus.py expects this same infrastructure BaseEvent type. The test correctly instantiates PodCreatedEvent with AvroEventMetadata, which is the metadata type required by the infrastructure layer. Type compatibility is correct and there is no mismatch.

@sonarqubecloud

@cubic-dev-ai cubic-dev-ai bot left a comment


1 issue found across 2 files (changes from recent commits).

Prompt for AI agents (all issues)

Check if these issues are valid — if so, understand the root cause of each and fix them.


<file name="backend/app/events/schema/schema_registry.py">

<violation number="1" location="backend/app/events/schema/schema_registry.py:213">
P2: Creating a new `httpx.AsyncClient()` per request defeats connection pooling. Since `set_compatibility` is called in a loop during `initialize_schemas`, consider creating the client once and reusing it across all calls, either by passing it as a parameter or storing it as an instance attribute.</violation>
</file>

Reply with feedback, questions, or to request a fix. Tag @cubic-dev-ai to re-run a review.


 url = f"{self.url}/config/{subject}"
-        response = httpx.put(url, json={"compatibility": mode})
+        async with httpx.AsyncClient() as client:

@cubic-dev-ai cubic-dev-ai bot Jan 11, 2026


P2: Creating a new httpx.AsyncClient() per request defeats connection pooling. Since set_compatibility is called in a loop during initialize_schemas, consider creating the client once and reusing it across all calls, either by passing it as a parameter or storing it as an instance attribute.

Prompt for AI agents
Check if this issue is valid — if so, understand the root cause and fix it. At backend/app/events/schema/schema_registry.py, line 213:

<comment>Creating a new `httpx.AsyncClient()` per request defeats connection pooling. Since `set_compatibility` is called in a loop during `initialize_schemas`, consider creating the client once and reusing it across all calls, either by passing it as a parameter or storing it as an instance attribute.</comment>

<file context>
@@ -210,7 +210,8 @@ def set_compatibility(self, subject: str, mode: str) -> None:
 
         url = f"{self.url}/config/{subject}"
-        response = httpx.put(url, json={"compatibility": mode})
+        async with httpx.AsyncClient() as client:
+            response = await client.put(url, json={"compatibility": mode})
         response.raise_for_status()
</file context>


@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 0

🧹 Nitpick comments (2)
backend/app/events/admin_utils.py (1)

37-50: LGTM! Proper async handling of topic creation.

The blocking result() call is correctly offloaded to the executor. The dual timeout configuration is appropriate: operation_timeout controls broker-side behavior while result(timeout) controls client-side waiting.

Note: The create_topics call (lines 42-44) returns a futures dict immediately and is non-blocking, so wrapping it in run_in_executor adds slight overhead but maintains consistency and doesn't harm correctness.

🔧 Optional: Skip executor for non-blocking create_topics call
     async def create_topic(self, topic: str, num_partitions: int = 1, replication_factor: int = 1) -> bool:
         """Create a single topic."""
         try:
             new_topic = NewTopic(topic, num_partitions=num_partitions, replication_factor=replication_factor)
             loop = asyncio.get_running_loop()
-            futures = await loop.run_in_executor(
-                None, lambda: self._admin.create_topics([new_topic], operation_timeout=30.0)
-            )
+            futures = self._admin.create_topics([new_topic], operation_timeout=30.0)
             await loop.run_in_executor(None, lambda: futures[topic].result(timeout=30.0))
             self.logger.info(f"Topic {topic} created successfully")
             return True
backend/app/events/schema/schema_registry.py (1)

213-214: Avoid creating a new AsyncClient per call.

Creating a new AsyncClient for each invocation loses connection pooling benefits. Since initialize_schemas calls this method once per event class, this results in N client instantiations. Consider:

  1. Accepting an optional client parameter
  2. Creating a single client in initialize_schemas and passing it down
  3. Adding an explicit timeout for resilience
♻️ Suggested refactor
-    async def set_compatibility(self, subject: str, mode: str) -> None:
+    async def set_compatibility(
+        self, subject: str, mode: str, client: httpx.AsyncClient | None = None
+    ) -> None:
         """
         Set compatibility for a subject via REST API.
         Valid: BACKWARD, FORWARD, FULL, NONE, BACKWARD_TRANSITIVE, FORWARD_TRANSITIVE, FULL_TRANSITIVE
         """
         valid_modes = {
             "BACKWARD",
             "FORWARD",
             "FULL",
             "NONE",
             "BACKWARD_TRANSITIVE",
             "FORWARD_TRANSITIVE",
             "FULL_TRANSITIVE",
         }
         if mode not in valid_modes:
             raise ValueError(f"Invalid compatibility mode: {mode}")

         url = f"{self.url}/config/{subject}"
-        async with httpx.AsyncClient() as client:
-            response = await client.put(url, json={"compatibility": mode})
+        if client is not None:
+            response = await client.put(url, json={"compatibility": mode})
+        else:
+            async with httpx.AsyncClient(timeout=10.0) as c:
+                response = await c.put(url, json={"compatibility": mode})
         response.raise_for_status()
         self.logger.info(f"Set {subject} compatibility to {mode}")

Then update initialize_schemas:

     async def initialize_schemas(self) -> None:
         """Initialize all event schemas in the registry (set compat + register)."""
         if self._initialized:
             return

-        for event_class in _get_all_event_classes():
-            subject = f"{self.subject_prefix}{event_class.__name__}-value"
-            await self.set_compatibility(subject, "FORWARD")
-            self.register_schema(subject, event_class)
+        async with httpx.AsyncClient(timeout=10.0) as client:
+            for event_class in _get_all_event_classes():
+                subject = f"{self.subject_prefix}{event_class.__name__}-value"
+                await self.set_compatibility(subject, "FORWARD", client=client)
+                self.register_schema(subject, event_class)

         self._initialized = True
         self.logger.info(f"Initialized {len(_get_all_event_classes())} event schemas")
📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2097c16 and e2e95ef.

📒 Files selected for processing (2)
  • backend/app/events/admin_utils.py
  • backend/app/events/schema/schema_registry.py
🧰 Additional context used
🧬 Code graph analysis (2)
backend/app/events/schema/schema_registry.py (2)
backend/tests/conftest.py (1)
  • client (102-110)
backend/tests/unit/conftest.py (1)
  • client (57-58)
backend/app/events/admin_utils.py (1)
backend/scripts/create_topics.py (1)
  • create_topics (20-101)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (3)
  • GitHub Check: cubic · AI code reviewer
  • GitHub Check: Integration Tests
  • GitHub Check: E2E Tests
🔇 Additional comments (2)
backend/app/events/admin_utils.py (1)

27-35: LGTM! Correctly offloads blocking operation to thread pool.

The list_topics call is blocking and properly delegated to the executor to avoid blocking the event loop. The error handling with explicit return False is a good improvement.

backend/app/events/schema/schema_registry.py (1)

226-227: Correct usage of await for the async method.

The await is properly applied to the now-async set_compatibility. Note that register_schema (line 227) remains synchronous as it uses the confluent-kafka SchemaRegistryClient, which could block the event loop during schema registration. This is pre-existing behavior but worth considering for future async consistency.

@HardMax71 HardMax71 merged commit 48918af into main Jan 11, 2026
18 checks passed
@HardMax71 HardMax71 deleted the type-fixes branch January 11, 2026 22:13