feat(workflow-engine): Track tainted workflow evaluations #107311
Conversation
Cursor Bugbot has reviewed your changes and found 1 potential issue.
```python
def report_metrics(self, metric_name: str) -> None:
    metrics_incr(metric_name, self.tainted, tags={"tainted": True})
    metrics_incr(metric_name, self.untainted, tags={"tainted": False})
```
**Duplicated stats class for tainted evaluation tracking** (Low Severity)

The new `EvaluationStats` class duplicates the existing `_ConditionEvaluationStats` class in `delayed_workflow.py`. Both have identical fields (`tainted: int`, `untainted: int`) and serve the same purpose of tracking tainted vs. untainted evaluation counts. The new class adds useful methods (`from_results`, `__add__`, `report_metrics`) that could benefit the delayed workflow code as well. These should be unified into a single class to avoid maintenance burden and ensure consistent taint tracking across both the immediate and delayed evaluation paths.
well aware, but need to make that code workflow based first.
🤔 should these be workflow-based methods?
One thing I've been thinking about is whether we could compose these condition group / condition evaluation methods more, so we can reuse them in delayed processing as well. If we go down that approach, I'd think of these as DataCondition-based.
**saponifi3d** left a comment
generally lgtm, mostly just nitpicks / thoughts.
```diff
 def test_workflow_trigger(self) -> None:
-    triggered_workflows, _ = evaluate_workflow_triggers(
+    triggered_workflows, _, _ = evaluate_workflow_triggers(
```
🤔 since the tuple is growing, should we return a typed dict instead? that way it's a little easier to reason through the returned result.
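One way to get named access without breaking existing call sites is a `NamedTuple` rather than a dict, since positional unpacking keeps working. A sketch under assumed names (`TriggerEvaluation` and its fields are hypothetical; the dummy body just stands in for the real condition logic):

```python
from typing import NamedTuple, Set


class TriggerEvaluation(NamedTuple):
    # Hypothetical field names; the real values are whatever
    # evaluate_workflow_triggers returns in the PR.
    triggered_workflows: Set[str]
    tainted: int
    untainted: int


def evaluate_workflow_triggers(workflows: Set[str]) -> TriggerEvaluation:
    # Dummy body: treat every workflow as triggered and untainted.
    return TriggerEvaluation(
        triggered_workflows=set(workflows),
        tainted=0,
        untainted=len(workflows),
    )


result = evaluate_workflow_triggers({"w1", "w2"})
# Existing positional unpacking still works...
triggered_workflows, _, _ = result
# ...while new call sites can read fields by name, e.g. result.untainted.
```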
```python
        return TriggerResult(triggered=self.triggered, error=error)

    @staticmethod
    def choose_tainted(a: "TriggerResult", b: "TriggerResult") -> "TriggerResult":
```
should we just make this a list of TriggerResults and return the first tainted? might be a little more reusable that way.
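The list-based variant could be as small as the helper below. This is a sketch, not the PR's code: `TriggerResult` here is a minimal stand-in with only the fields the helper needs, and `first_tainted` is a hypothetical name.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass
class TriggerResult:
    # Minimal stand-in for the PR's TriggerResult.
    triggered: bool
    error: bool = False

    def is_tainted(self) -> bool:
        return self.error


def first_tainted(results: Iterable[TriggerResult]) -> Optional[TriggerResult]:
    """Return the first tainted result, or None when every result is clean."""
    return next((r for r in results if r.is_tainted()), None)
```

Accepting any iterable makes this reusable wherever multiple results need to be reduced, instead of being limited to exactly two.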
```python
if evaluation.is_tainted():
    tainted_untriggered += 1
else:
    untainted_untriggered += 1
```
it kinda feels like the tainted / untainted stuff could be encapsulated a little more. Could we just add the evaluation result to a list and have it derive this information? That way we don't need to independently track the counts and then rebuild them for the results.
```python
@sentry_sdk.trace
@scopedstats.timer()
def evaluate_workflows_action_filters(
```
unrelated: we might want to look at decomposing this method and the trigger condition methods. it seems like we could probably compose these two a bit more and reduce code replication


Report metrics for workflow evaluations that may have produced incorrect results due to errors during condition evaluation ("tainted" results).

This helps monitor evaluation reliability by emitting a single metric, `process_workflows.workflows_evaluated`, with a `tainted` tag, allowing us to track the ratio and number of tainted workflows. This doesn't yet propagate taintedness to delayed evaluation; that's a planned follow-up.
Updates ISWF-1960.