Improve timeout error handling and maxFailures behavior #19

lambdalisue · 2026-01-08T14:10:19Z

Summary

Add timeout signal factory methods with automatic cleanup via using declarations
Enhance timeout errors to track which step was executing when timeout occurred
Refactor retry logic to support caller-controlled error classification via shouldRetry callback
Fix maxFailures to properly abort all scenarios (running and pending) instead of throwing
Preserve raw error types throughout the execution chain

Why

Timeout Error Handling:
Previously, timeout signal creation was scattered and lacked proper cleanup mechanisms. The new static factory methods (timeoutSignal()) provide a consistent way to create timeout signals with automatic cleanup via the Disposable interface and using declarations.

ScenarioTimeoutError now tracks which step was executing when the timeout occurred, providing better debugging context with step name and index information.

Retry Logic Separation:
The retry utility was tightly coupled to specific error types (TimeoutError checks), making it inflexible. By introducing the shouldRetry callback, callers can now control error classification without modifying the retry logic. This also allows preserving raw error types (unknown) instead of converting everything to Error, maintaining type information throughout the chain.

maxFailures Behavior:
When maxFailures was reached, the Runner would throw an error, leaving remaining scenarios unrecorded and preventing proper cleanup. This caused incomplete test results and inconsistent reporter event emission.

Now, the Runner calls controller.abort() with a Skip reason when maxFailures is reached. This:

Aborts in-progress scenarios via signal propagation (they become "skipped")
Creates skip results for unexecuted scenarios
Preserves signal.reason from external aborts (e.g., TimeoutError)
Ensures all scenarios are properly recorded and reporter events are emitted consistently

Test Plan

All existing tests pass
Added 19 new test cases for timeout signal creation and cleanup
Added 3 new test cases for maxFailures behavior (sequential and parallel execution)
Verified timeout errors include step information
Verified retry respects shouldRetry callback
Verified maxFailures aborts all scenarios (running and pending)
Verified signal.reason is preserved from external sources

codecov · 2026-01-08T14:13:21Z

Codecov Report

❌ Patch coverage is 83.73206% with 34 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
packages/probitas-runner/runner.ts	59.57%	19 Missing ⚠️
packages/probitas-runner/step_runner.ts	74.57%	14 Missing and 1 partial ⚠️

📢 Thoughts on this report? Let us know!

…logic Simplified timeout signal creation and enhanced error context tracking through static factory methods and step information enrichment. Changes: - Add timeoutSignal() static methods to timeout error classes with Disposable interface for automatic cleanup via 'using' declarations - Enhance ScenarioTimeoutError to track which step was executing when timeout occurred (name and index) - Refactor retry() to accept shouldRetry callback for caller-controlled error classification without coupling to specific error types - Preserve raw error types through retry (unknown instead of Error) to maintain type information and avoid lossy conversions - Remove isTimeoutError() and ErrorWithRetryMetadata as they're no longer needed with the new architecture - Add comprehensive tests for timeout signal creation, error enrichment, and retry behavior (19 new test cases) This separates concerns between timeout signal creation (error classes), retry logic (retry utility), and error classification (callers via shouldRetry), improving maintainability and testability.

Previously, when maxFailures was reached, the Runner would throw an error, leaving remaining scenarios unrecorded. This caused incomplete test results and prevented proper cleanup. Now, the Runner: - Calls controller.abort() with Skip reason when maxFailures is reached - Aborts in-progress scenarios via signal propagation (status: "skipped") - Creates skip results for unexecuted scenarios (status: "skipped") - Preserves signal.reason from external aborts (e.g., TimeoutError) This ensures all scenarios are properly recorded and reporter events are emitted consistently, providing complete test results even when stopping early due to maxFailures.

Copilot

Pull request overview

This pull request improves timeout error handling and fixes maxFailures behavior in the probitas-runner package. The changes enhance error tracking with step context, refactor retry logic for better flexibility, and ensure proper scenario abortion when maxFailures is reached.

Key Changes:

Added static factory methods (timeoutSignal()) for creating timeout signals with automatic cleanup via using declarations
Enhanced ScenarioTimeoutError to track which step was executing when timeout occurred (step name and index)
Refactored retry logic to support caller-controlled error classification via shouldRetry callback
Fixed maxFailures to properly abort all scenarios (running and pending) using signal propagation instead of throwing

Reviewed changes

Copilot reviewed 10 out of 10 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
packages/probitas-runner/errors.ts	Added static `timeoutSignal()` factory methods to both error classes with Disposable interface for cleanup; enhanced ScenarioTimeoutError constructor to accept step information
packages/probitas-runner/errors_test.ts	Added comprehensive test coverage (19 new tests) for timeout signal creation, cleanup, and error construction with step info
packages/probitas-runner/utils/retry.ts	Added `shouldRetry` callback parameter; changed to preserve raw error types (unknown); pass 1-based attempt numbers to callback function
packages/probitas-runner/utils/retry_test.ts	Added 5 new tests for `shouldRetry` callback behavior and attempt number passing
packages/probitas-runner/step_runner.ts	Integrated timeout signal factory with `using` declarations; implemented `shouldRetry` to skip timeout errors; enhanced error enrichment logic for timeout errors
packages/probitas-runner/step_runner_test.ts	Updated test expectations to match raw error types (strings instead of Error objects)
packages/probitas-runner/scenario_runner.ts	Added step index tracking and ScenarioTimeoutError enrichment with current step information when timeout occurs
packages/probitas-runner/scenario_runner_test.ts	Updated test expectations to match raw error types
packages/probitas-runner/runner.ts	Refactored maxFailures to call `controller.abort()` instead of throwing; added logic to create skip results for unexecuted scenarios; integrated timeout signal factory
packages/probitas-runner/runner_test.ts	Added 3 new tests for maxFailures behavior (sequential and parallel); added 2 new tests for timeout with step info; removed old `_internal.isTimeoutError` tests

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-01-08T14:17:05Z

packages/probitas-runner/runner.ts

+              status: "skipped",
+              metadata: toScenarioMetadata(scenario),
+              duration: 0,
+              steps: [],
+              error: signal.reason ??
+                new Skip("Skipped due to previous failures"),
+            };
+            scenarioResults.push(skipResult);
+            await this.reporter.onScenarioStart?.(skipResult.metadata);
+            await this.reporter.onScenarioEnd?.(
+              skipResult.metadata,
+              skipResult,
+            );
+            return;


This code block for creating skip results is duplicated in two places (lines 170-183 and 217-227). The logic is nearly identical except for the comment. Consider extracting this into a helper method to reduce duplication and improve maintainability.

Copilot · 2026-01-08T14:17:05Z

packages/probitas-runner/errors.ts

+  static timeoutSignal(
+    timeout: number,
+    params: {
+      stepName: string;
+      attemptNumber: number;
+    },
+  ): AbortSignal & Disposable {
+    const controller = new AbortController();
+    const startTime = performance.now();
+    const timeoutId = setTimeout(() => {
+      const elapsedMs = Math.round(performance.now() - startTime);
+      const error = new StepTimeoutError(
+        params.stepName,
+        timeout,
+        params.attemptNumber,
+        elapsedMs,
+      );
+      controller.abort(error);
+    }, timeout);
+
+    // Cleanup timeout when signal is aborted by other means
+    controller.signal.addEventListener("abort", () => {
+      clearTimeout(timeoutId);
+    }, { once: true });
+
+    // Add Disposable interface for manual cleanup
+    return Object.assign(controller.signal, {
+      [Symbol.dispose]: () => {
+        clearTimeout(timeoutId);
+      },
+    });
+  }


The same cleanup pattern is duplicated in both StepTimeoutError.timeoutSignal and ScenarioTimeoutError.timeoutSignal. Consider extracting this into a shared helper function to reduce code duplication and improve maintainability.

Copilot · 2026-01-08T14:17:05Z

packages/probitas-runner/errors.ts

+  static timeoutSignal(
+    timeout: number,
+    params: {
+      scenarioName: string;
+    },
+  ): AbortSignal & Disposable {
+    const controller = new AbortController();
+    const startTime = performance.now();
+    const timeoutId = setTimeout(() => {
+      const elapsedMs = Math.round(performance.now() - startTime);
+      const error = new ScenarioTimeoutError(
+        params.scenarioName,
+        timeout,
+        elapsedMs,
+      );
+      controller.abort(error);
+    }, timeout);
+
+    // Cleanup timeout when signal is aborted by other means
+    controller.signal.addEventListener("abort", () => {
+      clearTimeout(timeoutId);
+    }, { once: true });
+
+    // Add Disposable interface for manual cleanup
+    return Object.assign(controller.signal, {
+      [Symbol.dispose]: () => {
+        clearTimeout(timeoutId);
+      },
+    });
  }


The same cleanup pattern is duplicated in both StepTimeoutError.timeoutSignal and ScenarioTimeoutError.timeoutSignal. Consider extracting this into a shared helper function to reduce code duplication and improve maintainability.

Copilot AI review requested due to automatic review settings January 8, 2026 14:10

Copilot started reviewing on behalf of lambdalisue January 8, 2026 14:10 View session

lambdalisue enabled auto-merge January 8, 2026 14:10

lambdalisue force-pushed the fix/error branch from d8b71c4 to 12ebe2c Compare January 8, 2026 14:14

lambdalisue added 2 commits January 8, 2026 23:14

lambdalisue force-pushed the fix/error branch from 12ebe2c to a767fe9 Compare January 8, 2026 14:14

Copilot AI reviewed Jan 8, 2026

View reviewed changes

lambdalisue merged commit 16c5423 into main Jan 8, 2026
2 checks passed

lambdalisue deleted the fix/error branch January 8, 2026 14:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve timeout error handling and maxFailures behavior #19

Improve timeout error handling and maxFailures behavior #19

Uh oh!

lambdalisue commented Jan 8, 2026

Uh oh!

codecov bot commented Jan 8, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Jan 8, 2026

Uh oh!

Copilot AI Jan 8, 2026

Uh oh!

Copilot AI Jan 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Improve timeout error handling and maxFailures behavior #19

Improve timeout error handling and maxFailures behavior #19

Uh oh!

Conversation

lambdalisue commented Jan 8, 2026

Summary

Why

Test Plan

Uh oh!

codecov bot commented Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Key Changes:

Reviewed changes

Uh oh!

Copilot AI Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Jan 8, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov bot commented Jan 8, 2026 •

edited

Loading