feat(e001-s002): /plan routing with Contract-Net scoring and HTTP execution by Dumidu1212 · Pull Request #5 · Dumidu1212/Self-Optimizing-Agent-Coordination-Response-System

Dumidu1212 · 2025-10-12T07:30:29Z

Implements candidate filtering, scoring, selection, fallback, and execution via HTTP. Adds /plan route, planner metrics, and e2e tests. Swagger updated via route schema.

Summary by CodeRabbit

New Features
- Introduced planning capability with a POST /plan endpoint to score, select, and optionally execute tools with timeouts; returns candidates, selection, and execution details.
- Added HTTP-based tool execution with robust timeout and error handling.
- Added tracing identifiers for plan executions.
Metrics
- Exposed /metrics with Prometheus metrics for tools loaded, planner bids, selections, fallbacks, and execution latency.
Documentation
- Updated API documentation version to 0.2.0.
Tests
- Added end-to-end and unit tests for planning and fallback behavior.
Chores
- Updated Jest type definitions (dev dependency).

…, e2e and unit tests

coderabbitai · 2025-10-12T07:30:50Z

Walkthrough

Adds a planning/execution feature: new Planner with HTTP executor, tracing, and metrics; exposes POST /plan route; updates app/server wiring to inject planner; increments Swagger version; introduces metrics endpoint exports; adds e2e and unit tests; bumps @types/jest dev dependency.

Changes

Cohort / File(s)	Summary
Planner feature & wiring `src/app.ts`, `src/server.ts`, `src/routes/plan.ts`	App/server now require and pass a planner. New POST `/plan` route delegates to `deps.planner.plan(...)`. Swagger API version updated to 0.2.0.
Planning core & execution `src/planner/planner.ts`, `src/executors/httpExecutor.ts`, `src/tracing/traceStore.ts`	Introduces Planner class with candidate scoring, selection, execution with fallback and timeouts; HTTP executor for tool endpoints; in-memory trace store for events.
Metrics `src/metrics/metrics.ts`	Adds gauges/counters/histograms for tools loaded, planner bids, selections, fallbacks, and execution latency; exports `metricsRoutes` for `/metrics`.
Tests `tests/e2e/plan.e2e.test.ts`, `tests/unit/planner.fallback.test.ts`	New e2e test for `/plan` execution path and unit test verifying fallback behavior on failure.
Dev dependency `package.json`	Bumps `@types/jest` from `^29.5.14` to `^30.0.0`.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  participant C as Client
  participant A as Fastify App (/plan)
  participant P as Planner
  participant R as Registry
  participant S as Scorer
  participant E as HTTP Executor
  participant H as HTTP Tool
  participant T as TraceStore
  participant M as Metrics

  C->>A: POST /plan { capability, input, execute, timeout_ms }
  A->>P: plan(ctx)
  P->>T: record("plan_start", ctx)
  P->>R: getTools(capability)
  R-->>P: tools[]
  loop candidates
    P->>S: score(tool, ctx)
    S-->>P: score
    P->>M: inc planner_bids_total{capability, tool}
  end
  alt execute == true
    loop ranked candidates
      P->>T: record("attempt", tool)
      P->>E: execute(tool, input, overallAbort)
      E->>H: HTTP request (timeout/abort aware)
      alt 2xx
        H-->>E: response JSON
        E-->>P: success {output, latency}
        P->>M: inc planner_selection_total{capability, tool}
        P->>M: observe planner_execution_latency_ms{tool}
        P-->>A: 200 { selected, execution: success, traceId }
      else error/timeout
        H-->>E: error/timeout
        E-->>P: failure/timeout
        P->>M: inc planner_fallbacks_total{capability}
        P->>T: record("fallback", reason)
      end
    end
    P-->>A: 200 { error: ALL_CANDIDATES_FAILED, traceId } 
  else execute == false
    P-->>A: 200 { selected (plan only), traceId }
  end
  A-->>C: JSON response

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~60 minutes

Possibly related PRs

feat(e001-s001): coordinator core (registry model/validation, routes,… #3 — Also edits app wiring and metrics exports, likely aligned with introducing planner dependencies and metrics routes.
feat(e001-s002): add planner contracts and simple scorer with unit tests #4 — Adds planner contracts and SimpleScorer, directly related to the Planner integration here.
feat(e001-s001): tool registry S001 #1 — Touches app initialization and metrics/routes, overlapping with this PR’s app and metrics changes.

Poem

A rabbit plots with careful cheer,
Plans and traces crystal-clear.
If tools run slow, we hop to two—
Fallback trails to pull us through.
Metrics nibble, ticks in time,
HTTP hums, success in rhyme.
Thump! The plan returns sublime.

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 33.33% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title Check	✅ Passed	The title succinctly captures the primary feature added—a new /plan routing endpoint—and highlights the key components “Contract-Net scoring” and “HTTP execution” introduced in this changeset.

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feat/e001-s002-scoring

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (3)

tests/unit/planner.fallback.test.ts (1)
41-44: Assert the actual fallback tool.

Right now the test only checks that selected.toolId is defined, so it would still pass if the planner kept picking t1. Asserting the ID is 't2' validates that the fallback really occurred.

Apply this diff:
-    expect(res.selected?.toolId).toBeDefined();
+    expect(res.selected?.toolId).toBe('t2');
src/routes/plan.ts (2)
15-26: Consider requiring the capability field in the request schema.

While the planner handles missing capability gracefully (returning INPUT_INVALID), requiring it at the schema level would provide faster feedback and clearer API semantics. The current permissive schema allows requests without a capability to reach the planner.

If you want to enforce capability at the schema level, apply this diff:
         body: {
           type: 'object',
           additionalProperties: false,
+          required: ['capability'],
           properties: {
             capability: { type: 'string' },
64-67: Document the HTTP 200 response for application-level failures.

The handler always returns HTTP 200, even when the planner reports failure (e.g., NO_CANDIDATES, ALL_CANDIDATES_FAILED). This is a valid design choice (application-level errors in the response body rather than HTTP error codes), but it should be documented for API consumers to understand the error handling model.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 22a4603 and 1b84d20.

⛔ Files ignored due to path filters (1)

package-lock.json is excluded by !**/package-lock.json

📒 Files selected for processing (10)

package.json (1 hunks)
src/app.ts (1 hunks)
src/executors/httpExecutor.ts (1 hunks)
src/metrics/metrics.ts (1 hunks)
src/planner/planner.ts (1 hunks)
src/routes/plan.ts (1 hunks)
src/server.ts (2 hunks)
src/tracing/traceStore.ts (1 hunks)
tests/e2e/plan.e2e.test.ts (1 hunks)
tests/unit/planner.fallback.test.ts (1 hunks)

🧰 Additional context used

🧬 Code graph analysis (7)

src/app.ts (5)

src/registry/service.ts (1)

IRegistryService (4-4)

src/planner/contracts.ts (1)

IPlanner (63-65)

src/registry/model.ts (2)

toolSchema (24-71)

registrySchema (73-81)

src/routes/tools.ts (1)

toolsRoutes (8-48)

src/routes/plan.ts (1)

planRoutes (9-69)

src/planner/planner.ts (5)

src/planner/contracts.ts (6)

IPlanner (63-65)

IScorer (49-51)

IToolExecutor (54-60)

PlanContext (10-23)

PlanResult (40-46)

JsonRecord (7-7)

src/registry/service.ts (1)

IRegistryService (4-4)

src/tracing/traceStore.ts (1)

TraceStore (16-34)

src/executors/httpExecutor.ts (1)

execute (9-59)

src/metrics/metrics.ts (3)

plannerBidsTotal (14-18)

plannerSelectionTotal (21-25)

plannerFallbacksTotal (28-32)

tests/e2e/plan.e2e.test.ts (4)

src/planner/contracts.ts (2)

IToolExecutor (54-60)

ExecutionResult (32-37)

src/registry/service.ts (1)

IRegistryService (4-4)

src/planner/planner.ts (1)

Planner (26-116)

src/tracing/traceStore.ts (1)

TraceStore (16-34)

src/server.ts (5)

src/planner/scoring.simple.ts (1)

SimpleScorer (40-54)

src/executors/httpExecutor.ts (1)

HttpExecutor (8-60)

src/tracing/traceStore.ts (1)

TraceStore (16-34)

src/planner/planner.ts (1)

Planner (26-116)

src/app.ts (1)

buildApp (14-30)

tests/unit/planner.fallback.test.ts (5)

src/planner/contracts.ts (2)

IToolExecutor (54-60)

ExecutionResult (32-37)

src/registry/service.ts (1)

IRegistryService (4-4)

src/planner/scoring.simple.ts (1)

SimpleScorer (40-54)

src/tracing/traceStore.ts (1)

TraceStore (16-34)

src/planner/planner.ts (1)

Planner (26-116)

src/executors/httpExecutor.ts (1)

src/planner/contracts.ts (3)

IToolExecutor (54-60)

JsonRecord (7-7)

ExecutionResult (32-37)

src/routes/plan.ts (1)

src/planner/contracts.ts (2)

IPlanner (63-65)

PlanContext (10-23)

🔇 Additional comments (12)

src/server.ts (3)

6-9: LGTM! Clean planner component imports.

The imports for SimpleScorer, Planner, HttpExecutor, and TraceStore align with the new planning architecture introduced in this PR.

25-28: LGTM! Proper dependency injection setup.

The planner is correctly instantiated with all required dependencies (registry, scorer, executor, traces) following the constructor signature from src/planner/planner.ts.

30-30: LGTM! Updated buildApp call aligns with new signature.

The buildApp call now correctly passes both registry and planner, matching the updated signature in src/app.ts.

tests/e2e/plan.e2e.test.ts (1)

48-58: LGTM! Well-structured end-to-end test.

The test appropriately validates the planning flow: capability matching, candidate scoring, selection logic, and execution status. The assertions cover all key aspects of the /plan endpoint behavior.

src/app.ts (4)

7-9: LGTM! Imports align with the new planning feature.

The new imports for planRoutes and IPlanner support the integration of the planning endpoint into the application.

11-14: LGTM! Clear documentation and updated signature.

The comment helpfully describes the function's purpose, and the signature change correctly adds the planner dependency alongside registry.

20-20: LGTM! Appropriate version bump for new feature.

Bumping the Swagger API version from 0.1.0 to 0.2.0 correctly reflects the addition of the /plan endpoint as a new minor feature.

24-26: LGTM! Clean route registration with proper dependency injection.

The route registration correctly separates concerns: toolsRoutes receives only the registry, planRoutes receives only the planner, and metricsRoutes requires no dependencies. This follows good dependency injection practices.

src/metrics/metrics.ts (4)

7-18: LGTM! Well-documented planner bid metrics.

The toolsLoaded, toolLoadErrors, and plannerBidsTotal metrics are clearly documented and appropriately defined with helpful labels (capability, tool) for tracking scoring activity.

20-32: LGTM! Clear selection and fallback tracking metrics.

The plannerSelectionTotal and plannerFallbacksTotal counters provide good observability into the planner's decision-making and fallback behavior. The label choices allow tracking by capability and tool.

34-40: LGTM! Appropriate histogram configuration for execution latency.

The plannerExecutionLatencyMs histogram has reasonable buckets (50ms to 6.4s) that cover typical tool execution times. The exponential bucket distribution will provide good resolution for percentile calculations.

42-47: LGTM! All metrics properly registered.

All newly defined planner metrics are correctly registered with the registry, ensuring they will be exposed via the /metrics endpoint.

coderabbitai · 2025-10-12T07:36:18Z

src/planner/planner.ts

+      plannerFallbacksTotal.labels({ capability }).inc();
+      this.traces.record(traceId, 'fallback', { toolId: tool.id, error: execRes.error, status: execRes.status });
+      // If overall deadline elapsed, break early
+      if (overallController.signal.aborted) break;
+    }
+
+    // No candidate succeeded
+    result.execution = { status: 'failure', error: 'ALL_CANDIDATES_FAILED' };
+    this.traces.record(traceId, 'failure', result.execution);
+    return result;


⚠️ Potential issue | 🟠 Major

Propagate overall timeout instead of ALL_CANDIDATES_FAILED.

When the overall deadline aborts overallController, we break out of the loop but later report {status:'failure', error:'ALL_CANDIDATES_FAILED'}. Clients can’t distinguish a plan-level timeout from actual candidate exhaustion, and traces/metrics misclassify the outcome. Surface the timeout immediately when overallController.signal.aborted so the result (and trace) reflects the real failure mode.

Apply this diff:

plannerFallbacksTotal.labels({ capability }).inc(); this.traces.record(traceId, 'fallback', { toolId: tool.id, error: execRes.error, status: execRes.status }); - // If overall deadline elapsed, break early - if (overallController.signal.aborted) break; + // If the overall deadline elapsed, surface timeout immediately + if (overallController.signal.aborted) { + const reason = String((overallController.signal as unknown as { reason?: unknown }).reason ?? 'overall-timeout'); + result.execution = { status: 'timeout', error: reason }; + this.traces.record(traceId, 'timeout', { reason }); + return result; + } } - // No candidate succeeded - result.execution = { status: 'failure', error: 'ALL_CANDIDATES_FAILED' }; + // No candidate succeeded before the deadline + result.execution = { status: 'failure', error: 'ALL_CANDIDATES_FAILED' }; this.traces.record(traceId, 'failure', result.execution); return result;

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

plannerFallbacksTotal.labels({ capability }).inc();

this.traces.record(traceId, 'fallback', { toolId: tool.id, error: execRes.error, status: execRes.status });

// If overall deadline elapsed, break early

if (overallController.signal.aborted) break;

}

// No candidate succeeded

result.execution = { status: 'failure', error: 'ALL_CANDIDATES_FAILED' };

this.traces.record(traceId, 'failure', result.execution);

return result;

plannerFallbacksTotal.labels({ capability }).inc();

this.traces.record(traceId, 'fallback', { toolId: tool.id, error: execRes.error, status: execRes.status });

// If the overall deadline elapsed, surface timeout immediately

if (overallController.signal.aborted) {

const reason = String((overallController.signal as unknown as { reason?: unknown }).reason ?? 'overall-timeout');

result.execution = { status: 'timeout', error: reason };

this.traces.record(traceId, 'timeout', { reason });

return result;

}

}

// No candidate succeeded before the deadline

result.execution = { status: 'failure', error: 'ALL_CANDIDATES_FAILED' };

this.traces.record(traceId, 'failure', result.execution);

return result;

coderabbitai · 2025-10-12T07:36:18Z

tests/e2e/plan.e2e.test.ts

+class StubExec implements IToolExecutor {
+  async execute(tool: Tool): Promise<ExecutionResult> {
+    // Pretend both succeed; latencies differ by name
+    const latency = tool.id === 'fast' ? 10 : 50;
+    return { status: 'success', latency_ms: latency, output: { id: tool.id } };
+  }
+}


⚠️ Potential issue | 🔴 Critical

Fix the method signature to match IToolExecutor interface.

The execute method signature is missing required parameters. The IToolExecutor interface expects three parameters: tool, input, and overallAbort.

Apply this diff to fix the signature:

class StubExec implements IToolExecutor { - async execute(tool: Tool): Promise<ExecutionResult> { + async execute(tool: Tool, _input: JsonRecord, _overallAbort: AbortSignal): Promise<ExecutionResult> { // Pretend both succeed; latencies differ by name const latency = tool.id === 'fast' ? 10 : 50; return { status: 'success', latency_ms: latency, output: { id: tool.id } }; } }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

class StubExec implements IToolExecutor {

async execute(tool: Tool): Promise<ExecutionResult> {

// Pretend both succeed; latencies differ by name

const latency = tool.id === 'fast' ? 10 : 50;

return { status: 'success', latency_ms: latency, output: { id: tool.id } };

}

}

class StubExec implements IToolExecutor {

async execute(tool: Tool, _input: JsonRecord, _overallAbort: AbortSignal): Promise<ExecutionResult> {

// Pretend both succeed; latencies differ by name

const latency = tool.id === 'fast' ? 10 : 50;

return { status: 'success', latency_ms: latency, output: { id: tool.id } };

}

}

🤖 Prompt for AI Agents

In tests/e2e/plan.e2e.test.ts around lines 23 to 29, the StubExec.execute method currently declares only (tool: Tool) but must match IToolExecutor and accept (tool, input, overallAbort); update the signature to include the missing parameters and their types (e.g., tool: Tool, input: unknown, overallAbort?: AbortSignal), keep the existing body logic, and if input or overallAbort are unused, name them with a leading underscore or explicitly ignore them to satisfy lint/type checks.

feat(e001-s002): planner, HTTP executor, /plan route, planner metrics…

1b84d20

…, e2e and unit tests

coderabbitai bot reviewed Oct 12, 2025

View reviewed changes

Dumidu1212 merged commit e15296f into main Oct 12, 2025
1 of 3 checks passed

coderabbitai bot mentioned this pull request Oct 12, 2025

Resolved all testing erors #6

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(e001-s002): /plan routing with Contract-Net scoring and HTTP execution#5

feat(e001-s002): /plan routing with Contract-Net scoring and HTTP execution#5
Dumidu1212 merged 1 commit intomainfrom
feat/e001-s002-scoring

Dumidu1212 commented Oct 12, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Oct 12, 2025 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

coderabbitai bot Oct 12, 2025

Uh oh!

coderabbitai bot Oct 12, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Dumidu1212 commented Oct 12, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Oct 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Poem

Pre-merge checks and finishing touches

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 12, 2025

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Oct 12, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Dumidu1212 commented Oct 12, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Oct 12, 2025 •

edited

Loading