feat: optional per-invocation MCP tool call cap by dmartinol · Pull Request #79 · RHEcosystemAppEng/google-lightspeed-agent

dmartinol · 2026-04-08T12:45:02Z

Summary

Adds MAX_TOOL_CALLS_PER_INVOCATION (default 0 = disabled) so operators can cap how many MCP tools may start in one ADK agent run. When the limit is exceeded, UsageTrackingPlugin.before_tool_callback short-circuits (ADK error dict); the tool is not executed and after_tool_callback does not run, so blocked calls are not persisted as usage tool_calls.

Motivation

HTTP rate limiting (Redis) bounds incoming A2A requests per principal.
Metering records completed work for billing/analytics.
This layer bounds depth of a single run (model–tool loops), addressing a different abuse pattern than request rate limits.

Implementation

Settings.max_tool_calls_per_invocation with env MAX_TOOL_CALLS_PER_INVOCATION.
Per-invocation counter in UsageTrackingPlugin (invocation_id from ToolContext), guarded by asyncio.Lock.
after_run_callback drops counters for the finished invocation to avoid unbounded memory growth.
Log line on block includes call count, limit, and attempted tool= for clarity.

Limitations (documented)

Counters are in-memory per process — not shared across Cloud Run instances, Podman replicas, or multi-worker processes. docs/metering.md describes mitigations and a future Redis/DB direction for shared caps.

Testing

make lint and make test pass.
New tests in tests/test_usage_plugin.py: disabled limit (0), block after N starts, after_run_callback resets budget.

Documentation and deploy wiring

docs/metering.md (per-invocation tool budget, diagram, comparison to rate limits, multi-instance caveat).
docs/rate-limiting.md (three-layer table).
docs/configuration.md (usage table; also lists existing metering env vars in the table).
.env.example, CLAUDE.md, deploy/cloudrun/service.yaml, deploy/podman/lightspeed-agent-configmap.yaml.

Made with Cursor

Introduces the `MAX_TOOL_CALLS_PER_INVOCATION` environment variable to limit the number of MCP tool executions per agent run. This feature is enforced in-process and is not shared across replicas. Updates to documentation and configuration files reflect this new capability, along with implementation in the `UsageTrackingPlugin` to manage tool call limits during execution. Signed-off-by: Daniele Martinoli <dmartino@redhat.com>

dmartinol requested review from IlonaShishov, luis5tb and yuvalk as code owners April 8, 2026 12:45

dmartinol mentioned this pull request Apr 8, 2026

feat(a2a): GuardrailPlugin for org argument binding (PR #73 follow-up) #78

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: optional per-invocation MCP tool call cap#79

feat: optional per-invocation MCP tool call cap#79
dmartinol wants to merge 1 commit intoRHEcosystemAppEng:mainfrom
dmartinol:feat/usage-tool-call-cap

dmartinol commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dmartinol commented Apr 8, 2026

Summary

Motivation

Implementation

Limitations (documented)

Testing

Documentation and deploy wiring

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant