Feature: Non-Blocking call_tool and request state externalisation #1209

davemssavage · 2025-07-28T15:53:45Z

This patch adds a non-blocking mode for running tools and introduces a RequestStateManager plugin API. Together these allow long-running tasks to be rejoined by the client, including across process restarts.

Motivation and Context

Rather than introducing new protocol messages as per modelcontextprotocol/modelcontextprotocol#617, this patch uses the existing protocol messages with client-side changes. The client application can submit a call request and later join that request to get the response and/or any progress notifications that may have occurred. This works on top of the existing session resume process, so a client can submit a task in one process and rejoin it from another if a persistent implementation of RequestStateManager is used. (This has been tested using a Redis implementation, which is not included in this patch.)

Long-running tasks currently consume resources on the client indefinitely, blocking until they return. Tool calls can time out, which returns control to the client, but this leads to a messy control flow that exposes internal details of the protocol implementation: the caller has to check whether the error code is httpx.codes.REQUEST_TIMEOUT.

This patch instead introduces new methods request_call_tool and join_call_tool, which handle the various complexities of resume tokens using state stored in a pluggable RequestStateManager. An InMemoryRequestStateManager is provided as the default; other state managers can be passed to the ClientSession to persist state outside of process memory.
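To make the idea of a pluggable state manager concrete, here is a minimal sketch of the general pattern. It is not the interface from this patch: `RequestState`, `StateStore`, `InMemoryStateStore`, and the `save`/`load`/`delete` methods are hypothetical names chosen for illustration, and the actual RequestStateManager may look quite different.

```python
import asyncio
from dataclasses import dataclass
from typing import Dict, Optional, Protocol


@dataclass
class RequestState:
    """Hypothetical record of what a client needs in order to rejoin a call."""

    request_id: str
    tool_name: str
    resume_token: Optional[str] = None  # e.g. the last event id seen


class StateStore(Protocol):
    """Hypothetical plugin interface, not the PR's actual RequestStateManager."""

    async def save(self, state: RequestState) -> None: ...
    async def load(self, request_id: str) -> Optional[RequestState]: ...
    async def delete(self, request_id: str) -> None: ...


class InMemoryStateStore:
    """Default-style store: state is lost when the process exits."""

    def __init__(self) -> None:
        self._states: Dict[str, RequestState] = {}

    async def save(self, state: RequestState) -> None:
        self._states[state.request_id] = state

    async def load(self, request_id: str) -> Optional[RequestState]:
        return self._states.get(request_id)

    async def delete(self, request_id: str) -> None:
        self._states.pop(request_id, None)


async def demo() -> Optional[RequestState]:
    store: StateStore = InMemoryStateStore()
    # "Submit" side: remember the request and the token needed to resume it.
    await store.save(RequestState("req-1", "slow_tool", resume_token="evt-7"))
    # "Join" side: possibly a different session object sharing the same store.
    state = await store.load("req-1")
    await store.delete("req-1")  # clean up once the result has been consumed
    return state
```

A persistent variant would implement the same three methods against an external store, which is what would let a second process load the state saved by the first.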

How Has This Been Tested?

This has been unit tested, and also tested in a server application that managed multiple sessions without blocking the server-side threads.

Breaking Changes

None

Types of changes

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation update

Checklist

I have read the MCP Documentation
My code follows the repository's style guidelines
New and existing tests pass locally
I have added appropriate error handling
I have added or updated documentation as needed

Additional context

To handle some complex logic around resumption, this patch introduces a new ResumeCapability which can be returned during initialisation. It is currently hard-coded to assume that server sessions are resumable if the streamable_http protocol is used. It may be worthwhile extending this along the lines of modelcontextprotocol/modelcontextprotocol#617, where servers can indicate how long clients might expect a session to be resumable for.

Testing on a non-trivial application has highlighted that the server needs to send at least one progress notification before the start_request call can return a request_id. At least one event is needed so that the sse.event_id is passed back to the client and the resume token can be collected. This is a little opaque and relies on the client having passed a progress callback to the start_call_tool method. A new protocol notification sent from the server to indicate that it has received the request could be a viable alternative; however, one of the benefits of this patch over modelcontextprotocol/modelcontextprotocol#617 is that it avoids new protocol messages.
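The dependency on a first event can be illustrated with a small sketch: the caller waits a bounded time for the first event id and gets `None` back if the server stays silent. This uses only the standard-library `asyncio` as a stand-in and is an assumption about the shape of the logic, not the code in this patch; `wait_for_resume_token` and `demo` are hypothetical names.

```python
import asyncio
from typing import Optional, Tuple


async def wait_for_resume_token(
    first_event: "asyncio.Future[str]", timeout: float
) -> Optional[str]:
    """Return the first event id, or None if nothing arrives within `timeout`."""
    try:
        # shield() keeps the underlying future alive if the wait times out,
        # so a later attempt could still observe the event.
        return await asyncio.wait_for(asyncio.shield(first_event), timeout)
    except asyncio.TimeoutError:
        return None


async def demo() -> Tuple[Optional[str], Optional[str]]:
    loop = asyncio.get_running_loop()

    # A server that sends a progress event shortly after the request.
    chatty: "asyncio.Future[str]" = loop.create_future()
    loop.call_later(0.01, chatty.set_result, "event-1")

    # A server that never sends any event for the request.
    silent: "asyncio.Future[str]" = loop.create_future()

    got = await wait_for_resume_token(chatty, timeout=0.5)
    missed = await wait_for_resume_token(silent, timeout=0.05)
    return got, missed
```

Returning `None` rather than raising keeps the timeout on the ordinary control path, so the caller can decide whether to retry, cancel, or fall back to a blocking call.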

felixweinberger · 2025-09-23T10:40:18Z

This seems closely related to modelcontextprotocol/modelcontextprotocol#1391 where there's an ongoing discussion about the correct abstraction for long-running or async tools.

In order to support this we'll likely need to align on the right path forward for implementation - if we end up adding this at the protocol layer we'll likely want to leverage any messages provided there.

felixweinberger · 2025-09-30T10:48:54Z

Converting this to draft for now to remove it from the review queue while we wait for the outcome of modelcontextprotocol/modelcontextprotocol#1391

davemssavage added 9 commits July 12, 2025 05:54

add methods to enable call tool requests to be started and joined at a later state and cancelled

4165200

refactor args for clearer meaning, use error vs returning none on timeout

04ff73a

add resume logic to request/join call_tool functions

288ebe3

Remove None as valid return type from join_call_tool, fix typo ImMemory -> InMemory

40028da

send resume on init rather than part of join, refactor resume to be global to session rather than per request (read the spec)

161da46

fix import error

aa2cbec

Refactor code to send resume as part of join call rather than it, this results in the response being consumed prior to the join, also added a capability that identifies whether the server/transport supports resumption that is passed during initialisation

7329cba

simplify token capture using events rather than streams, add test for timeout on join and subsequent rejoin

e4c25b7

avoid exceptions during join call tool on timeout as this is expected behaviour use None when no result retrieved instead

79f3c4e

davemssavage requested review from a team and felixweinberger July 28, 2025 15:53

davemssavage mentioned this pull request Jul 28, 2025

[notes] Long running tools/ async tools/ resumability modelcontextprotocol/modelcontextprotocol#982

Open

davemssavage added 4 commits July 28, 2025 16:00

Merge branch 'main' into feature/call-futures

6c47890

uv ruff fixes

f262bb6

add assert for pyright checks

c5eab90

update test description

79eb3c9

davemssavage mentioned this pull request Jul 30, 2025

Resume tokens for long-running operations modelcontextprotocol/modelcontextprotocol#1003

Open

davemssavage added 3 commits August 16, 2025 12:50

pass related request id on progress to allow this to trigger event id in sse

a8ffd71

ruff format fixes

cca5e34

Merge branch 'main' into feature/call-futures

5750c32

davemssavage mentioned this pull request Aug 16, 2025

feature: Async support without using Resource modelcontextprotocol/modelcontextprotocol#650

Draft

9 tasks

davemssavage added 5 commits August 18, 2025 06:42

use move on after rather than fail_after to simplify code for the same result

f1a973a

add timeout to request_call_tool to enable clients to unblock if server doesn't send an event in a reasonable time period

746d3b8

ruff format fixes

92a6ead

fix broken test

0575f58

fix broken tests due to new tool being added

06e356f

felixweinberger added the enhancement, pending SEP approval, and needs sync labels Sep 23, 2025

felixweinberger added the needs more eyes label Sep 23, 2025

felixweinberger removed the needs more eyes label Sep 30, 2025

felixweinberger marked this pull request as draft September 30, 2025 10:48
