Conversation


@wreed4 wreed4 commented May 29, 2025

  • added the change and formatted with ruff
  • added tests

Dispatch `_received_request` in asynchronous tasks inside the session's `_receive_loop`.
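A minimal sketch of the idea (toy code, not the SDK's actual implementation, which uses anyio task groups rather than plain asyncio): each incoming request is handled in its own task, so a slow handler no longer blocks later messages in the receive loop.

```python
import asyncio

# Toy illustration only: a receive loop that dispatches each incoming
# request as its own task instead of awaiting it inline, so a slow
# handler no longer serializes the whole loop.

async def handle_request(name: str, delay: float, log: list[str]) -> None:
    await asyncio.sleep(delay)      # stands in for a slow sampling round-trip
    log.append(name)

async def receive_loop(log: list[str]) -> None:
    tasks = [
        asyncio.ensure_future(handle_request("slow", 0.05, log)),
        asyncio.ensure_future(handle_request("fast", 0.01, log)),
    ]
    await asyncio.gather(*tasks)    # handlers overlap; "fast" finishes first

log: list[str] = []
asyncio.run(receive_loop(log))
print(log)  # the fast handler completes before the slow one
```

With sequential dispatch the log would always follow arrival order; with task-based dispatch, completion order depends only on each handler's own work.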

Motivation and Context

When writing MCP servers that can return large amounts of data, one workable pattern is to "map/reduce" the results: chunk up the backend response, summarize each chunk with an LLM, then combine the summaries and return that combined summary as the tool response. Sampling is the perfect tool for this, but it is currently locked into sequential execution: if I break my data into 10 chunks, I have to summarize them one after another before my tool can respond. This can lead to very long runtimes, which is unnecessary since each sampling call depends only on its own data.

This change should allow much more efficient "map/reduce" using sampling from MCP servers (without them having to implement their own LLM integration server-side).
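The map/reduce shape described above can be sketched as follows (all names are hypothetical; `summarize_chunk` stands in for a real sampling round-trip such as the SDK's `create_message`, which is not invoked here):

```python
import asyncio

# Hypothetical sketch of the map/reduce pattern: split the data,
# summarize each chunk concurrently, then combine the summaries.

async def summarize_chunk(chunk: str) -> str:
    await asyncio.sleep(0.01)           # simulated per-chunk LLM latency
    return f"summary({chunk})"

async def summarize_all(data: str, n_chunks: int = 10) -> str:
    size = max(1, len(data) // n_chunks)
    chunks = [data[i:i + size] for i in range(0, len(data), size)]
    # With task-based dispatch on the client side, the n sampling calls can
    # run concurrently, so total latency is roughly one call, not n calls.
    summaries = await asyncio.gather(*(summarize_chunk(c) for c in chunks))
    return " | ".join(summaries)

result = asyncio.run(summarize_all("abcdefghij", n_chunks=5))
print(result)
```

Under sequential sampling the total latency grows linearly with the number of chunks; with concurrent dispatch it is bounded by the slowest single chunk.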

How Has This Been Tested?

I've tested this with fast-agent, one of the only MCP clients that implements sampling. It greatly speeds up my applications.

Breaking Changes

No. Unless a client sends sampling requests concurrently (rather than awaiting each one immediately, which is the more common pattern), the behavior will not change.

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Documentation update

Checklist

  • I have read the MCP Documentation
  • My code follows the repository's style guidelines
  • New and existing tests pass locally (one existing OAuth-related test currently fails, but it does not appear related; I will see whether it fails in this PR and continue debugging)
  • I have added appropriate error handling
  • I have added or updated documentation as needed

Additional context

@wreed4
Author

wreed4 commented May 29, 2025

I believe the one failing test fails due to a race condition. When I add sleeps into the test to force the order of execution, it reliably fails even without my change, as far as I can tell. I will keep digging, but from what I can see, the second `await session.send_request` returns without ever triggering the message handler. Removing the sleeps lets it succeed every time on my machine, but adding them shows that `session.send_request` returns before the message handler is called.
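The race described above can be reproduced in miniature (toy code, not SDK internals): once the message handler runs in its own task, the response future can resolve before the handler body has executed.

```python
import asyncio

# Toy illustration of the race: the handler is dispatched as a task,
# so awaiting the response ("send_request") can return first.

async def main() -> list[str]:
    order: list[str] = []
    response: asyncio.Future[str] = asyncio.get_running_loop().create_future()

    async def message_handler() -> None:
        await asyncio.sleep(0.01)       # any await lets the response win
        order.append("handler ran")

    asyncio.ensure_future(message_handler())  # dispatched, not awaited
    response.set_result("ok")
    await response                      # "send_request" returns here
    order.append("send_request returned")
    await asyncio.sleep(0.05)           # let the handler task finish
    return order

order = asyncio.run(main())
print(order)  # the response wins the race against the handler
```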

@wreed4
Author

wreed4 commented May 29, 2025

OKAY! I've tracked down the `test_streamablehttp_client_resumption` failure to the following behavior (seemingly)...

General layout of the test:

  • create a tool called `long_running_with_checkpoints`, with which we intend to break our connection before it finishes (and resume it later)
  • register a message handler, which `shared/session.py::_receive_loop` reaches by calling `self._handle_incoming`; that calls the `client/session.py` version of the function, which calls our message handler
  • create a client
  • start the tool
  • wait for it to send the first notification, then disconnect
  • THE TOOL WILL CONTINUE RUNNING

then, the intended flow seems to be

  • reconnect to the tool
  • pick up the remaining notifications it sent us
  • profit

However, this only seems to work correctly if the tool has sent another notification before we reconnect to it. If the tool has not yet sent another notification, the call to `send_request` hangs forever.

At this point, this is well outside the scope of the change I'm trying to make; I've verified that it happens both with and without my change. But in the spirit of not breaking everyone else, I'll try to fix it as well. If folks would rather keep this out of this PR and track it as a separate bug, that works for me too; I can revert whatever I do to these files, as they're unrelated to the change I wanted to present.

@wreed4
Author

wreed4 commented May 30, 2025

I've tracked this down as far as I can and concluded that it is ultimately out of scope for this PR, even if I could figure out what's happening, which I haven't managed to. So I've added a bit more logic to make the test more reliable and opened another issue to capture this error case.
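One common way to make such a test deterministic, sketched below under assumed names (this is not the actual test change), is to wait on an explicit event with a timeout instead of sleeping for a fixed interval:

```python
import asyncio

# Hypothetical sketch: wait for "another notification arrived" via an
# event with a timeout, rather than racing against a fixed sleep.

async def demo() -> str:
    notified = asyncio.Event()

    async def server_sends_notification() -> None:
        await asyncio.sleep(0.01)       # simulated server-side delay
        notified.set()

    asyncio.ensure_future(server_sends_notification())
    try:
        await asyncio.wait_for(notified.wait(), timeout=1.0)
        return "resumed"
    except asyncio.TimeoutError:
        return "gave up"

outcome = asyncio.run(demo())
print(outcome)
```

The timeout bounds the test's runtime even when the notification never arrives, so the test fails fast instead of hanging.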


@ihrpr ihrpr added this to 🐛 🛠 Jun 26, 2025
@github-project-automation github-project-automation bot moved this to To triage in 🐛 🛠 Jun 26, 2025
gspencergoog pushed a commit to gspencergoog/mcp-python-sdk that referenced this pull request Jul 29, 2025
@felixweinberger felixweinberger added enhancement New feature or request needs more eyes Needs alignment among maintainers whether this is something we want to add labels Sep 8, 2025
Contributor

@felixweinberger felixweinberger left a comment


Hi @wreed4 thank you for this contribution! And apologies for the time it took to get back to this.

Handling these concurrently seems like an interesting idea, but it is not something we currently specify in the protocol, and it introduces additional complexity, so we want to be sure it's worthwhile adding. The current implementation also seems to lead to duplicate request handling on every request.

Given that we have an ongoing SEP for async tool execution (see: modelcontextprotocol/modelcontextprotocol#1391), I believe this falls into the same category: we'd want to elevate this to a protocol-level change rather than add it ad hoc as a feature to one SDK (but not the others).

We would likely need a SEP for this to also ensure other SDKs support this.

```python
async def _handle_received_request() -> None:
    await self._received_request(responder)
    if not responder._completed:  # type: ignore[reportPrivateUsage]
        await self._handle_incoming(responder)
```

I think this is duplicating line 366 below, so we might be calling `self._handle_incoming` twice for every request? Should we be removing lines 365-366?
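The double-handling concern can be demonstrated with a toy reduction (hypothetical names, not SDK code): if the dispatched task and an inline fallback both call the handler, every request is handled twice.

```python
import asyncio

# Toy reduction of the reviewer's concern: a dispatched task plus a
# leftover inline call means the handler runs twice per request.

calls: list[str] = []

async def handle_incoming(msg: str) -> None:
    calls.append(msg)

async def dispatch(msg: str, *, duplicate_bug: bool) -> None:
    task = asyncio.ensure_future(handle_incoming(msg))  # new task-based path
    if duplicate_bug:
        await handle_incoming(msg)  # old inline call left in place
    await task

asyncio.run(dispatch("req", duplicate_bug=True))
print(len(calls))  # the single request was handled twice
```

Removing the inline call (the `duplicate_bug` branch) restores exactly-once handling.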

@felixweinberger
Contributor

Closing for now as discussed here this should likely be a SEP first.

@github-project-automation github-project-automation bot moved this from To triage to Done in 🐛 🛠 Sep 26, 2025