Support MCP resources, multimodal, unstructured-* with integration tests by afsalthaj · Pull Request #2918 · golemcloud/golem

afsalthaj · 2026-03-03T05:29:00Z

Also updated with #2980 (this required more testing and that took some time)

Note that, I am not doing any kebab case conversions to make things "look" good. I will do that separately, because it will hurt ongoing changes, and need a separate task for it. Things work without that now.

The whole MCP changes right now covers all features except prompt. It took a lot of re-iterations to make things work with MCP clients. OpenAI playground worked with tools but not with resources (because it didn't support).

Claude Desktop and MCP inspector are the new one I tested. Now I can see golem within Claude Desktop successfully. Claude Desktop actually helped with some real bugs.

Here is a screenshot of Claude configured with Golem (thought it was impossible until I came to know about https://www.npmjs.com/package/mcp-remote). Pasting the config here (might be helpful for docs)

In claude_desktop_config.json and restart desktop:

{
  "mcpServers": {
    "golem": {
      "command" : "npx",
      "args" : [ 
         "mcp-remote",
         "http://localhost:9007/mcp",
         "--header",
         "Host: localhost:9007",
         "--allow-http"
       
      ]
    }
  }
}

These were not running for a major time of this PR in the draft state.

MCP inspector also works similar to Claude Desktop. Open AI doesn't because, they don't show options for resources.

I also tested resources, and MCP inspector is the best test that I could do, and it understood both resource template and concrete resource as mentioned in the MCP protocol.

Note

Also note that, everything in Golem doesn't have exact one to one with MCP as mentioned in the spec written by John. We are making the best approximations here. And whatever you see in this PR, has been manually tested with popular clients and making changes will need retesting.

I would also like to test more but only as part of release tests that will be done later. And as part of bug fixes.

Static Resource

It implies the resource is part of a cluster singleton

Template Resource

It implies resource depends on the identity of your agent

These are tested with official MCP Inspector (the best client for testing), and invoked these tools. Before this PR, those images never existed

Multimodal - A weather report + weather image together

UnstructuredBinary: A snow fall image, given a city

Now it always render images, instead of at-times base64, or at times MCP client being smart and end up doing erroring out.

Unstrucured-Text report (there is nothing much here)

golem-worker-service/src/mcp/agent_mcp_resource.rs

vigoo · 2026-03-11T08:33:01Z

golem-common/src/base_model/agent.rs

-    feature = "full",
-    derive(IntoValue, FromValue, desert_rust::BinaryCodec)
-)]
+#[cfg_attr(feature = "full", derive(desert_rust::BinaryCodec))]


Why is this changed to manual implementation? Revert if possible

Automatic didn't work for functions that returned multimodal if I remember correctly

So just re-tested and yes I need this.

Please explain "did not work". We can fix the deriver.

So, WIT defines multimodal as list<tuple<string, element-value>>, and the actual runtime value is Value::Tuple. Not Value::Record.

What's in here is UntypedNamedElementValue {name, value} and the derive macro for FromVlaue will produce something that expects Value::Record.

So, ended up having Expected Record value with 2 fields, got Tuple([String("Text"), Variant { ... }]).

I think, its not a bug in the deriver to fix.

One thing is untyped-named-element-value is used only in multimodal..No other wit types ends up being untyped-named-element-value.. so may be a rename will solve it, but i don't want to do that.

Also note that this is nothing related to MCP. Invoking a function that returns multimodal didn't work at all

golem-worker-service/src/mcp/invoke/constructor_param_extraction.rs

golem-worker-service/src/mcp/invoke/resource.rs

test-components/agent-mcp/golem.yaml

* implement prompt hints * reformat code * fix integration tests * fix a bug related to validation of output schema in mcp inspector (#2969)

…ith fixed prompts

afsalthaj · 2026-03-14T22:12:54Z

Also merged #2980 into this.

* implement prompt hints * reformat code * fix integration tests * fix a bug related to validation of output schema in mcp inspector * fix multimodal images * make sure to render images * Reformat code * fix compilation errors and cleanup * start fixing tests * start fixing tests * reformat code * reformat code * fix all unit tests

support mcp resources

ef9f0a6

afsalthaj changed the title ~~support mcp resources~~ Support MCP resources Mar 3, 2026

afsalthaj added 3 commits March 3, 2026 16:30

support mcp resources

cc5cac4

rename invoke functions in mcp

5e2a05e

rename invoke functions in mcp

abb8cb6

afsalthaj commented Mar 3, 2026

View reviewed changes

golem-worker-service/src/mcp/agent_mcp_resource.rs Show resolved Hide resolved

afsalthaj added 5 commits March 5, 2026 22:36

Merge branch 'main' into resource_support

ba179c2

clean up

4ccd40e

reformat

494a58a

make tools consistent with resources

6a3a3c2

handle unstructured binary

a669faa

afsalthaj changed the title ~~Support MCP resources~~ Support MCP resources, and other complex structures Mar 5, 2026

afsalthaj added 17 commits March 6, 2026 02:04

handle unstructured text

271422a

remove the need of generic

39b9fc3

rename

c9316f0

update multimodal

77ef3a6

fix multimodal

c67ed8e

reformat

d9beea7

fix multimodal invokes

7957bc2

fix bugs

3fdf139

fix more bugs

2fb66bb

fix more bugs

c0a863e

add integration tests

87fa89c

license docs

753a971

fix some more bugs related to tool results

9dbfb44

fix a lot of more bugs after more testing with different MCP clients

d5489db

reformat code

61a2d51

fix multimodal invokes

26c7fe1

fix integration tests

ab15503

afsalthaj marked this pull request as ready for review March 11, 2026 06:00

afsalthaj added 2 commits March 11, 2026 17:02

Merge branch 'main' into resource_support

fe1c31a

resolve conflicts

5519dfa

afsalthaj changed the title ~~Support MCP resources, and other complex structures~~ Support MCP resources, multimodal, unstructured-* with integration tests Mar 11, 2026

afsalthaj added 3 commits March 11, 2026 17:27

resolve conflicts

92e28f2

fix integration tests

c16dedc

make sure all tests pass

f36e506

vigoo reviewed Mar 11, 2026

View reviewed changes

golem-worker-service/src/mcp/invoke/constructor_param_extraction.rs Show resolved Hide resolved

vigoo reviewed Mar 11, 2026

View reviewed changes

golem-worker-service/src/mcp/invoke/resource.rs Show resolved Hide resolved

vigoo reviewed Mar 11, 2026

View reviewed changes

golem-worker-service/src/mcp/invoke/resource.rs Show resolved Hide resolved

vigoo reviewed Mar 11, 2026

View reviewed changes

test-components/agent-mcp/golem.yaml Show resolved Hide resolved

afsalthaj added 4 commits March 11, 2026 21:09

add tests for invoke modules

d3b3a80

fix test component and update build scripts

20bc4d6

Merge branch 'main' into resource_support

d4e7159

Merge branch 'main' into resource_support

6f6be23

afsalthaj mentioned this pull request Mar 12, 2026

Add prompts in MCP #2961

Merged

Add prompts in MCP (#2961)

18ad958

* implement prompt hints * reformat code * fix integration tests * fix a bug related to validation of output schema in mcp inspector (#2969)

vigoo approved these changes Mar 13, 2026

View reviewed changes

afsalthaj added 2 commits March 15, 2026 07:53

Merge branch 'main' into resource_support

2dbc155

Make sure unstructured-binary and multimodal are rendered correctly w…

df3ab6b

…ith fixed prompts

afsalthaj merged commit 489e651 into main Mar 14, 2026
54 of 55 checks passed

afsalthaj deleted the resource_support branch March 14, 2026 23:38

github-actions bot locked and limited conversation to collaborators Mar 14, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support MCP resources, multimodal, unstructured-* with integration tests#2918

Support MCP resources, multimodal, unstructured-* with integration tests#2918
afsalthaj merged 39 commits intomainfrom
resource_support

afsalthaj commented Mar 3, 2026 •

edited

Loading

Uh oh!

Uh oh!

vigoo Mar 11, 2026

Uh oh!

afsalthaj Mar 11, 2026

Uh oh!

afsalthaj Mar 11, 2026

Uh oh!

vigoo Mar 11, 2026

Uh oh!

afsalthaj Mar 12, 2026

Uh oh!

afsalthaj Mar 12, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

afsalthaj commented Mar 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

afsalthaj commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Note

Static Resource

Template Resource

Multimodal - A weather report + weather image together

UnstructuredBinary: A snow fall image, given a city

Unstrucured-Text report (there is nothing much here)

Uh oh!

Uh oh!

vigoo Mar 11, 2026

Choose a reason for hiding this comment

Uh oh!

afsalthaj Mar 11, 2026

Choose a reason for hiding this comment

Uh oh!

afsalthaj Mar 11, 2026

Choose a reason for hiding this comment

Uh oh!

vigoo Mar 11, 2026

Choose a reason for hiding this comment

Uh oh!

afsalthaj Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

afsalthaj Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

afsalthaj commented Mar 14, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

afsalthaj commented Mar 3, 2026 •

edited

Loading