MCP UI — A Declarative UI + Action Protocol for Agents #522

biswapm · 2025-08-05T05:44:31Z

biswapm
Aug 5, 2025

Pre-submission Checklist

I have verified this would not be more appropriate as a feature request in a specific repository
I have searched existing discussions to avoid duplicates

Your Idea

Current MCP definitions focus on tools and semantics, but lack a standard way to define dynamic, agent-driven UIs or workflows. We need a UI protocol for composable rendering and user interaction. Introduce a UI-oriented extension to MCP called MCP.UI, where a tool not only returns data or performs actions, but can also declare how to render a UI, bind interactions, and wire actions.

It’s like giving agents a way to speak React (or JSX) — declarative, composable, and reactive — but abstracted through schema.

Use Cases:
Agent returns a form to create a CRM record, pre-filled from context, with on-submit invoking an MCP tool.

Agent renders a dynamic dashboard UI, with filters tied to ui:bind and click actions invoking a read_data tool.

Agent sends a confirmation modal with Approve/Reject buttons wired to specific tools.

Declarative step-by-step wizards, where each step's UI is returned by the agent, contextually.

e.g.

{
"tool": "create_contact_form",
"ui:type": "form",
"ui:elements": [
{ "type": "text", "label": "Name", "bind": "contact.name" },
{ "type": "email", "label": "Email", "bind": "contact.email" },
{ "type": "submit", "label": "Save", "action": "create_contact" }
],
"ui:bind": {
"contact.name": "{{ user.name }}",
"contact.email": "{{ user.email }}"
},
"ui:actions": {
"create_contact": {
"tool": "create_contact",
"inputs": {
"name": "{{ contact.name }}",
"email": "{{ contact.email }}"
}
}
}
}

Scope

tadasant · 2025-08-06T14:24:52Z

tadasant
Aug 6, 2025
Collaborator

See https://mcpui.dev/ for the most developed work on this front I've seen to date

There is also a #ui-wg channel in the MCP Contributor Discord

0 replies

alexbkogan · 2025-09-11T16:57:17Z

alexbkogan
Sep 11, 2025

Hi MCP UI team! We’ve been independently working on very similar ideas (we call our version Fractal https://www.fractalmcp.com) and we are so excited that other teams are also thinking about delivering UI via MCP! We’d love for a single standard to emerge so we can all make our client libraries interoperable (so whether a company uses MCP UI, Fractal, or someone else’s server library, it all works on the consumption side).

Below we review (a) your existing protocol spec (and suggest other teams align to it), (b) suggest an expansion focused on making the components themselves act like MCP servers to enable interactivity with agents.

Here’s what we see right now as far as protocol specs go:

UI tool responses

We love what MCP UI has done with the UIResource interface! We have something equivalent (though slightly different on our side), but to minimize friction in developing a standard, we will adopt the UIResource convention your team has created and we suggest other teams working in the space should also adopt it:

interface UIResource {
    type: 'resource';
    resource: {
        uri: string; // ui://component/id
        mimeType: 'text/html' | 'text/uri-list' | 'application/vnd.mcp-ui.remote-dom';
        // text/html: inline HTML
        // text/uri-list: URL(s)
        // application/vnd.mcp-ui.remote-dom: JS runtime for remote DOM
        _metatada: {[key:string]: string},
        text?: string; // Inline text (e.g., HTML or URL)
        blob?: string; // Base64-encoded payload (e.g., HTML bundle or URL list)
    };
}

Communication protocol between iframe and parent

We are aligned with MCP UI that there should be standardization on the communication protocol between the UI components inside the iframe and the parent component that appears in a chat. We think this is important because it will allow client libraries to support key features like resizing, user intents, and beyond.

We would like to propose that communication should be two-way, flowing between the iframed component and the parent rendering it.

Current format (iframe to parent)

Message IDs correlate requests with their responses.

type IframeToParent =
    | { type: 'tool';   payload: { toolName: string; params: Record<string, unknown> }; messageId?: string }
    | { type: 'intent'; payload: { intent: string;    params: Record<string, unknown> }; messageId?: string }
    | { type: 'prompt'; payload: { prompt: string };                                   messageId?: string }
    | { type: 'notify'; payload: { message: string };                                  messageId?: string }
    | { type: 'link';   payload: { url: string };                                      messageId?: string };

Currently these response events are handled inside the iframe:

ui-message-received
ui-message-response

Proposed extension: parent to iframe RPC

Motivation

We propose extending the spec so an agent can query a minimal DOM representation inside the iframe and drive interactions through a lightweight RPC layer. Components can declare whether they support this mode. Since most actions from parent to iframe will be done by the AI agent, we propose exposing functionality just like an MCP server would: via introspection and tool calls.

Use cases

Agent assisted navigation across multi step flows
Form filling and submission via visible UI fields
Onboarding flows driven by the agent
Shared context where user and agent manipulate the same state

Approach

Extend the messaging protocol for two way communication.
Define a small set of standard RPC methods such as queryDom, click, and enterText.
Allow components to act as MCP servers which support tools/list and tools/call.

1) Standardize the iframe message response format

We propose standardizing the response envelope as follows:

type UIMessageResponse<T = unknown> = {
    messageId: string;
    response?: T;
    error?: string;
};

This pattern is hinted at in the MCP UI documentation but not spelled out explicitly.

2) Extend messaging protocol to go both ways

Allow the parent to initiate a narrow set of built-in declarative actions. Use the same messageId correlation pattern.

type QueryDomResult  = string;                 // minimal HTML snapshot
type ClickResult     = { success: boolean };
type EnterTextResult = { success: boolean };

type ParentToIframe =
    | { type: 'queryDom';  payload: {};                                   messageId: string }
    | { type: 'click';     payload: { elementId: string };                 messageId: string }
    | { type: 'enterText'; payload: { elementId: string; text: string };   messageId: string };

type ParentToIframeResponse =
    | UIMessageResponse<QueryDomResult>
    | UIMessageResponse<ClickResult>
    | UIMessageResponse<EnterTextResult>;

These primitives should be enough to enable any interaction you might want your agent to take. We believe there’s a benefit of taking this a step further:

3) Components as MCP servers

We believe that allowing components themselves to behave as MCP servers is the simplest and most flexible way to enable agent-component interactions.

To this end, we propose adding support for the MCP-standard tools/list and tools/call messages, letting components advertise and expose custom actions beyond the standard queryDom, click, and enterText. The response formats directly match MCP’s ToolsListResult and CallToolResult.

Motivation

Gives components flexibility to expose specialized interactions.
Keeps the protocol consistent with MCP’s philosophy of declared capabilities.
Enables agents to dynamically discover and reason about available UI actions.

Example sketch

// We borrow some types from MCP!
import type { Tool, ToolsListResult, CallToolResult, Content } from '@modelcontextprotocol/sdk';


// Extend the ParentToIframe union with MCP tools support
type ParentToIframe =
    | { type: 'queryDom';    payload: {};                                    messageId: string }
    | { type: 'click';       payload: { elementId: string };                  messageId: string }
    | { type: 'enterText';   payload: { elementId: string; text: string };    messageId: string }
    | { type: 'tools/list';  payload: { cursor?: string };                    messageId: string }
    | { type: 'tools/call';  payload: { name: string; arguments?: Record<string, unknown> }; messageId: string };

// ParentToIframeResponse stays consistent
type ParentToIframeResponse =
    | UIMessageResponse<QueryDomResult>
    | UIMessageResponse<ClickResult>
    | UIMessageResponse<EnterTextResult>
    | UIMessageResponse<ToolsListResult>
    | UIMessageResponse<CallToolResult>;

Putting it all together

What exists today

UIResource referencing UI via text/html, text/uri-list, or application/vnd.mcp-ui.remote-dom.
Messages flow from iframe to parent: tool, intent, prompt, notify, link.
Responses are handled via ui-message-received and ui-message-response, but without a standardized envelope.

What we propose to add or change

Standard response envelope for iframe messaging
Use a consistent envelope for all responses:
UIMessageResponse<T> { messageId; response?: T; error?: string }
Two way messaging
Add parent-to-iframe RPC methods queryDom, click, and enterText.
Reuse the same messageId correlation for all request and response pairs,
We believe these should be standardized and therefore should not be left to cover as a component mcp tool
Components as MCP servers
Introduce tools/list and tools/call methods so components can expose arbitrary functionality to an agent.
This aligns UI components with the MCP model of declared capabilities.

Final consolidated TypeScript sketch

import type {
    ToolsListResult,
    CallToolResult,
} from '@modelcontextprotocol/sdk';

// ---------- UIResource ----------
interface UIResource {
    type: 'resource';
    resource: {
        uri: string; // ui://component/id
        mimeType: 'text/html' | 'application/vnd.mcp-ui.remote-dom' | 'text/uri-list';
        metadata?: { [key: string]: string };
        text?: string;
        blob?: string; // base64-encoded bundle; preferred
    };
}

// ---------- UI Message Envelope ----------
interface UIMessage<TType extends string, TPayload> {
    type: TType;
    payload: TPayload;
    messageId?: string; // required if a response is expected
}

type UIMessageResponse<T = unknown> = {
    messageId: string;
    response?: T;
    error?: string;
};

// ---------- Iframe to Parent ----------
type IframeToParent =
    | UIMessage<'tool',   { toolName: string; params: Record<string, unknown> }>
    | UIMessage<'intent', { intent: string;    params: Record<string, unknown> }>
    | UIMessage<'prompt', { prompt: string }>
    | UIMessage<'notify', { message: string }>
    | UIMessage<'link',   { url: string }>;

// ---------- Parent to Iframe (RPC) ----------
type QueryDomResult  = string;
type ClickResult     = { success: boolean };
type EnterTextResult = { success: boolean };

type ParentToIframe =
    | UIMessage<'queryDom',   {}>
    | UIMessage<'click',      { elementId: string }>
    | UIMessage<'enterText',  { elementId: string; text: string }>
    | UIMessage<'tools/list', { cursor?: string }>
    | UIMessage<'tools/call', { name: string; arguments?: Record<string, unknown> }>;

type ParentToIframeResponse =
    | UIMessageResponse<QueryDomResult>
    | UIMessageResponse<ClickResult>
    | UIMessageResponse<EnterTextResult>
    | UIMessageResponse<ToolsListResult>
    | UIMessageResponse<CallToolResult>;

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model Context Protocol

MCP UI — A Declarative UI + Action Protocol for Agents #522

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Model Context Protocol

MCP UI — A Declarative UI + Action Protocol for Agents #522

Uh oh!

biswapm Aug 5, 2025

Pre-submission Checklist

Your Idea

Scope

Replies: 2 comments

Uh oh!

tadasant Aug 6, 2025 Collaborator

Uh oh!

alexbkogan Sep 11, 2025

UI tool responses

Communication protocol between iframe and parent

Current format (iframe to parent)

Proposed extension: parent to iframe RPC

Motivation

1) Standardize the iframe message response format

2) Extend messaging protocol to go both ways

These primitives should be enough to enable any interaction you might want your agent to take. We believe there’s a benefit of taking this a step further:

3) Components as MCP servers

Putting it all together

What exists today

What we propose to add or change

Final consolidated TypeScript sketch

biswapm
Aug 5, 2025

tadasant
Aug 6, 2025
Collaborator

alexbkogan
Sep 11, 2025