Pydantic AI includes a built-in web chat interface that you can use to interact with your agents through a browser.

<video src="https://github.com/user-attachments/assets/8a1c90dc-f62b-4e35-9d66-59459b45790d" autoplay loop muted playsinline></video>

## Installation

Install the `web` extra (installs Starlette):

```bash
pip/uv-add 'pydantic-ai-slim[web]'
```

For CLI usage with `clai web`, see the [CLI documentation](../cli.md#web-chat-ui).

## Usage

Create a web app from an agent instance using [`Agent.to_web()`][pydantic_ai.agent.Agent.to_web]:

=== "Using Model Names"

    ```python
    from pydantic_ai import Agent
    from pydantic_ai.builtin_tools import WebSearchTool

    agent = Agent('openai:gpt-5')

    @agent.tool_plain
    def get_weather(city: str) -> str:
        return f'The weather in {city} is sunny'

    # Create app with model names (their display names are auto-generated)
    app = agent.to_web(
        models=['openai:gpt-5', 'anthropic:claude-sonnet-4-5'],
        builtin_tools=[WebSearchTool()],
    )

    # Or with custom display labels
    app = agent.to_web(
        models={'GPT 5': 'openai:gpt-5', 'Claude': 'anthropic:claude-sonnet-4-5'},
        builtin_tools=[WebSearchTool()],
    )
    ```

=== "Using Model Instances"

    ```python
    from pydantic_ai import Agent
    from pydantic_ai.builtin_tools import WebSearchTool
    from pydantic_ai.models.anthropic import AnthropicModel
    from pydantic_ai.models.openai import OpenAIModel
    from pydantic_ai.providers.openai import OpenAIProvider

    # Create separate models, each with its own custom configuration
    anthropic_model = AnthropicModel('claude-sonnet-4-5')
    openai_model = OpenAIModel('gpt-5', provider=OpenAIProvider(api_key='custom-key'))

    agent = Agent(openai_model)

    @agent.tool_plain
    def get_weather(city: str) -> str:
        return f'The weather in {city} is sunny'

    # Use the instances directly
    app = agent.to_web(
        models=[openai_model, anthropic_model],
        builtin_tools=[WebSearchTool()],
    )

    # Or mix instances and strings with custom labels
    app = agent.to_web(
        models={'Custom GPT': openai_model, 'Claude': 'anthropic:claude-sonnet-4-5'},
        builtin_tools=[WebSearchTool()],
    )

    # With extra instructions passed to each run
    app = agent.to_web(
        models=[openai_model],
        instructions='Always respond in a friendly tone.',
    )
    ```

The returned Starlette app can be run with any ASGI server:

```bash
uvicorn my_module:app --host 0.0.0.0 --port 8080
```

## Builtin Tool Support

Builtin tool support is automatically determined from each model's profile. The UI will only show tools that the selected model supports.

Available [builtin tools](../builtin-tools.md):

- `web_search` - Web search capability ([`WebSearchTool`][pydantic_ai.builtin_tools.WebSearchTool])
- `code_execution` - Code execution in a sandbox ([`CodeExecutionTool`][pydantic_ai.builtin_tools.CodeExecutionTool])
- `image_generation` - Image generation ([`ImageGenerationTool`][pydantic_ai.builtin_tools.ImageGenerationTool])
- `web_fetch` - Fetch content from URLs ([`WebFetchTool`][pydantic_ai.builtin_tools.WebFetchTool])
- `memory` - Persistent memory across conversations ([`MemoryTool`][pydantic_ai.builtin_tools.MemoryTool])

!!! note "Memory Tool Requirements"
    The `memory` tool requires the agent to have memory configured via the
    `memory` parameter when creating the agent.

## Reserved Routes

The web UI app uses the following routes which should not be overwritten:

- `/` and `/{id}` - Serves the chat UI
- `/api/chat` - Chat endpoint (POST, OPTIONS)
- `/api/configure` - Frontend configuration (GET)
- `/api/health` - Health check (GET)

The app cannot currently be mounted at a subpath (e.g., `/chat`) because the UI expects these routes at the root. You can add additional routes to the app, but avoid conflicts with these reserved paths.