spacedriveapp
diff --git a/docs/content/docs/(configuration)/config.mdx b/docs/content/docs/(configuration)/config.mdx
Lines changed: 8 additions & 2 deletions
diff --git a/docs/content/docs/(configuration)/meta.json b/docs/content/docs/(configuration)/meta.json
Lines changed: 1 addition & 1 deletion
diff --git a/docs/content/docs/(configuration)/permissions.mdx b/docs/content/docs/(configuration)/permissions.mdx
Lines changed: 2 additions & 2 deletions
diff --git a/docs/content/docs/(configuration)/sandbox.mdx b/docs/content/docs/(configuration)/sandbox.mdx
Lines changed: 221 additions & 0 deletions
diff --git a/docs/content/docs/(features)/tools.mdx b/docs/content/docs/(features)/tools.mdx
Lines changed: 5 additions & 7 deletions
@@ -536,12 +536,15 @@ Agent-specific routing is set via `[agents.routing]` with the same keys as `[def
 
 ### `[agents.sandbox]`
 
-OS-level filesystem containment for shell and exec tool subprocesses. Uses bubblewrap (Linux) or sandbox-exec (macOS) to enforce read-only access to everything outside the workspace.
+OS-level filesystem containment and environment sanitization for shell and exec tool subprocesses. Uses bubblewrap (Linux) or sandbox-exec (macOS) to enforce read-only access to everything outside the workspace. Environment sanitization runs in all modes -- workers never inherit the parent's environment variables.
+
+See [Sandbox](/docs/sandbox) for a full explanation of how containment, environment sanitization, leak detection, and durable binaries work.
 
 | Key | Type | Default | Description |
 |-----|------|---------|-------------|
-| `mode` | string | `"enabled"` | `"enabled"` for kernel-enforced containment, `"disabled"` for full host access |
+| `mode` | string | `"enabled"` | `"enabled"` for kernel-enforced containment, `"disabled"` for passthrough (env sanitization still applies) |
 | `writable_paths` | string[] | `[]` | Additional directories the agent can write to beyond its workspace |
+| `passthrough_env` | string[] | `[]` | Environment variable names to forward from the parent process to worker subprocesses |
 
 When `mode = "enabled"`, shell and exec commands run inside a mount namespace where the entire filesystem is read-only except:
 
@@ -552,12 +555,15 @@ When `mode = "enabled"`, shell and exec commands run inside a mount namespace wh
 
 The agent's data directory (databases, config) is explicitly re-mounted read-only even if it would otherwise be writable due to path overlap.
 
+Regardless of mode, all worker subprocesses start with a clean environment. Only `PATH` (with `tools/bin` prepended), safe variables (`HOME`, `USER`, `LANG`, `TERM`, `TMPDIR`), and any `passthrough_env` entries are injected. Use `passthrough_env` with least privilege: forward only the variables that worker tools require (for example, specific credentials set in Docker Compose or systemd), and avoid forwarding broad or highly sensitive credentials.
+
 If the sandbox backend isn't available (e.g. bubblewrap not installed), processes run unsandboxed with a warning at startup.
 
 ```toml
 [agents.sandbox]
 mode = "enabled"
 writable_paths = ["/home/user/projects/myapp", "/var/data/shared"]
+passthrough_env = ["GH_TOKEN", "GITHUB_TOKEN"]
 ```
 
 ### `[[agents.cron]]`
 
@@ -1,4 +1,4 @@
 {
   "title": "Configuration",
-  "pages": ["config", "permissions"]
+  "pages": ["config", "sandbox", "permissions"]
 }
@@ -9,7 +9,7 @@ Per-agent permission system that controls what tools can do, enforces inter-agen
 
 ## Design Principles
 
-**The container is the OS-level sandbox.** On spacebot.sh, each user runs in an isolated container. Self-hosters who need OS-level isolation can run Spacebot in Docker themselves. The permissions system does not attempt to replicate container-level security — it handles inter-agent boundaries and tool-level restrictions within a single Spacebot process.
+**OS-level containment is handled by the [Sandbox](/docs/sandbox) when sandboxing is enabled.** The sandbox enforces filesystem boundaries and environment sanitization at the kernel level, with exact guarantees depending on mode and backend/platform. On spacebot.sh, each user also runs in an isolated container. The permissions system handles a different layer -- inter-agent boundaries and tool-level restrictions within a single Spacebot process.
 
 **Deny is an error, not invisible.** When a tool call is denied, the tool still appears in the LLM's tool list, but returns a structured error explaining the restriction. The LLM can reason about the denial and adapt. This is better than hiding tools (which causes the LLM to attempt workarounds) and better than silent failures (which cause confusion).
 
@@ -285,7 +285,7 @@ Default when no `[permissions]` block exists: all tools return permission denied
 
 ## What This Does NOT Do
 
-**OS-level sandboxing.** No Docker containers, no seccomp profiles, no capability dropping. Provisioned instances (spacebot.sh) or the user's own Docker setup handles this. Spacebot's permissions system is application-level.
+**OS-level sandboxing.** The permissions system is application-level -- it controls which tools the LLM can use and what paths it's allowed to access. OS-level containment is handled by the [Sandbox](/docs/sandbox), which operates independently when sandboxing is enabled; exact primitives vary by mode and backend/platform. Provisioned instances (spacebot.sh) also run in isolated containers. The permissions system and sandbox are complementary layers.
 
 **Shell command parsing.** We don't parse shell pipelines or analyze command strings for dangerous patterns. For `shell = "workspace"`, the `working_dir` is confined but the command itself runs unrestricted within that directory. Full command analysis is fragile and has diminishing returns — if you need that level of restriction, use `shell = "deny"`.
 
 
@@ -0,0 +1,221 @@
+---
+title: Sandbox
+description: OS-level filesystem containment and environment sanitization for worker subprocesses.
+---
+
+# Sandbox
+
+OS-level containment for builtin worker subprocesses (`shell` and `exec`). Prevents workers from modifying the host filesystem, reading inherited environment secrets, and accessing the agent's internal data directory.
+
+## How It Works
+
+When a worker runs a shell or exec command, the sandbox wraps the subprocess in an OS-level containment layer before execution. The worker's command otherwise runs normally, but it can read only a minimal runtime allowlist plus workspace paths, and it can write only to explicitly allowed paths.
+
+```
+Worker calls shell("npm test")
+  → Sandbox.wrap() builds a contained command
+  → Subprocess runs with:
+      - Read access to a minimal system allowlist + workspace
+      - Writable access only to the workspace + configured writable paths + /tmp
+      - Clean environment (no inherited secrets)
+      - HOME set to workspace, TMPDIR set to /tmp
+      - tools/bin prepended to PATH
+  → stdout/stderr captured and returned to worker
+```
+
+Two things happen regardless of whether the sandbox is enabled or disabled:
+
+1. **Environment sanitization** -- worker subprocesses never inherit the parent's environment variables. Secrets like `ANTHROPIC_API_KEY` are never visible to workers.
+2. **PATH injection** -- the persistent `tools/bin` directory is prepended to PATH so durably installed binaries are always available.
+
+## Backends
+
+The sandbox auto-detects the best available backend at startup:
+
+| Platform | Backend | Mechanism |
+|----------|---------|-----------|
+| Linux | [bubblewrap](https://github.com/containers/bubblewrap) | Mount namespaces, PID namespaces, environment isolation |
+| macOS | sandbox-exec | SBPL profile with deny-default policy |
+| Other / not available | Passthrough | No filesystem containment (env sanitization still applies) |
+
+If the sandbox is enabled but no backend is available, processes run unsandboxed with a warning at startup. Environment sanitization still applies in all cases.
+
+### Linux (bubblewrap)
+
+The default on all hosted instances and most self-hosted Linux deployments. Bubblewrap creates a mount namespace where:
+
+- A minimal host runtime allowlist is mounted **read-only** (`/bin`, `/sbin`, `/usr`, `/lib`, `/lib64`, `/etc`, `/opt`, `/run`, `/nix` when present)
+- The persistent tools directory is mounted **read-only** (if present)
+- The workspace directory is mounted **read-write**
+- `writable_paths` entries are mounted **read-write**
+- `/tmp` is a private tmpfs per invocation
+- `/dev` has standard device nodes
+- `/proc` is a fresh procfs (when supported by the environment)
+- The agent's data directory is masked with an empty tmpfs (no reads/writes)
+- PID namespace isolation prevents the subprocess from seeing other processes
+- `--die-with-parent` ensures the subprocess is killed if the parent exits
+
+Nested containers (Docker-in-Docker, Fly Machines) may not support `--proc /proc`. The sandbox probes for this at startup and falls back gracefully -- `proc_supported: false` in the startup log means `/proc` inside the sandbox shows the host's process list rather than an isolated view.
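As a rough sketch of how the mount list above could translate into bubblewrap arguments, the Rust function below assembles an argv from a hypothetical `Mounts` struct. The struct, its field names, and `bwrap_args` are assumptions made for illustration and are not Spacebot's actual `Sandbox.wrap()`. The flags themselves (`--ro-bind`, `--bind`, `--tmpfs`, `--dev`, `--proc`, `--unshare-pid`, `--die-with-parent`) are standard bubblewrap options.

```rust
/// Hypothetical description of what the sandbox should mount.
/// Field names are illustrative, not taken from the Spacebot source.
struct Mounts<'a> {
    read_only: &'a [&'a str],
    workspace: &'a str,
    writable_paths: &'a [&'a str],
    data_dir: &'a str,
    proc_supported: bool,
}

/// Append a group of related arguments to the argv.
fn push(args: &mut Vec<String>, parts: &[&str]) {
    args.extend(parts.iter().map(|part| part.to_string()));
}

/// Build the bubblewrap arguments that precede the wrapped command.
fn bwrap_args(m: &Mounts<'_>) -> Vec<String> {
    let mut args = Vec::new();

    // Read-only system roots and the tools directory.
    for &path in m.read_only {
        push(&mut args, &["--ro-bind", path, path]);
    }
    // Read-write workspace and user-configured writable paths.
    push(&mut args, &["--bind", m.workspace, m.workspace]);
    for &path in m.writable_paths {
        push(&mut args, &["--bind", path, path]);
    }
    // Private /tmp and standard device nodes.
    push(&mut args, &["--tmpfs", "/tmp"]);
    push(&mut args, &["--dev", "/dev"]);
    // Fresh procfs only when the startup probe succeeded.
    if m.proc_supported {
        push(&mut args, &["--proc", "/proc"]);
    }
    // Mask the data directory last so it overrides any overlapping bind.
    push(&mut args, &["--tmpfs", m.data_dir]);
    push(&mut args, &["--unshare-pid", "--die-with-parent"]);
    args
}
```

Bubblewrap applies mount operations in the order given, which is why this sketch places the data directory mask after the workspace bind.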
+
+### macOS (sandbox-exec)
+
+Uses Apple's sandbox-exec with a generated SBPL (Sandbox Profile Language) profile. The profile starts with `(deny default)` and explicitly allows:
+
+- Process execution and forking
+- Reading only a backend allowlist (system runtime roots + workspace + configured writable paths + tools/bin)
+- Writing only to the workspace, configured writable paths, and `/tmp`
+- Network access (unrestricted)
+- Standard device and IPC operations
+
+The agent's data directory is denied for both reads and writes even if it falls under the workspace subtree.
+
+Note: `sandbox-exec` is deprecated by Apple but remains functional. It's the only user-space sandbox option on macOS that doesn't require a full VM.
+
+## Filesystem Boundaries
+
+When the sandbox is enabled, the subprocess sees:
+
+| Path | Access | Notes |
+|------|--------|-------|
+| System runtime allowlist | Read-only | Backend-specific system roots required to execute common tools |
+| Agent workspace | Read-write | Where the worker does its job |
+| `writable_paths` entries | Read-write | User-configured additional paths |
+| `{instance_dir}/tools/bin` | Read-only | Persistent binaries on PATH |
+| `/tmp` | Read-write | Private per invocation (bubblewrap) |
+| `/dev` | Read-write | Standard device nodes |
+| Agent data directory | **No access** | Masked/denied to protect databases and config |
+
+The data directory protection is important: even if the data directory overlaps with workspace-related paths, it's explicitly blocked. The block is enforced at the kernel level, so workers can't read or modify databases, config files, or identity files.
+
+## Environment Sanitization
+
+Worker subprocesses start with a **clean environment**. The parent process's environment variables are never inherited. This applies in all sandbox modes -- even when the sandbox is disabled, `env_clear()` strips the environment.
+
+A worker running `printenv` sees only:
+
+| Variable | Source | Value |
+|----------|--------|-------|
+| `PATH` | Always | `{instance_dir}/tools/bin:{system_path}` |
+| `HOME` | Always | Worker workspace path |
+| `TMPDIR` | Always | `/tmp` |
+| `USER` | Always | From parent (if set) |
+| `LANG` | Always | From parent (if set) |
+| `TERM` | Always | From parent (if set) |
+| `passthrough_env` entries | Config | User-configured forwarding |
+
+Workers never see `ANTHROPIC_API_KEY`, `DISCORD_BOT_TOKEN`, `SPACEBOT_*` internal vars, or any other environment variables from the parent process.
+
+### passthrough_env
+
+Self-hosted users who set credentials as environment variables in Docker Compose or systemd can forward specific variables to worker subprocesses:
+
+```toml
+[agents.sandbox]
+passthrough_env = ["GH_TOKEN", "GITHUB_TOKEN", "NPM_TOKEN"]
+```
+
+Each listed variable is read from the parent process environment at subprocess spawn time and injected into the worker's environment. Variables not in the list are stripped.
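The environment rules in this section can be sketched in Rust roughly as follows. This is a minimal illustration under stated assumptions, not the project's code: `build_worker_env`, `worker_command`, and `SAFE_VARS` are hypothetical names, and the parent environment is passed in as plain pairs so the logic stays easy to test. `Command::env_clear` and `Command::envs` are the standard library calls.

```rust
use std::collections::BTreeMap;
use std::process::Command;

/// Variables copied from the parent only when they are set there.
const SAFE_VARS: [&str; 3] = ["USER", "LANG", "TERM"];

/// Build a worker environment from scratch (hypothetical helper).
/// `parent` holds the parent's environment as (name, value) pairs.
fn build_worker_env(
    parent: &[(&str, &str)],
    workspace: &str,
    tools_bin: &str,
    passthrough_env: &[&str],
) -> BTreeMap<String, String> {
    let lookup = |name: &str| -> Option<String> {
        parent
            .iter()
            .find(|(key, _)| *key == name)
            .map(|(_, value)| value.to_string())
    };
    let mut env = BTreeMap::new();

    // PATH: tools/bin first, then the parent's PATH when it has one.
    let path = match lookup("PATH") {
        Some(system_path) => format!("{}:{}", tools_bin, system_path),
        None => tools_bin.to_string(),
    };
    env.insert("PATH".to_string(), path);
    env.insert("HOME".to_string(), workspace.to_string());
    env.insert("TMPDIR".to_string(), "/tmp".to_string());

    // Safe variables and passthrough entries are forwarded only if set.
    for name in SAFE_VARS {
        if let Some(value) = lookup(name) {
            env.insert(name.to_string(), value);
        }
    }
    for &name in passthrough_env {
        if let Some(value) = lookup(name) {
            env.insert(name.to_string(), value);
        }
    }
    env
}

/// Apply the environment to a subprocess: clear everything, then inject.
fn worker_command(program: &str, env: &BTreeMap<String, String>) -> Command {
    let mut command = Command::new(program);
    command.env_clear().envs(env);
    command
}
```

Building the map first and applying it in one step keeps the allowlist logic separate from process spawning, so the same rules hold whether or not a containment backend wraps the command.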
+
+When the secret store is available, `passthrough_env` is redundant -- credentials should be stored in the secret store, which injects tool secrets automatically. The field is additive and continues to work alongside the store.
+
+## Durable Binaries
+
+On hosted instances, the root filesystem is ephemeral -- machine image rollouts replace it. Binaries installed via `apt-get install` or similar disappear on the next deploy.
+
+The `{instance_dir}/tools/bin` directory is on the persistent volume and is prepended to `PATH` for all worker subprocesses. Binaries placed here survive restarts and rollouts.
+
+Workers are instructed about this in their system prompt:
+
+```
+Persistent binary directory: /data/tools/bin (on PATH, survives restarts and rollouts)
+Binaries installed via package managers (apt, brew, etc.) land on the root filesystem
+which is ephemeral on hosted instances -- they disappear on rollouts. To install a tool
+durably, download or copy the binary into /data/tools/bin.
+```
+
+The `GET /agents/tools` API endpoint lists installed binaries for dashboard observability:
+
+```json
+{
+  "tools_bin": "/data/tools/bin",
+  "binaries": [
+    { "name": "gh", "size": 1234567, "modified": "2026-02-20T14:15:00Z" },
+    { "name": "ripgrep", "size": 3456789, "modified": "2026-02-15T10:30:00Z" }
+  ]
+}
+```
+
+## Leak Detection
+
+All tool output (shell, exec, file, browser) is scanned for known secret patterns before being returned to the LLM. This runs in the `SpacebotHook` after every tool execution.
+
+Detected patterns include:
+
+- OpenAI keys (`sk-...`)
+- Anthropic keys (`sk-ant-...`)
+- GitHub tokens (`ghp_...`)
+- Google API keys (`AIza...`)
+- Discord bot tokens
+- Slack tokens (`xoxb-...`, `xapp-...`)
+- Telegram bot tokens
+- PEM private keys
+
+Detection also covers encoded forms: content wrapped in base64, URL encoding, or hex is decoded and checked against the same patterns.
+
+If a leak is detected, the process is terminated immediately with an error. The raw leaked value is never logged or returned to the LLM -- only the detection event, encoding type, and a truncated non-reversible fingerprint are recorded for debugging.
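To make the decode-then-check idea concrete, here is a deliberately simplified Rust sketch. It is an assumption-laden illustration rather than the detector described above: it matches fixed prefixes instead of regular expressions, handles only plain and hex-encoded text (base64 and URL decoding are omitted), and every name in it (`MARKERS`, `detect_leak`, `Encoding`) is hypothetical.

```rust
/// Where the match was found (simplified: base64 and URL forms omitted).
#[derive(Debug, PartialEq)]
enum Encoding {
    Plain,
    Hex,
}

/// Illustrative prefix markers; a real detector would use stricter patterns.
const MARKERS: [(&str, &str); 4] = [
    ("anthropic", "sk-ant-"),
    ("github", "ghp_"),
    ("google", "AIza"),
    ("slack", "xoxb-"),
];

/// Return the name of the first marker found in `text`, if any.
fn find_marker(text: &str) -> Option<&'static str> {
    MARKERS
        .iter()
        .find(|(_, marker)| text.contains(*marker))
        .map(|(name, _)| *name)
}

/// Decode a run of ASCII hex digits into UTF-8 text, if possible.
fn decode_hex(run: &str) -> Option<String> {
    if run.len() % 2 != 0 {
        return None;
    }
    let bytes: Option<Vec<u8>> = (0..run.len())
        .step_by(2)
        .map(|i| u8::from_str_radix(&run[i..i + 2], 16).ok())
        .collect();
    String::from_utf8(bytes?).ok()
}

/// Check plain text first, then every sufficiently long hex run.
fn detect_leak(output: &str) -> Option<(&'static str, Encoding)> {
    if let Some(name) = find_marker(output) {
        return Some((name, Encoding::Plain));
    }
    output
        .split(|c: char| !c.is_ascii_hexdigit())
        .filter(|run| run.len() >= 8)
        .filter_map(decode_hex)
        .find_map(|decoded| find_marker(&decoded))
        .map(|name| (name, Encoding::Hex))
}
```

Note that the function returns only the pattern name and encoding type, never the matched value, which mirrors the rule that raw leaked values are not logged or returned.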
+
+### OpenCode Workers
+
+OpenCode workers (external coding agent processes) are covered by the same protection. SSE output events are scanned through both:
+
+1. **Output scrubbing** (exact-match redaction of known secret values) -- runs first
+2. **Leak detection** (regex pattern matching for unknown secrets) -- runs second
+
+The ordering ensures that stored tool secrets are redacted before leak detection runs, so expected secret values in worker output don't trigger false-positive kills.
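The effect of that ordering can be shown with a small Rust sketch. All names here (`scrub`, `looks_like_secret`, `process_event`) and the `[REDACTED]` placeholder are assumptions for illustration, and the detection step is reduced to a prefix check rather than the regex patterns described above.

```rust
/// Step 1: exact-match redaction of known secret values.
fn scrub(output: &str, known_secrets: &[&str]) -> String {
    let mut scrubbed = output.to_string();
    for &secret in known_secrets {
        // Skip empty values so replace() cannot match between every character.
        if !secret.is_empty() {
            scrubbed = scrubbed.replace(secret, "[REDACTED]");
        }
    }
    scrubbed
}

/// Step 2: pattern check for secrets that are not in the store
/// (simplified to a few prefixes for this sketch).
fn looks_like_secret(text: &str) -> bool {
    ["sk-ant-", "ghp_", "xoxb-"]
        .iter()
        .any(|&marker| text.contains(marker))
}

/// Scrub first, then detect, so expected secrets never trigger a kill.
fn process_event(output: &str, known_secrets: &[&str]) -> Result<String, &'static str> {
    let scrubbed = scrub(output, known_secrets);
    if looks_like_secret(&scrubbed) {
        Err("leak detected")
    } else {
        Ok(scrubbed)
    }
}
```

With a stored token of `ghp_abc123`, output containing that exact value is redacted and passes, while output containing an unknown `ghp_` token still fails the detection step.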
+
+## Dynamic Mode Switching
+
+Sandbox mode can be changed at runtime via the API or dashboard without restarting the agent. The `Sandbox` struct reads the current mode from a shared `ArcSwap<SandboxConfig>` on every `wrap()` call.
+
+```
+PUT /agents/config
+{
+  "sandbox": { "mode": "disabled" }
+}
+```
+
+Backend detection runs at startup regardless of the initial mode. If the sandbox starts disabled and is later enabled via the API, bubblewrap/sandbox-exec is already detected and ready to use.
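The pattern of reading shared configuration on every `wrap()` call can be approximated with the standard library alone. The sketch below swaps in `RwLock<Arc<SandboxConfig>>` as a stand-in for `ArcSwap<SandboxConfig>`, and the struct fields, the `Wrapped` enum, and the helper functions are hypothetical rather than Spacebot's real definitions.

```rust
use std::sync::{Arc, RwLock};

#[derive(Clone, Copy, Debug, PartialEq)]
enum SandboxMode {
    Enabled,
    Disabled,
}

struct SandboxConfig {
    mode: SandboxMode,
}

/// Stand-in for ArcSwap<SandboxConfig>, built from std types only.
type SharedConfig = Arc<RwLock<Arc<SandboxConfig>>>;

fn shared_config(mode: SandboxMode) -> SharedConfig {
    Arc::new(RwLock::new(Arc::new(SandboxConfig { mode })))
}

/// Result of wrapping a command (illustrative, not the real return type).
#[derive(Debug, PartialEq)]
enum Wrapped {
    Contained(String),
    Passthrough(String),
}

struct Sandbox {
    config: SharedConfig,
    /// Detected once at startup, regardless of the initial mode.
    backend_available: bool,
}

impl Sandbox {
    /// Read the current mode on every call so runtime changes take effect.
    fn wrap(&self, command: &str) -> Wrapped {
        let config: Arc<SandboxConfig> = self.config.read().unwrap().clone();
        if config.mode == SandboxMode::Enabled && self.backend_available {
            Wrapped::Contained(command.to_string())
        } else {
            Wrapped::Passthrough(command.to_string())
        }
    }
}

/// Swap in a new config, as an API handler might do.
fn set_mode(shared: &SharedConfig, mode: SandboxMode) {
    *shared.write().unwrap() = Arc::new(SandboxConfig { mode });
}
```

Because the backend flag is set independently of the mode, flipping the mode from disabled to enabled needs no re-detection before the next `wrap()` call.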
+
+## Configuration
+
+```toml
+[agents.sandbox]
+mode = "enabled"                              # "enabled" | "disabled"
+writable_paths = ["/home/user/shared-data"]   # additional writable directories
+passthrough_env = ["GH_TOKEN"]                # env vars to forward to workers
+```
+
+| Key | Type | Default | Description |
+|-----|------|---------|-------------|
+| `mode` | string | `"enabled"` | `"enabled"` for OS-level containment, `"disabled"` for passthrough |
+| `writable_paths` | string[] | `[]` | Additional directories workers can write to beyond the workspace |
+| `passthrough_env` | string[] | `[]` | Environment variable names to forward from the parent process |
+
+See [Configuration](/docs/config#agentssandbox) for the full config reference.
+
+## Protection Layers
+
+The sandbox is one layer in a defense-in-depth model:
+
+| Layer | What It Does | Scope |
+|-------|-------------|-------|
+| **Sandbox (filesystem)** | Read allowlist + writable workspace/writable_paths/tmp; blocks agent data dir | Shell, exec subprocesses |
+| **Env sanitization** | Clean environment, no inherited secrets | All subprocesses (including passthrough mode) |
+| **File tool workspace guard** | Path validation against workspace boundary | File tool only (in-process) |
+| **Exec env var blocklist** | Blocks `LD_PRELOAD`, `DYLD_INSERT_LIBRARIES`, etc. | Exec tool |
+| **Leak detection** | Regex scan of all tool output for secret patterns | All tools via SpacebotHook |
+| **Output scrubbing** | Exact-match redaction of known secret values | Worker output, status updates, OpenCode events |
+| **Permissions system** | Application-level tool access control | All tools |
+
+The sandbox and permissions system are complementary. The [permissions system](/docs/permissions) controls which tools an agent can use and what paths the LLM is allowed to access at the application level. The sandbox enforces filesystem boundaries at the kernel level for subprocesses that are allowed to run.
@@ -175,19 +175,17 @@ async fn call(&self, args: Self::Args) -> Result<Self::Output, Self::Error> {
 
 ### Sandbox containment
 
-Shell and exec commands run inside an OS-level sandbox (bubblewrap on Linux, sandbox-exec on macOS). The entire filesystem is mounted read-only except the workspace, `/tmp`, and any configured `writable_paths`. The agent's data directory (databases, config files) is explicitly protected. See [Configuration](/docs/config#agentssandbox) for sandbox config options.
+Shell and exec commands run inside an OS-level sandbox (bubblewrap on Linux, sandbox-exec on macOS). The entire filesystem is mounted read-only except the workspace, `/tmp`, and any configured `writable_paths`. The agent's data directory (databases, config files) is explicitly protected.
+
+Worker subprocesses also start with a clean environment -- they never inherit the parent's environment variables. System secrets (LLM API keys, messaging tokens) are never visible to workers regardless of sandbox mode. See [Sandbox](/docs/sandbox) for full details.
 
 The `file` tool independently validates paths against the workspace boundary and rejects writes to identity files (`SOUL.md`, `IDENTITY.md`, `USER.md`). The `exec` tool blocks dangerous environment variables (`LD_PRELOAD`, `DYLD_INSERT_LIBRARIES`, etc.) that enable library injection regardless of sandbox state.
 
-Leak detection (via `SpacebotHook`) scans all tool output for secret patterns (API keys, tokens, PEM keys) and terminates the process if a leak is found.
+Leak detection (via `SpacebotHook`) scans all tool output for secret patterns (API keys, tokens, PEM keys) and terminates the process if a leak is found. This includes base64-encoded, URL-encoded, and hex-encoded variants.
 
 ### Status reporting
 
-Workers report progress via `set_status`. The channel sees these in its status block. Status updates use `try_send` (non-blocking) so a slow event bus never blocks tool execution.
-
-### Fire-and-forget sends
-
-`set_status` uses `try_send` instead of `.await` on the event channel. If the channel is full, the update is dropped rather than blocking the worker.
+Workers report progress via `set_status`, and the channel sees those updates in its status block. `set_status` uses `try_send` (non-blocking), so if the event channel is full the update is dropped instead of blocking the worker.
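The drop-on-full behavior can be illustrated with the standard library's bounded channel. This is a sketch under assumptions: it uses `std::sync::mpsc::SyncSender::try_send` as a stand-in for whatever event channel Spacebot actually uses, and this `set_status` signature is hypothetical.

```rust
use std::sync::mpsc::{SyncSender, TrySendError};

/// Send a status update without blocking.
/// Returns true if the update was queued, false if it was dropped.
fn set_status(tx: &SyncSender<String>, status: &str) -> bool {
    match tx.try_send(status.to_string()) {
        Ok(()) => true,
        // Channel full: drop the update instead of stalling the worker.
        Err(TrySendError::Full(_)) => false,
        // Receiver gone: nothing is listening, so dropping is also safe.
        Err(TrySendError::Disconnected(_)) => false,
    }
}
```

With a channel of capacity 1, the first update is queued, a second update sent before the receiver drains the channel is dropped, and sending succeeds again once the receiver has read the first message.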
 
 ## What Each Tool Does