Backlog Priority

Prioritized by impact order: security and production risk → core platform capabilities → developer experience and ecosystem.

Last updated: 2026-06-01 (P0 acceptance tests complete, strategic P2 research initiated, monitoring automation scripts created, first monitoring cycle executed)

Implemented

Items below were deferred at baseline and have since been implemented in-repo.

Item	Sprint	Files
Key ring CLI support (`--oauth-key-ring-file`, `--oauth-active-kid`, fail-closed validation)	P0-r1	`cli/_mcp_parsers.py`, `cli/_handlers/_mcp.py`
External state checkpoint store (`InMemoryCheckpointStore`, `SQLiteCheckpointStore`, `--checkpoint-store` CLI)	P0-r1	`teaagent/checkpoint.py`, `runner/_core.py`, `chat_agent.py`
DPoP replay TTL configurable (`dpop_replay_ttl` + `--oauth-dpop-replay-ttl` CLI)	P0-r1	`oauth21/_server.py`, `cli/_mcp_parsers.py`
OAuth key rotation overlap window (`OAuthKeyRing.rotate`, `key_for_validation`, `--oauth-rotation-window`)	P0-r2	`oauth21/_store.py`, `oauth21/_server.py`, `oauth21/_resource.py`
Code Mode sandbox profile matrix (`SandboxProfile` enum, `default_sandbox`, `validate_runtime_support`)	P0-r2	`code_mode/_types.py`, `code_mode/__init__.py`
LLM-as-Judge scoring (`JudgeScore`, `run_eval_with_judge`, `make_llm_judge_fn`)	P1-r1	`teaagent/eval.py`
AgentCard + InMemoryAgentRegistry + `agent card` CLI	P1-r1	`teaagent/agentcard.py`, `cli/_agent_parsers.py`
Managed runtime interface (`ManagedRuntimeAdapter` Protocol, `ManagedAgentRunner`, provider stubs)	P1-r2	`teaagent/managed_runtime.py`
LATENCY conformance tier (p50/p95 sampling, threshold check, `latency_samples`/`latency_threshold_ms` params)	P1-r2	`llm_conformance/_types.py`, `llm_conformance/_runner.py`
SQLiteAgentRegistry + A2ADispatcher (persistent A2A registry, in-process routing)	P1-r2	`teaagent/agentcard.py`
Extended conformance tiers: `STREAMING`, `STRUCTURED_OUTPUT`	P2-r1	`llm_conformance/_types.py`, `llm_conformance/_runner.py`
OpenAPI 3.1 schema auto-generation from `ToolRegistry` + `workspace openapi` CLI	P2-r1	`teaagent/openapi.py`, `cli/_misc_parsers.py`
Web audit viewer (`AuditViewerServer`, HTML/JSON routes, `audit serve` CLI)	P2-r2	`teaagent/audit_viewer.py`, `cli/_handlers/_audit.py`
Schema migration framework (`SchemaMigration`, `SQLiteMigrationStore`, `MigrationRunner`, `doctor migration` CLI)	P2-r2	`teaagent/schema_migration.py`, `cli/_handlers/_doctor.py`
Code Mode kernel sandbox hardening (`seccomp_profile`, `apparmor_profile`, `selinux_label`, `oci_runtime` on `ContainerCodeModeBackend`; `IsolateCodeModeBackend` gVisor wrapper; `sandbox_profile_selected`/`sandbox_violation` audit events; `profile`+`audit_logger` params on `execute_code_mode`)	P0-r3	`code_mode/_types.py`, `code_mode/_container.py`, `code_mode/_isolate.py`, `code_mode/__init__.py`
Cross-host OAuthStore persistence (`PostgreSQLOAuthStore` with `DELETE…RETURNING` atomic consume; `RedisOAuthStore` with Lua-script atomic consume, NX nonce/code saves, configurable key prefix)	P0-r3	`oauth21/_pg_store.py`, `oauth21/_redis_store.py`, `oauth21/__init__.py`
Managed runtime audit events (`managed_task_started/completed/failed` on `ManagedAgentRunner.run`; `audit_logger`+`run_id` params); tool-context forwarding for Anthropic and OpenAI runtimes	P1-r3	`teaagent/managed_runtime.py`
Extended conformance tiers: `TOOL_CALLING` (invokes `get_current_time` tool, checks `tool_calls`); `SAFETY` (API-level block + text refusal taxonomy); `LLMToolDefinition`, `LLMToolCall`, `SafetyCategory`, `LLMSafetyBlock` types; tool wiring in all three adapters	P1-r3	`llm/_types.py`, `llm/_adapters.py`, `llm/__init__.py`, `llm_conformance/_types.py`, `llm_conformance/_runner.py`
A2A HTTP discovery + wire protocol: `A2ADiscoveryServer` (serves `/.well-known/agent.json`, handles POST `/a2a/task`); `A2AClient` (`fetch_card`, `delegate`); `FederatedAgentRegistry` (TTL-cached pulls from remote endpoints)	P1-r3	`teaagent/agentcard.py`
GraphQLite production deployment (`GraphQLitePersistentStore`, `GraphQLiteProductionConfig`, index strategy, migration integration, `graphqlite migrate` CLI, production deployment guide)	P2-r3	`teaagent/graphqlite_production.py`, `docs/graphqlite-production.md`, `cli/_handlers/_misc.py`, `cli/_misc_parsers.py`
Hosted doc site infrastructure (`pdoc` dependency, `scripts/build_docs.py` build script, class-level docstrings on core modules)	P2-r3	`pyproject.toml`, `scripts/build_docs.py`, `teaagent/tools.py`, `teaagent/runner/_core.py`, `teaagent/budget.py`, `teaagent/policy.py`, `teaagent/memory.py`
ANP bidirectional adapter governed federation (`ANPGovernedService`, audit correlation, approval/budget invariants)	P1	`teaagent/anp_adapter.py`, `tests/acceptance/test_anp_adapter_flow.py`, `docs/adr/0007-anp-adapter-boundary.md`
Daily-use ergonomics (init, providerless CLI, recipes, sessions, background/attach, model capabilities, KPI)	P1	`teaagent/ergonomics/`, `teaagent/recipes/`, `scripts/measure_time_to_first_run.py`, `examples/ergonomics/`
mtime read-before-write concurrent modification guard (`workspace_write_file` rejects overwrites on mtime mismatch; `workspace_read_file` returns `mtime`; backward compatible)	P0	`teaagent/workspace_tools/_files.py`, `tests/acceptance/test_mtime_read_before_write_flow.py`
Protected paths default deny rules (`.git/` and `.teaagent/` blocked by default in `FilePolicy`; `load_file_policy(include_protected_dirs=)` gating)	P0	`teaagent/file_policy.py`, `tests/acceptance/test_protected_paths_flow.py`
LSP code analysis acceptance (7 tests covering tool registration, tree-sitter relations, candidate path extraction, config enablement)	P0	`tests/acceptance/test_code_analysis_lsp_flow.py`
Declarative sub-agent definitions with Markdown frontmatter (`.md` file support matching Claude Code `.claude/agents/*.md` convention; `isolation`/`background`/`disallowed_tools`/`effort` fields on `SubagentDef`)	P1	`teaagent/subagents/_types.py`, `teaagent/subagents/_loader.py`, `tests/acceptance/test_subagent_definitions_flow.py`
Context compaction latency SLO (traffic-light zoning boundary tests; compaction preserves recent observations; latency < 100ms SLO)	P1	`teaagent/context.py`, `tests/acceptance/test_context_compaction_slo_flow.py`
Hook lifecycle acceptance elevation (16 tests: PreToolUse veto, PostToolUse chaining, permission_check_hook deny/allow/patterns, all 8 Claude Code events, registry enabled flag)	P1	`tests/acceptance/test_hook_lifecycle_flow.py`
Architecture comparison matrix documenting TeaAgent vs Claude Code/Codex/OpenCode feature coverage	P2	`docs/architecture.md`
Use-case matrix updated with LSP, sub-agent orchestration, mtime guard, and protected paths entries	P2	`docs/use-cases.md`, `docs/use-case-matrix.md`
5-Loop Governance Hardening (Tool, Coding, Audit, Memory, Swarm)	P0	`teaagent/tools.py`, `teaagent/governance/tool_lint.py`, `teaagent/selftest.py`, `teaagent/plan_mode.py`, `teaagent/workflow_engine.py`, `teaagent/audit.py`, `teaagent/memory/failure_card.py`, `teaagent/swarm.py`, `teaagent/tournament/`, `docs/architecture.md`

Open — High (P0)

Note: All P0 items below have been implemented with acceptance tests. See docs/specs/ for technical specifications.

Issue-to-Plan Intake

Status: ✅ Implemented Journey: User pastes an issue, gets ambiguity score, plan artifact, safe command, and acceptance checklist. Acceptance test: test_issue_to_plan_acceptance_flow.py Dependencies: Plan mode, issue parsing, ambiguity detection, plan generation Size: Large (2-3 weeks) Reference: docs/plans/agent-ecosystem-acceptance-roadmap-2026-05-31.md Files: teaagent/issue_intake.py, tests/test_issue_intake.py, docs/specs/issue-to-plan-intake-spec-2026-05-31.md

Plan Review and Revision

Status: ✅ Implemented Journey: User can compare two plan revisions before execution and bind run to the accepted plan hash. Acceptance test: test_plan_review_revision_flow.py Dependencies: Plan versioning, plan diff, run-to-plan binding, hash verification Size: Large (2-3 weeks) Reference: docs/plans/agent-ecosystem-acceptance-roadmap-2026-05-31.md Files: teaagent/plan_storage.py, tests/test_plan_storage.py, docs/specs/plan-review-revision-spec-2026-05-31.md

Guided Recovery

Status: ✅ Implemented Journey: Failed/partial run suggests resume, undo, inspect audit, or retry with safer mode. Acceptance test: test_guided_recovery_flow.py Dependencies: Failure classification, recovery strategy selection, undo/resume integration Size: Medium (1-2 weeks) Reference: docs/plans/agent-ecosystem-acceptance-roadmap-2026-05-31.md Files: teaagent/guided_recovery.py, tests/test_guided_recovery.py, tests/acceptance/test_guided_recovery_flow.py, docs/specs/guided-recovery-spec-2026-05-31.md

Execution Evidence Summary

Status: ✅ Implemented Journey: Completed run emits changed files, commands, tests, approvals, costs, failures, and rollback path. Acceptance test: test_run_evidence_summary_flow.py Dependencies: Audit event extraction, evidence bundle serialization, sensitive value redaction Size: Medium (1-2 weeks) Reference: docs/plans/agent-ecosystem-acceptance-roadmap-2026-05-31.md Files: teaagent/run_evidence.py, tests/test_run_evidence.py, tests/acceptance/test_run_evidence_summary_flow.py, docs/specs/run-evidence-bundle-spec-2026-06-01.md

Daily Cockpit Parity

Status: ✅ Implemented Journey: CLI, TUI, and dashboard expose the same core run, approval, cost, and warning state. Acceptance test: test_daily_cockpit_parity_flow.py Dependencies: Cockpit state model, serialization, run store integration Size: Medium (1-2 weeks) Reference: docs/plans/agent-ecosystem-acceptance-roadmap-2026-05-31.md Files: teaagent/cockpit.py, tests/test_cockpit.py, tests/acceptance/test_daily_cockpit_parity_flow.py

Open — High (P1)

Note: Items extracted from strategic reference documents (competitive positioning, UX improvements, future roadmap).

CP-1 — README Rewrite for Governance-First Narrative [P1]

Status: ✅ Complete Journey: README already leads with governance-first positioning (permission gates, audit trail, undo, cost cap, model-agnostic) Acceptance criteria: README answers all three persona questions in first screen, governance table in top half, time to comprehension < 30 seconds Size: Small (3 days) Reference: docs/plans/competitive-positioning-plan-2026-05-31.md

CP-2 — Security Whitepaper for Enterprise Evaluation [P1]

Status: ✅ Complete Journey: Security whitepaper exists documenting governance model, control catalog, NIST mapping, data handling, deployment isolation, incident response Acceptance criteria: Document exists and reviewed by security background, every control has traceable code path, honest "known limitations" section Size: Medium (1 week) Reference: docs/plans/competitive-positioning-plan-2026-05-31.md

CP-3 — Aider-Style Commit-Per-Change Visibility [P1]

Status: ✅ Complete Journey: Surface git commit-per-change workflow prominently (stage in branch, review diff, commit/discard, structured commit messages with run metadata) Acceptance criteria: teaagent show --diff works without git knowledge, commit message includes run metadata, teaagent commit is idempotent Size: Medium (1 week) Reference: docs/plans/competitive-positioning-plan-2026-05-31.md

UX1.1 — Post-Run Summary [P1]

Status: ✅ Complete Journey: Emit structured summary at end of every run (tools called, files changed, cost, budget remaining, audit log location, undo command) Acceptance criteria: Run summary always emitted at session end, --no-summary flag suppresses it, summary includes files changed/cost/undo command Size: Small (3 days) Reference: docs/plans/ux-improvement-roadmap-2026-05-31.md

UX1.2 — Proactive Budget Warnings [P1]

Status: ✅ Complete Journey: Emit budget warnings at 50%, 80%, 90% (with prompt), 100% (offer read-only mode) Acceptance criteria: Tests trigger at 50%/80%/90%/100%, TUI shows persistent budget status line, warning in post-run summary Size: Small (3 days) Reference: docs/plans/ux-improvement-roadmap-2026-05-31.md

UX1.3 — One-Command Undo with Diff Preview [P1]

Status: ✅ Complete Journey: Add --preview flag to undo command, add --last shortcut, show unified diff of what will be reverted Acceptance criteria: teaagent undo --last --preview shows readable diff without executing, teaagent undo --last reverts all writes, post-run summary includes undo command Size: Small (3 days) Reference: docs/plans/ux-improvement-roadmap-2026-05-31.md

UX3.1 — teaagent init < 2 Minutes to First Useful Output [P1]

Status: ✅ Complete Journey: Implement guided init flow (provider selection, permission mode, config write) that completes in < 2 minutes Acceptance criteria: Init completes in < 2 minutes with guided prompts, after init teaagent "hello" produces useful response, init is documented first step in README Size: Medium (1 week) Reference: docs/plans/ux-improvement-roadmap-2026-05-31.md

UX3.2 — First-Run Orientation Message [P1]

Status: ✅ Complete Journey: On first run (no .teaagent/), show orientation message explaining governance features (approval gates, audit log, undo, budget cap) Acceptance criteria: Orientation message shown on first run, explains key governance features, includes next steps Size: Small (2 days) Reference: docs/plans/ux-improvement-roadmap-2026-05-31.md

UX2.1 — Persistent Decision Log [P1]

Status: ✅ Complete Journey: Implement DecisionLog stored in .teaagent/decisions.md, inject into system prompt, CLI commands to list/add decisions Acceptance criteria: teaagent memory decisions list shows all decisions, teaagent memory decisions add appends entry, system prompt includes 10 most recent decisions Size: Medium (1 week) Reference: docs/plans/ux-improvement-roadmap-2026-05-31.md

UX2.2 — Proactive Context Compaction Warning [P1]

Status: ✅ Complete Journey: Track context window usage, warn at 60% with /compact suggestion, implement one-command summary-and-continue Acceptance criteria: Warning triggers at configurable threshold (default 60%), /compact summarizes conversation and replaces with compressed version Size: Medium (1 week) Reference: docs/plans/ux-improvement-roadmap-2026-05-31.md

UX2.3 — Cross-Session Scratchpad [P1]

Status: ✅ Complete Journey: Write structured scratchpad on session end (goal, progress, open questions, next step), offer resumption on next session start Acceptance criteria: Scratchpad written on clean exit/ctrl+c/crash, offered for resumption on next session start, teaagent sessions list shows recent sessions Size: Medium (1 week) Reference: docs/plans/ux-improvement-roadmap-2026-05-31.md

Completed — High (P0) - Security & Reliability

Note: All P0 security and reliability items from comprehensive audit have been completed.

S1 — Fix AuditLevel.L3 "encrypted at rest" documentation fraud [P0]

Status: ✅ Complete Journey: Removed incorrect "encrypted at rest" claim from audit.py docstring, updated threat-model.md with L3 plaintext row Acceptance criteria: grep returns no "encrypted at rest" in audit.py, threat-model.md has L3 plaintext row, no other file claims L3 encrypts Size: Small (1 day) Reference: docs/plans/comprehensive-plan-all-aspects-2026-05-31.md

S2 — Gate ChildProcessCodeModeBackend as trusted-user only [P0]

Status: ✅ Complete Journey: Added docstring warning, added trusted_only field that raises ValueError when False, updated security-spec.md with trust boundary table Acceptance criteria: ChildProcessCodeModeBackend(trusted_only=False) raises ValueError, docstring change auditable via git log Size: Small (1 day) Reference: docs/plans/comprehensive-plan-all-aspects-2026-05-31.md

R1 — Fix fragile async loop management in approval paths [P0]

Status: ✅ Complete Journey: Replaced new_loop pattern in approval_manager.py and policy.py with proper async loop handling to prevent "Event loop is closed" errors Acceptance criteria: No asyncio.set_event_loop(new_loop) pattern in approval paths, tests verify no closed loop errors Size: Small (2 days) Reference: docs/plans/comprehensive-plan-all-aspects-2026-05-31.md

S-H3 — workspace_run_shell shell=True → argv-only path [P0]

Status: ✅ Complete Journey: Converted shell=True to argv-only path in workspace_run_shell for security Acceptance criteria: shell=True removed, argv-only path used, tests pass Size: Small (2 days) Reference: docs/plans/remediation-roadmap.md

S-H4 — Shell normalization adversarial matrix gaps [P0]

Status: ✅ Complete Journey: Completed ShellObfuscationTests coverage for adversarial matrix Acceptance criteria: ShellObfuscationTests pass, adversarial matrix complete Size: Medium (3 days) Reference: docs/plans/remediation-roadmap.md

S-H5 — MCP loopback no-auth default [P0]

Status: ✅ Complete Journey: Required token when TEAAGENT_STRICT_LOCAL=1 for MCP loopback Acceptance criteria: MCP loopback requires token when strict local mode enabled, tests pass Size: Small (2 days) Reference: docs/plans/remediation-roadmap.md

S-H6 — Vote relay loopback without auth_policy [P0]

Status: ✅ Complete Journey: Auto-load relay-tokens.json or env for vote relay auth Acceptance criteria: surface_auth.default_relay_token_file implemented, tests pass Size: Small (2 days) Reference: docs/plans/remediation-roadmap.md

S-H8 — Plugin audit fail-open [P0]

Status: ✅ Complete Journey: Implemented fail-closed when TEAAGENT_PLUGINS_STRICT=1 Acceptance criteria: Plugin fails closed when strict mode enabled, tests pass Size: Small (2 days) Reference: docs/plans/remediation-roadmap.md

S-M2 — Plaintext bearer token files [P1]

Status: ✅ Complete Journey: Documented plaintext bearer token files as known limitation with operational guidance Acceptance criteria: Bearer token files documented as known limitation with chmod 600 guidance Size: Small (documentation only) Reference: docs/plans/remediation-roadmap.md

S-M3 — Audit verify swallows exceptions [P1]

Status: ✅ Complete Journey: Verified audit verify raises exceptions properly Acceptance criteria: Audit verify raises exceptions properly, tests pass Size: Small (verification only) Reference: `docs/plans/remediation-roadmap.md``

Completed Reference Plans

The following plans are marked as complete and are retained for reference:

daily-driver-backlog-2026-06-01.md: ✅ COMPLETE - All 6 tickets implemented (chat REPL fixes, cost accounting, compaction, shared controller, TUI scrollback)
governance-hardening.md: ✅ SHIPPED - All tranches shipped (tool lint, plan gate, audit, failure cards, MCP trust, approval queue, swarm, Phase 4-6 Beta)
daily-driver-execution-readiness-2026-06-01.md: ✅ COMPLETE - All items marked as completed
daily-driver-hardening-plan-2026-06-01.md: ✅ COMPLETE - All items marked as completed

Open — Medium (P2)

CP-4 — OpenCode Gap Watch [P2]

Status: 📋 Ongoing monitoring (automation script created) Journey: Monitor OpenCode GitHub releases monthly for governance-related features, escalate if approval gates added Acceptance criteria: Monthly checks performed, escalation trigger defined (50+ votes on governance issues) Size: Ongoing Reference: docs/plans/competitive-positioning-plan-2026-05-31.md Process document: docs/processes/opencode-gap-watch.md Automation script: scripts/opencode_gap_watch.py

CP-5 — Model Capability Matrix [P2]

Status: ✅ Complete Journey: Create model capability matrix document mapping features to models (extended thinking, tool use, streaming, multi-sig, cost tracking) Acceptance criteria: Matrix exists and updated with provider adapter changes, README links to matrix Size: Small (2 days) Reference: docs/plans/competitive-positioning-plan-2026-05-31.md

CP-6 — Community Presence / Developer Relations [P3]

Status: 📋 Process (not binary) (automation script created) Journey: Post on r/LocalLLaMA, Hacker News, GitHub, Dev.to; track GitHub stars, Reddit mentions, HN upvotes Acceptance criteria: Track metrics monthly, respond to governance-related competitor issues Size: Ongoing Reference: docs/plans/competitive-positioning-plan-2026-05-31.md Process document: docs/processes/community-presence.md Automation script: scripts/community_presence_monitor.py

Strategic Reference Documents

For long-term planning, positioning, and UX research, see:

docs/plans/competitive-positioning-plan-2026-05-31.md - README rewrite, security whitepaper, demo scripts
docs/plans/ux-improvement-roadmap-2026-05-31.md - Post-run summary, budget warnings, undo UX, memory/context
docs/plans/future-roadmap-risk-usability-backlog-2026-05-31.md - Large strategic backlog with horizons

These are reference documents for strategic decisions. Extract actionable items to this backlog as needed.

Phase 4-6 Status — Beta (All Shipped)

Phase 4 (consensus), Phase 5 (sandbox routing + execution), and Phase 6 (skill writer, docker monitor, control plane) are Beta with all items marked as shipped. No remaining hardening work identified.

Future Roadmap — Phase 4-5 (Beta — core shipped, E2E hardening ongoing)

Phase 4 (consensus) and Phase 5 (sandbox routing + execution) are Beta with CLI, unit tests, and acceptance flows. Optional hardening (async votes, skill execution, docker-smoke CI) is shipped. See architecture.md.

Phase 4: Federated Swarm Consensus & Peer Attestations — Beta

Task	Priority	Status	Evidence
Consensus data structures	P1	Shipped	`teaagent/consensus.py`, `tests/test_consensus.py`
Peer identity management	P1	Shipped	`PeerRegistry`, CLI `teaagent consensus peers`
Voting mechanism	P1	Shipped	`VotingMechanism`, `tests/test_consensus.py`
Consensus engine	P0	Shipped	`ConsensusEngine`, attestation + pre-approval
Swarm integration	P0	Shipped	`SwarmManager` + pre-approval gate
CLI commands	P2	Shipped	`teaagent/cli/_handlers/_consensus.py`
E2E acceptance	P2	Shipped	`tests/acceptance/test_consensus_flow.py`

Phase 5: Hardened Sandbox Virtualization — Beta

Task	Priority	Status	Evidence
Docker resource limits	P1	Shipped	`prepare_subagent_isolation` `--cpus` / `--memory`
WASM runtime wrapper	P0	Shipped	`teaagent/wasm_runtime.py`, `tests/test_wasm_runtime.py`
Skill execution routing	P1	Shipped	`teaagent/skill_router.py`, `isolation=auto` on subagents
Resource monitoring	P2	Shipped	`teaagent/resource_monitor.py`, `teaagent sandbox monitor`
CLI commands	P2	Shipped	`teaagent/cli/_handlers/_sandbox.py`
E2E acceptance	P2	Shipped	`tests/acceptance/test_sandbox_enhancement_flow.py`

Remaining (optional hardening)

Task	Priority	Status	Description
Async consensus UX	P2	Shipped	Non-blocking vote collection via `async_vote_collection` + poll
WASM skill execution	P1	Shipped	`teaagent/skill_executor.py`, `teaagent sandbox execute`
Docker dogfood benchmarks	P2	Shipped	Optional CI `docker-smoke` job (`tests/test_phase6_docker.py`)

Phase 6: Skill writer, docker monitor, control plane — Beta

Task	Priority	Status	Evidence
SkillWriter publish + review	P1	Shipped	`teaagent/skill_writer.py`, `tests/test_phase6_skill_writer.py`
Docker resource monitor + abort	P1	Shipped	`teaagent/docker_sandbox.py`, `tests/test_phase6_docker.py`
Prompt tournament scoring	P2	Shipped	`teaagent/swarm.py`, `tests/test_phase6_swarm_score.py`
Control plane HTTP + dashboard	P2	Shipped	`teaagent/control_plane_api.py`, `tests/test_phase6_control_plane.py`
Control plane CLI	P2	Shipped	`teaagent control-plane serve`
JIT approval server	P2	Shipped	`teaagent/jit_approval_server.py`, `tests/test_phase6_jit_server.py`
External peer voting UX	P2	Shipped	`teaagent consensus wait`, `votes-import`, `import_votes_batch`
Comparator tournament from subagent results	P2	Shipped	`teaagent/tournament/benchmark.py`, `tests/test_phase6_remaining_features.py`
Control plane swarm dogfood	P2	Shipped	`teaagent/control_plane_bridge.py`, `SwarmManager(control_plane_state=…)`
WASM skill contract CLI	P2	Shipped	`teaagent/wasm_skill.py`, `teaagent sandbox wasm-contract`
SSH vote relay (production peers)	P1	Shipped	`teaagent/vote_relay.py`, `teaagent consensus relay`
WASM skill CI template workflow	P2	Shipped	`.github/workflows/wasm-skill-build.yml`, `docs/wasm-skill-ci.md`
Multi-tenant control plane	P2	Shipped	`teaagent/control_plane_tenant.py`, `X-TeaAgent-Tenant`
Relay mTLS + API tokens	P1	Shipped	`teaagent/surface_auth.py`, `teaagent/tls_server.py`, `docs/http-surface-auth.md`
Per-tenant authZ + reverse proxy templates	P2	Shipped	`templates/reverse-proxy/`, control plane bearer enforcement

Open — Daily-use ergonomics

All items from the 2026-05-22 ergonomics tranche are shipped. Future work is strategic (hosted surfaces, repo-map quality eval) under ongoing maintenance below.

Recently completed (competitive refresh, 2026-05-22)

Item	Notes
Docs/provider architecture drift guard	`scripts/validate_docs_consistency.py` now checks README/USAGE/architecture counts against `PROVIDER_CONFIGS`, unique credential env vars (including shared `CLOUDFLARE_API_TOKEN` / `OPENCODEZEN_API_KEY`), and survey doc freshness; `docs/architecture.md` updated to 13 providers.
Mode and safety comparison matrix	`docs/USAGE.md` matrix maps permission modes, Plan/Auto/Code lanes, shell mutation, approvals, audit, and rollback; validator enforces every `PermissionMode` value and required topics.
Multi-surface launch recipes	`docs/USAGE.md` “Choose Your Surface” table with CLI/TUI/VS Code/MCP/ACP/A2A/ANP/managed-runtime recipes; `validate_surface_recipes` + `test_surface_launch_recipes_flow.py` smoke local commands.
Subagent lineage and isolation hardening	Child runs record lineage metadata; `isolation: shared`, `worktree`, or `container` snapshot; `worktree_path`/`container_path` in lineage; `test_subagent_worktree_isolation_flow.py`, `test_subagent_container_isolation_flow.py`.
Repo-map / context pack for coding runs	`build_context_pack` on `agent preflight`; candidate files, memories, optional LSP symbols, hybrid/knowledge/GraphQLite read-only hits; `test_context_pack_read_only_flow.py`.
Plugin/skill compatibility catalog	`docs/plugin-skill-catalog.md` with fixture-backed `validate_plugin_skill_catalog`.
Competitive use-case dashboard refresh	Matrix/HTML include survey review date and open P1/P2 gap counts via `build_use_case_matrix.py` + `render_use_case_dashboard.py`.
Periodic mainstream-agent refresh cadence	`docs/release-checklist.md` requires survey refresh before minor releases or protocol ADRs.
Everyday usage onboarding	README “Daily Use in 5 Commands”, USAGE daily section, cli recipe table, TUI `daily → preflight → ask → resume` flow.

Ongoing maintenance (competitive refresh complete)

Competitive-refresh feature work is shipped. Before minor releases or protocol ADRs, run:

python3 scripts/refresh_competitive_docs.py

Then manually refresh scripts/refresh_agent_readme_survey.md when upstream signals change (see docs/release-checklist.md).

Cadence	What to keep in sync
Survey	`Last reviewed` date, source table, `docs/use-cases.md` differentiators
Docs drift	`validate_docs_consistency.py` (providers, mode matrix, surface recipes, catalog)
Coverage artifacts	`docs/acceptance.md` count, `docs/use-case-matrix.md`, `docs/use-case-matrix.html` via `refresh_competitive_docs.py --check` (CI/review) and `refresh_competitive_docs.py` (intentional regeneration)
Extension surfaces	`docs/plugin-skill-catalog.md`, `docs/USAGE.md`, `docs/cli.md` when hooks, isolation modes, or preflight/context APIs change
New index backends	`teaagent/context_pack.py` read-only graph evidence when adding search stores
Strategic P2 research	hosted/cloud surface docs, background sessions, desktop/client-server packaging, repo-map quality evaluation
Ergonomics KPI	`python3 scripts/measure_time_to_first_run.py --write docs/ergonomics-kpi.json` before regenerating the use-case matrix

FilesExpand file tree

backlog-priority.md

Latest commit

History