Agents.KT/docs/roadmap.md at main · Deep-CodeAI/Agents.KT

Roadmap

Release narrative

0.5.0   Agents with boundaries                       — shipped
0.6.0   Boundaries you can audit                     — shipped (epic [#1911](../../issues/1911))
0.7.0   Boundaries you can enforce externally        — shipped (epic [#2879](../../issues/2879))
0.8.0   Interoperable, multimodal agents (+ grants)  — shipped (A2A v1, multimodal, RAG, composition, Gemini, capability grants)
0.9.0   Layer-2 sandbox backends                     — next (Docker/proxy/read-confinement)

0.8.0 shipped: agent-to-agent interop (A2A v1 — server + typed client), full multimodal (audio STT/TTS, vision, image generation), the RAG seam, richer composition (handoff / firstOf / .speculative / loopUntil / built-in aggregators / forum captains), human-in-the-loop gates + the eval harness, history compression, an eighth model provider (Google Gemini #1917), agent.json serialization (#4516), and capability grants (grants { allow / confirm } #4545), plus the agentic-web standards groundwork (AGNTCY / AG-UI / x402 / NLWeb, PRD §12.6–§12.9). The "sandbox backends" originally pencilled for 0.8 slipped: WasmSandbox (#2894) was closed won't-do (embedded-WASM-for-tools isn't rational; agent → WASM export is the separate forward track #4547), and DockerSandbox (#2895), the egress hostname-allowlist proxy (#2893), and read confinement (#4546) move to 0.9.0 (they want a Linux-capable environment to build + verify).

0.7.0 shipped (epic #2879): runtime enforcement of declared tool policies — Layer 1 in-JVM filesystem gate (#2890) + Layer 2 OS sandbox (#1916): macOS Seatbelt, Linux bubblewrap, firejail setuid fallback, plain-ProcessBuilder fallback; write-root + env + cwd confinement; default-deny network — and the standalone agents-kt CLI (#1923) for manifest generate/inspect/verify outside Gradle. Deferred to 0.8: WasmSandbox (#2894), DockerSandbox (#2895), the network hostname-allowlist proxy (#2893), and the grants { } structure DSL.

0.6.0 hero feature: the permission manifest / capability graph (#1912) — a deterministic YAML/JSON artifact showing every agent / skill / tool / memory access / MCP endpoint / provider / budget / policy boundary in a system. Build-time evidence for security review; the manifest hash (#1913) propagates into every runtime audit event so dynamic behaviour ties back to the signed-off capability graph.

The 0.6.0 epic (#1911) tracks the full acceptance criteria. The phase layout below remains time-based; the release-arc tags below each item show which release that item targets.

Phase 1 — Core DSL (in progress)

Phase 2 — Runtime + Distribution (Q2 2026)

Priority — 0.6.0 hero:

Permission manifest / capability graph — pipeline.permissionManifest { } DSL on agents and compositions; writeYaml(file) / writeJson(file) emit deterministic output; Gradle task agentManifest plus verifyAgentManifest fails CI when high-risk boundaries widen. Captures agents, skills, tools, memory R/W, budgets, MCP client/server snapshots, providers (secrets masked), guardrail hooks, and composition structure. Lives in :agents-kt-manifest (zero vendor deps). The manifest SHA-256 is attached to every agent in the graph for runtime correlation. (#1912)
Manifest hash + request/session IDs in runtime audit events — AgentRuntimeContext carries requestId (UUIDv4 per invoke), sessionId (per agent.session()), manifestHash (sha256 of the deterministic manifest, null until generated). Every PipelineEvent / AgentEvent includes these three; consumed by the OTel bridge (#1908) and the JSONL exporter (#1914). Closes the loop from build-time evidence to runtime behaviour. (#1913)
JSONL audit log exporter — append-only, one event per line, grep/jq-friendly. Schema covers requestId / sessionId / manifestHash / agentId / skillId / toolId / eventType / timestamp / inputType / outputType / budgetState / guardrailDecision / mcpClientId / toolPolicyRisk / usedDeclaredCapability / provider / model. Lives in :agents-kt-observability, masks raw args/results by omission, supports size/day rotation, and handles write backpressure without throwing into the agent path. Sibling to the OTel bridge (#1908) for teams that need a deterministic on-disk record. (#1914)
Declarative tool sandbox policy DSL (0.6.0 — declarative only, enforcement in 0.7.0) — tool(..., policy { risk = ToolRisk.Medium; filesystem { read("/uploads/**"); writeNone() }; network { denyAll() } }). ToolPolicy captures risk, filesystem, network, and environment sub-policies with deterministic map/JSON/YAML manifest helpers. Audit events note toolPolicyRisk and usedDeclaredCapability. The enforcement layer is sibling #1916. (#1915)

Priority — 0.6.0 platform + follow-ups:

Secondary:

Phase 3 — Production (Q3 2026)

Phase 4 — Ecosystem (Q4 2026)

Knowledge packs — battle-tested prompt libraries for common domains
Agent generation from natural language (NL → Kotlin DSL)
Skillify — extract reusable skills from session transcripts
Visual structure editor, UML bidirectional conversion
Knowledge marketplace
Comparison page — docs/comparison.md with a feature matrix vs LangChain (Py + LangChain4j), Microsoft Semantic Kernel, AutoGen, and a raw MCP client; covers typed Agent<IN,OUT>, runtime tool allowlist, MCP client/server, native streaming, budgets, sandboxing, KSP/compile-time validation, language, local-first model support; honest "where Agents.KT is weaker" subsection. (#1906)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Roadmap

Release narrative

FilesExpand file tree

roadmap.md

Latest commit

History

roadmap.md

File metadata and controls

Roadmap

Release narrative