Draft. This document turns the current product and engineering docs into a dependency-driven implementation roadmap.
It is intentionally organized around correctness gates, not feature count.
The roadmap follows these rules:
- Freeze semantics before implementation invents behavior.
- Build the pure deterministic allocator before adding IO and transport.
- Remove prototype hot-path costs before calling the core production-grade.
- Prove recovery and boundedness before polishing APIs.
- Add simulation before distributed features.
- Start replication design only after the single-node core is credible.
These workstreams can overlap, but they should not violate milestone dependencies:
- semantics and API surface
- core engine
- durability and recovery
- submission pipeline
- simulation and fault injection
- observability and operational packaging
- documentation
- bounded implementation spikes
Spikes are allowed only for implementation uncertainty, not for reopening settled semantics.
Use a spike when:
- multiple implementation shapes are plausible
- the choice materially affects boundedness, determinism, or simplicity
- a short experiment can retire meaningful technical risk
Do not use a spike when the issue is already a product or semantics decision.
Spike outputs must be:
- time-boxed
- disposable by default
- documented with the decision they support
See spikes.md.
Recent external review tightened the near-term plan in four ways:
- the sorted-Vec tables were acceptable for the first deterministic slice, but they are not the intended production-grade shape; retirement work must stop scanning whole tables on every apply path
- WAL recovery must distinguish an expected EOF torn tail from a genuine corruption event in the middle of durable history
- observability must expose logical executor lag explicitly
- the version field may justify a version-guarded confirm API if the product wants read-modify-write protection beyond reservation_id
This feedback changes the order of work: core-table hardening comes before deeper M3 work.
M0: semantic freeze
Goal:
- stabilize the v1 rules so the codebase does not invent semantics during implementation
Deliverables:
- stable command and result-code surface
- stable retention and capacity knobs
- stable trusted-core crate boundary
- stable slot and TTL rules
Exit criteria:
- no open semantic questions affecting core state transitions
- no ambiguity in indefinite-outcome handling
- no ambiguity in bounded retention behavior
- candidate spike list is approved so experimentation does not drift into product design
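As an illustration of what a frozen command and result-code surface can mean in practice, the sketch below uses closed enums and an exhaustive match, so any later addition is a visible semantic change. All names here (Command, ResultCode, the specific variants) are hypothetical, not AllocDB's actual API:

```rust
// Hypothetical command surface: illustrative only, not the frozen v1 set.
#[derive(Debug, Clone, PartialEq)]
enum Command {
    Reserve { resource_id: u64, reservation_id: u64, ttl_ticks: u32 },
    Confirm { reservation_id: u64 },
    Release { reservation_id: u64 },
}

// Hypothetical closed result-code set: every apply returns exactly one
// of these, so clients never interpret free-form errors.
#[derive(Debug, Clone, Copy, PartialEq)]
enum ResultCode {
    Ok,
    ResourceExhausted,
    ReservationNotFound,
    ReservationExpired,
    CapacityExceeded,
}

fn code_name(code: ResultCode) -> &'static str {
    // Exhaustive match: adding a result code is a semantic change the
    // compiler forces every caller to acknowledge.
    match code {
        ResultCode::Ok => "ok",
        ResultCode::ResourceExhausted => "resource_exhausted",
        ResultCode::ReservationNotFound => "reservation_not_found",
        ResultCode::ReservationExpired => "reservation_expired",
        ResultCode::CapacityExceeded => "capacity_exceeded",
    }
}
```

The point is not the specific variants but the shape: a closed surface that can be reviewed once at the freeze and then mechanically enforced.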
Primary docs:
M1: deterministic in-memory core
Goal:
- implement the allocator as an in-memory deterministic state machine with fixed-capacity structures
Deliverables:
- resource, reservation, and operation models
- fixed-capacity tables
- deterministic expiration index
- command apply functions
- invariant checks
- decisions from the table and timing-wheel spikes folded into the implementation
Exit criteria:
- state-machine tests cover all transitions
- property tests prove no double allocation
- capacity tests prove fail-fast behavior at bounds
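A minimal sketch of the shape this milestone targets, assuming a fixed-capacity table and an idempotency check ahead of allocation. Allocator, ApplyResult, reserve, and check_invariants are hypothetical names; the sketch also assumes configured capacity never exceeds the fixed table size:

```rust
// Illustrative only: one resource, reservations in a fixed-size array.
const MAX_RESERVATIONS: usize = 4;

#[derive(Default)]
struct Allocator {
    capacity: u32,
    allocated: u32,
    reservations: [Option<u64>; MAX_RESERVATIONS], // reservation_id per slot
}

#[derive(Debug, PartialEq)]
enum ApplyResult {
    Ok,
    Duplicate,
    Exhausted,
}

impl Allocator {
    fn new(capacity: u32) -> Self {
        Allocator { capacity, ..Default::default() }
    }

    // Deterministic apply: the same command sequence always yields the
    // same state, with no clock or IO dependence.
    fn reserve(&mut self, reservation_id: u64) -> ApplyResult {
        // Idempotent duplicate check first: retries must not consume slots.
        if self.reservations.iter().flatten().any(|&id| id == reservation_id) {
            return ApplyResult::Duplicate;
        }
        // Fail fast at the configured bound instead of growing.
        if self.allocated >= self.capacity {
            return ApplyResult::Exhausted;
        }
        let slot = self
            .reservations
            .iter_mut()
            .find(|s| s.is_none())
            .expect("sketch assumes capacity <= MAX_RESERVATIONS");
        *slot = Some(reservation_id);
        self.allocated += 1;
        ApplyResult::Ok
    }

    // Invariant a property test would check after every apply: occupied
    // slots match the allocated count, which never exceeds capacity.
    fn check_invariants(&self) -> bool {
        let occupied = self.reservations.iter().flatten().count() as u32;
        occupied == self.allocated && self.allocated <= self.capacity
    }
}
```

A property test would drive reserve with arbitrary id sequences and assert check_invariants after each step, which is what "no double allocation" reduces to in this shape.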
M1H: core-table and hot-path hardening
Goal:
- replace prototype table and retirement mechanics with production-grade constant-time structures
Deliverables:
- deterministic fixed-capacity open-addressed tables for resource, reservation, and operation lookup
- explicit retirement queues or equivalent ordered indexes so retirement work drains expired items instead of scanning whole tables
- measured probe-bound and full-capacity behavior documented by tests or focused experiments
- decision on whether a version-guarded conditional_confirm is required in v1
Exit criteria:
- resource, reservation, and operation lookup do not depend on binary search or shifting inserts
- retirement work is proportional to expired entries, not table size
- constant-time behavior remains deterministic at configured limits
- the semantics decision for version-guarded confirm is explicit
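To make the exit criteria concrete, here is a sketch of a deterministic fixed-capacity open-addressed table with linear probing: no binary search, no shifting inserts, no allocation on the hot path, and a deterministic fail-fast at the bound. The table layout, the placeholder hash, and all names are illustrative assumptions, not AllocDB's actual structures; deletion/tombstone handling is out of scope for the sketch:

```rust
// Power-of-two size so the index can mask instead of taking a modulus.
const TABLE_SLOTS: usize = 8;

struct FixedTable {
    slots: [Option<(u64, u64)>; TABLE_SLOTS], // (key, value)
}

impl FixedTable {
    fn new() -> Self {
        FixedTable { slots: [None; TABLE_SLOTS] }
    }

    fn index(key: u64, probe: usize) -> usize {
        // Trivial placeholder hash; a real table would mix bits properly.
        (key as usize).wrapping_add(probe) & (TABLE_SLOTS - 1)
    }

    // Probe length is bounded by the table size: the table fails fast
    // when full instead of degrading into resizing or unbounded scans.
    fn insert(&mut self, key: u64, value: u64) -> bool {
        for probe in 0..TABLE_SLOTS {
            let i = Self::index(key, probe);
            match self.slots[i] {
                None => {
                    self.slots[i] = Some((key, value));
                    return true;
                }
                Some((k, _)) if k == key => {
                    self.slots[i] = Some((key, value)); // overwrite in place
                    return true;
                }
                _ => {} // occupied by another key: keep probing
            }
        }
        false // at capacity: deterministic fail-fast
    }

    // Stopping at the first empty slot is only correct without deletions;
    // a production table needs tombstones or backward-shift deletion.
    fn get(&self, key: u64) -> Option<u64> {
        for probe in 0..TABLE_SLOTS {
            let i = Self::index(key, probe);
            match self.slots[i] {
                None => return None,
                Some((k, v)) if k == key => return Some(v),
                _ => {}
            }
        }
        None
    }
}
```

The measured probe-bound deliverable would then be tests over this shape at full configured capacity, not asymptotic claims.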
M2: durability and recovery
Goal:
- make the state machine durable and replayable through WAL and snapshots
Deliverables:
- WAL frame format and codec
- append and recovery scanner
- snapshot format
- replay path using the same apply logic
- corruption and torn-tail handling
- checkpoint coordination and safe WAL truncation rules
- decisions from the WAL spike folded into the implementation
Exit criteria:
- live apply and replay produce identical results
- crash-recovery tests pass
- corrupted or torn local state fails closed
- snapshot/WAL coordination preserves overlapping durable history across checkpoint replacement
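The torn-tail distinction above can be sketched as a recovery scan over a framed log: an incomplete final frame at the physical end of the log is an expected crash artifact, while a checksum mismatch on a fully present frame fails closed as corruption. The frame layout (length, checksum, payload) and the trivial additive checksum are placeholders for AllocDB's real codec; a real scanner would also treat a mismatched final frame as a possible torn tail rather than definite corruption:

```rust
#[derive(Debug, PartialEq)]
enum ScanEnd {
    CleanEof,   // buffer ended exactly on a frame boundary
    TornTail,   // incomplete final frame: expected after a crash
    Corruption, // bad checksum inside durable history: fail closed
}

// Placeholder checksum standing in for a real CRC.
fn checksum(payload: &[u8]) -> u32 {
    payload.iter().fold(0u32, |acc, &b| acc.wrapping_add(b as u32))
}

// Frame: 4-byte LE length, 4-byte LE checksum, payload.
fn encode_frame(payload: &[u8]) -> Vec<u8> {
    let mut out = Vec::new();
    out.extend_from_slice(&(payload.len() as u32).to_le_bytes());
    out.extend_from_slice(&checksum(payload).to_le_bytes());
    out.extend_from_slice(payload);
    out
}

// Returns the recovered payloads and how the scan ended. A torn tail is
// only acceptable at the physical end of the log.
fn scan(wal: &[u8]) -> (Vec<Vec<u8>>, ScanEnd) {
    let mut frames = Vec::new();
    let mut pos = 0;
    while pos < wal.len() {
        if wal.len() - pos < 8 {
            return (frames, ScanEnd::TornTail); // header cut short
        }
        let len = u32::from_le_bytes(wal[pos..pos + 4].try_into().unwrap()) as usize;
        let sum = u32::from_le_bytes(wal[pos + 4..pos + 8].try_into().unwrap());
        if wal.len() - pos - 8 < len {
            return (frames, ScanEnd::TornTail); // payload cut short
        }
        let payload = &wal[pos + 8..pos + 8 + len];
        if checksum(payload) != sum {
            return (frames, ScanEnd::Corruption);
        }
        frames.push(payload.to_vec());
        pos += 8 + len;
    }
    (frames, ScanEnd::CleanEof)
}
```

Crash-recovery tests would then assert that truncating the log anywhere yields TornTail with a clean prefix recovered, while flipping a byte mid-history yields Corruption.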
M3: single-node submission pipeline
Goal:
- add a bounded single-node submission path with idempotent retries and strict reads
Deliverables:
- command envelope validation
- bounded submission queue
- LSN and request-slot assignment
- result publication and retry lookup
- strict-read fence by applied LSN
Exit criteria:
- duplicate operation_id tests pass
- indefinite-outcome retry tests pass
- overload behavior is deterministic and bounded
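Two of the mechanisms above fit in a short sketch: the strict-read fence by applied LSN, and idempotent retry lookup keyed by operation_id. The names ReadFence and ResultTable are hypothetical, and the unbounded HashMap stands in for the real bounded, retention-governed result table:

```rust
use std::collections::HashMap;

// Strict-read fence: a read tagged with the LSN current at submission
// time may only execute once the applier has reached that LSN.
struct ReadFence {
    applied_lsn: u64,
}

impl ReadFence {
    fn can_serve(&self, fence_lsn: u64) -> bool {
        self.applied_lsn >= fence_lsn
    }

    fn advance(&mut self, lsn: u64) {
        // The applier advances monotonically; a regression is a bug.
        assert!(lsn >= self.applied_lsn, "applied LSN must be monotonic");
        self.applied_lsn = lsn;
    }
}

// Published-result lookup keyed by operation_id: a retry with a known id
// returns the originally published result instead of re-executing.
struct ResultTable {
    results: HashMap<u64, &'static str>,
}

impl ResultTable {
    fn submit(&mut self, operation_id: u64, execute: impl FnOnce() -> &'static str) -> &'static str {
        if let Some(&published) = self.results.get(&operation_id) {
            return published; // duplicate submission: idempotent replay
        }
        let result = execute();
        self.results.insert(operation_id, result);
        result
    }
}
```

The duplicate operation_id tests reduce to asserting that a retried submission returns the first published result and executes the command exactly once.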
M4: simulation and fault injection
Goal:
- exercise the real core under seeded, reproducible fault injection
Deliverables:
- simulated slot driver
- injected crash points
- injected WAL and fsync failures
- reproducible seeded execution harness
- decisions from the simulation spike folded into the implementation
Exit criteria:
- failures are reproducible from seed
- simulated crash/restart scenarios cover recovery logic
- the simulator runs the real core logic, not a mock semantics layer
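The "reproducible from seed" criterion can be illustrated with a tiny deterministic PRNG driving the fault schedule: two harness runs with the same seed inject faults at identical steps. The xorshift64 generator and the fixed 10% crash rate are sketch choices, not the project's actual harness design:

```rust
// Illustrative seeded fault schedule: every failure reproduces from its
// seed alone, with no dependence on wall clocks or thread timing.
struct SeededFaults {
    state: u64,
    crash_probability_percent: u64,
}

impl SeededFaults {
    fn new(seed: u64) -> Self {
        // xorshift state must be nonzero.
        SeededFaults { state: seed.max(1), crash_probability_percent: 10 }
    }

    // xorshift64: deterministic and dependency-free; statistical quality
    // is secondary to exact reproducibility here.
    fn next(&mut self) -> u64 {
        let mut x = self.state;
        x ^= x << 13;
        x ^= x >> 7;
        x ^= x << 17;
        self.state = x;
        x
    }

    // True when the harness should inject a crash at this step.
    fn should_crash(&mut self) -> bool {
        self.next() % 100 < self.crash_probability_percent
    }
}
```

When a simulated run fails, its seed is the whole bug report: replaying the seed replays the identical fault schedule against the real core logic.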
M5: single-node alpha packaging
Goal:
- package the single-node allocator into a usable alpha release
Deliverables:
- minimal API surface
- metrics and health signals
- benchmark harness
- operator runbook
- explicit wire-level mapping for definite vs indefinite submission failures
Exit criteria:
- benchmark and fault runs are documented
- operational limits are visible
- single-node alpha is usable without semantic caveats
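The wire-level mapping of definite vs indefinite submission failures is the one deliverable here with a natural type-level sketch: the client must always be able to tell whether a failed submission definitely did not happen or might have happened and must be retried with the same operation_id. The SubmitOutcome enum and variant names are illustrative assumptions:

```rust
// Illustrative wire-level outcome split, not AllocDB's actual protocol.
#[derive(Debug, PartialEq)]
enum SubmitOutcome {
    Applied { lsn: u64 },
    DefiniteReject(&'static str), // e.g. validation failure before enqueue
    Indefinite,                   // e.g. timeout after enqueue: outcome unknown
}

// An indefinite outcome is the only case where the client must retry
// with the SAME operation_id instead of treating the command as rejected.
fn must_retry_same_operation_id(outcome: &SubmitOutcome) -> bool {
    matches!(outcome, SubmitOutcome::Indefinite)
}
```

Making this split explicit on the wire is what lets the alpha be "usable without semantic caveats": clients never have to guess whether a timeout consumed a slot.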
M6: replication design
Goal:
- begin replicated-system design only after the single-node core has earned it
Deliverables:
- expanded replication.md
- protocol choice and rationale
- recovery and failover semantics
- replicated simulation plan
- Jepsen validation plan
Exit criteria:
- replication design does not rewrite single-node semantics
- replication work has explicit validation gates
- quorum and failover choices are justified against the research inputs
M7: replicated prototype
Goal:
- implement the first real replicated AllocDB shard on top of the completed M6 design
Deliverables:
- replicated node wrapper state and durable protocol metadata
- deterministic 3-replica in-process cluster harness
- primary-to-backup quorum write path
- view change and fail-closed read behavior
- suffix catch-up, snapshot transfer, and rejoin
- executable replicated simulation scenarios from testing.md
Exit criteria:
- one configured primary can replicate and commit through majority durable append
- quorum loss and higher-view takeover fail closed instead of guessing
- stale and recovering replicas catch up without rewriting committed history
- replicated simulation scenarios for partition, primary crash, and rejoin run against the real replicated harness
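The majority-durable-append and fail-closed criteria above have simple arithmetic at their core, sketched here; the function names and the view-based read rule are illustrative and stand in for whatever the M6 protocol actually specifies:

```rust
// Majority quorum for a replica group: floor(n/2) + 1 durable copies.
fn quorum(replica_count: usize) -> usize {
    replica_count / 2 + 1
}

// An entry commits once a majority has durably appended it,
// counting the primary's own append.
fn is_committed(durable_acks: usize, replica_count: usize) -> bool {
    durable_acks >= quorum(replica_count)
}

// Fail-closed read rule: a node serves reads only while it is primary
// in the highest view it has acknowledged; otherwise it refuses rather
// than guessing, which is what "fail closed" means here.
fn may_serve_reads(is_primary: bool, local_view: u64, highest_acked_view: u64) -> bool {
    is_primary && local_view == highest_acked_view
}
```

For the 3-replica harness this means commit requires 2 durable appends, and a deposed or partitioned primary stops serving the moment it learns of a higher view.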
M8: external validation
Goal:
- validate the replicated implementation through real processes and an external fault harness
Deliverables:
- multi-process local replicated cluster runner
- local fault-control harness for process and network disruption
- local QEMU 3-replica plus control-node testbed
- Jepsen harness implementing the release gate from testing.md
Exit criteria:
- local multi-process and QEMU-backed clusters are reproducible enough for scripted fault runs
- Jepsen histories apply AllocDB's retry-aware interpretation for ambiguous writes
- release-blocking invariants are enforced automatically for failover, ambiguity, stale reads, and expiration safety
- the external validation path is documented well enough to run before any replicated release
M9: lease-kernel follow-on
Goal:
- extend AllocDB with the minimal correctness-critical lease primitives needed for broader scarce-resource ownership without turning the trusted core into a scheduler or policy engine
Deliverables:
- approved follow-on kernel scope and non-goals for generic scarce-resource use cases
- explicit bundle-ownership semantics with all-or-nothing visibility
- explicit fencing-token and revoke semantics
- one minimal lease lifecycle that carries only correctness-bearing state
- durability, API, replay, and replication plans for the new primitives
- simulation and regression plan for stale-holder, revoke, and bundle-failure paths
Exit criteria:
- the follow-on kernel scope is explicit and keeps Kubernetes, queueing, topology, and routing policy outside the trusted core
- bundle commit, fencing, revoke, and liveness-boundary semantics are documented before implementation starts
- replay, crash recovery, and replication invariants for the new primitives are explicit
- the work can be broken into reviewable implementation tasks without reopening settled M0-M8 semantics
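The fencing-token semantics named above follow a well-known pattern, sketched here under assumed names: tokens increase monotonically per resource, and a downstream system rejects any write carrying a token older than the newest it has seen, so a stale lease holder cannot clobber state after a revoke. FencedResource and apply_write are hypothetical:

```rust
// Illustrative fencing check on the resource side, not AllocDB's API.
struct FencedResource {
    highest_seen_token: u64,
}

impl FencedResource {
    // Accept a write only if its fencing token is at least as new as the
    // newest one already observed; a stale holder is fenced out.
    fn apply_write(&mut self, token: u64) -> bool {
        if token < self.highest_seen_token {
            return false;
        }
        self.highest_seen_token = token;
        true
    }
}
```

The stale-holder regression paths in the simulation plan reduce to exactly this check holding across revoke, crash, and replay.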
The minimum dependency chain is:
M0 -> M1 -> M1H -> M2 -> M3 -> M4 -> M5 -> M6 -> M7 -> M8 -> M9
Allowed overlap:
- docs can advance continuously
- approved spikes can run ahead of their consuming milestone if they stay time-boxed and disposable
- some M2 storage scaffolding can start during late M1
- some M2 durability hardening can overlap with M1H when it does not reopen state-machine semantics
- some M5 API and metrics work can start during late M3
- some M8 cluster-runner and operator packaging work can start during late M7 when it does not weaken deterministic replicated-harness coverage
- some M9 planning docs can advance during late M8 or its maintainability follow-up when they do not destabilize the current replicated release gate
Not allowed:
- networking or async polish before M1 semantics are proven
- replication design pressure changing M1-M5 semantics
Suggested review points:
- end of M0: semantic freeze review
- end of M1: invariant and state-machine review
- end of M1H: constant-time and hot-path review
- end of M2: durability and corruption-handling review
- end of M4: simulation credibility review
- end of M5: alpha readiness review
- end of M7: replicated prototype correctness review
- end of M8: external validation gate review
- end of M9: lease-kernel follow-on scope and readiness review
The milestone plan here is paired with a companion task-breakdown document.
That document lists the concrete units of work that can be tracked in issue or ticket form.