A Claude Code plugin for structured product development with traceable evidence, explicit decisions, and constrained spec generation.
Most product development starts with vibes and ends with specs that nobody trusts:
- Specs drift from reality — PRDs reference decisions that were never formally made
- Assumptions hide in prose — Trade-offs buried in paragraph 47 of a 200-page doc
- Evidence disappears — "Studies show..." but which studies? From when?
- Updates break everything — New learnings require manual diff of every downstream doc
Context Ledger asks: "Can you trace every spec requirement back to evidence and an explicit decision?"
This constraint-based approach:
- Forces explicit decisions — No spec section without a `DEC-*` reference
- Traces to evidence — Every decision cites `EV-*` evidence objects
- Surfaces trade-offs — Decisions must list `wins`, `loses`, and created risks
- Produces impact reports — Updates show what downstream artifacts need regeneration
| Role | How Context Ledger Helps |
|---|---|
| Product Managers | Evidence-backed PRDs with explicit trade-off documentation |
| Engineers | Architecture decisions with clear reasoning and risk awareness |
| Executives | Decision audits showing how conclusions were reached |
| Teams | Shared understanding through structured artifacts, not tribal knowledge |
Every claim traces back to its source. If you can't cite the evidence, it doesn't go in the spec.
Most product failures stem from hidden assumptions. Context Ledger makes trade-offs visible:
- What alternatives were considered?
- What are we giving up?
- What risks does this create?
Specs cannot be generated without meeting quality gates:
- Evidence gate: ≥5 evidence objects per pillar before synthesis
- Decision gate: DECISIONS.yaml must exist before spec generation
- Spec gate: Every PRD section must reference at least one `DEC-*`
Changes produce diffs, not silent rewrites. Every update includes an impact report showing what must be regenerated.
Generate pillar map, collect evidence in parallel across 8 pillars.
Output: Evidence Objects in 02-evidence/...
Per-pillar synthesis → cross-pillar synthesis → decisions + risks.
Output: `03-synthesis/...`, `04-decisions/DECISIONS.yaml`, `05-risks/RISKS.yaml`
Generate PRD, architecture, plan — only from decisions + evidence.
Output: `06-prd/PRD.md`, `07-architecture/ARCHITECTURE.md`, `08-plan/PLAN.md`
```shell
# Add the marketplace (one-time setup)
claude plugin marketplace add synaptiai/synapti-marketplace

# Install the plugin
claude plugin install context-ledger
```

Run the complete Context Ledger pipeline from brief to implementation plan in one command.
```shell
# Standard overnight run
/ledger-full "Build a task management app for remote teams" --mode optimizer

# With self-improvement loops
/ledger-full "Task management app" --mode optimizer --self-improve

# Maximum speed hackathon
/ledger-full "AI code review assistant" --mode tokenburner

# Fully autonomous (walk away overnight)
/ledger-full "Healthcare portal with HIPAA" --mode ralph --max-iterations 50

# Ralph + self-improve (maximum autonomy)
/ledger-full "Complex fintech app" --mode ralph --self-improve --max-iterations 100
```

**What's Good About `/ledger-full`:**
- Single command, complete output — Goes from idea to implementation plan without manual phase transitions
- Mode flexibility — Choose between sustainable (`optimizer`), fast (`tokenburner`), or autonomous (`ralph`) execution
- Self-improvement loops — Optional `--self-improve` flag fills evidence gaps and resolves contradictions automatically
- Quality gate enforcement — Cannot skip evidence, decision, or spec gates; ensures receipts at every stage
- Walk-away capable — Ralph mode with stop hooks enables overnight autonomous execution
- Minimal interaction — Only asks questions during decision phase; everything else runs autonomously
Initialize a Context Ledger workspace with structured brief and pillar configuration.
```shell
/ledger-init Build a task management app for remote software teams with Slack integration
```

**What's Good About `/ledger-init`:**
- Structured brief parsing — Extracts goals, constraints, and target users from free-form input
- Pillar prioritization — Identifies which research areas matter most for your specific project
- Directory scaffold — Creates the complete 10-folder workspace structure automatically
- Validation gates — Won't proceed if brief lacks goals or constraints, ensuring quality from the start
Parallel evidence collection across 8 research pillars (market, users, tech, competitors, design, legal, ops, economics).
```shell
/ledger-research
/ledger-research --pillars market,users,competitors
```

**What's Good About `/ledger-research`:**
- True parallelism — 8 agents research simultaneously, dramatically faster than sequential research
- Atomic evidence objects — Each finding is structured YAML with confidence scores and assumptions
- Semantic IDs — Evidence gets readable IDs like `EV-market-pricing-smb-wtp` that humans can reference
- Quality gates — Enforces a minimum of 5 evidence objects per pillar before proceeding
- Source traceability — Every claim links to its source with retrieval date
Transform raw evidence into pillar syntheses, then cross-synthesize to identify conflicts and emergent insights.
```shell
/ledger-synthesize
```

**What's Good About `/ledger-synthesize`:**
- Two-layer synthesis — Per-pillar patterns first, then cross-pillar connections and conflicts
- Contradiction surfacing — Explicitly identifies where evidence disagrees, forcing resolution
- Decision candidates — Generates preliminary decisions from synthesis, not from vibes
- Pattern recognition — Identifies themes across evidence that individual pieces don't reveal
Make explicit decisions with documented alternatives, trade-offs, and risk implications.
```shell
/ledger-decide
```

**What's Good About `/ledger-decide`:**
- Forced alternatives — Every decision must document what options were considered and rejected
- Trade-off documentation — Explicit `wins` and `loses`, not just the happy path
- Risk linkage — Decisions automatically create risk entries when they create new vulnerabilities
- Evidence requirements — Each decision must cite minimum 2 evidence IDs, preventing gut-feel choices
- Interactive workflow — Presents decisions with evidence for user approval, not autonomous guessing
Generate constrained PRD and architecture documents where every section cites decisions.
```shell
/ledger-spec
```

**What's Good About `/ledger-spec`:**
- Constraint enforcement — PRD sections cannot exist without DEC-* references; no vibes allowed
- Automatic validation — Spec gate checks that all sections cite decisions before completing
- Risk cross-referencing — Architecture sections reference relevant RISK-* entries
- Coverage tracking — Reports which decisions are covered and which need spec sections
- No drift — Specs stay anchored to decisions, which stay anchored to evidence
Generate implementation plan with backlog, milestones, and test plan linked to decisions and risks.
```shell
/ledger-plan
```

**What's Good About `/ledger-plan`:**
- Decision-linked items — Every backlog item traces to the decisions that require it
- Risk-aware sequencing — High-risk items get earlier attention and explicit mitigations
- Test plan generation — Acceptance criteria derived from decisions, not invented
- Milestone structure — Logical groupings based on decision dependencies
Apply new learnings and generate impact report showing what needs regeneration.
```shell
/ledger-update "User interviews revealed onboarding completion rate of 23%"
```

**What's Good About `/ledger-update`:**
- Impact analysis — Shows exactly which decisions, specs, and plans are affected by new evidence
- Diff generation — Documents what changed, not silent rewrites
- Regeneration guidance — Identifies which artifacts need updates vs. which remain valid
- Audit trail — Maintains history of how understanding evolved over time
The /ledger-full command supports three modes + one flag:
**`--mode optimizer`**

Sustainable execution. 3 agents per pillar (24 total), balanced throughput.

Best for: Standard projects, overnight runs, production use.
**`--mode tokenburner`**

Maximum parallelism. 30+ agents per pillar (240+ total), burns through tokens fast.

Best for: Hackathons, time-critical projects, rapid exploration.
**`--mode ralph`**

Fully autonomous execution using Ralph Loop's stop hook mechanism. The same prompt is re-fed on each exit until `<promise>LEDGER_COMPLETE</promise>` is output.

Best for: Walk-away overnight runs, complex autonomous projects.
**`--self-improve`**

Can be combined with any mode. Enables within-pipeline gap-analysis loops:
- Analyzes research for missing evidence
- Checks synthesis for unresolved contradictions
- Loops back when gaps found
| Scenario | Recommended |
|---|---|
| Standard project | --mode optimizer |
| Hackathon / rapid prototyping | --mode tokenburner |
| Walk-away overnight run | --mode ralph --max-iterations 50 |
| Complex regulated project | --mode ralph --self-improve --max-iterations 100 |
| High-stakes production planning | --mode optimizer --self-improve |
All artifacts use self-describing semantic IDs with optional -2, -3 disambiguators.
- Readable + speakable in meetings and docs
- Predictable structure so humans can guess IDs
- Still unique with `-n` suffix when collisions happen
- Max 40 chars after prefix
| Type | Format | Examples |
|---|---|---|
| Evidence | `EV-<pillar>-<topic>-<descriptor>[-n]` | `EV-market-pricing-smb-wtp`, `EV-users-onboarding-dropoff` |
| Decision | `DEC-<area>-<decision>[-n]` | `DEC-scope-power-users-first`, `DEC-ux-single-summary-box` |
| Risk | `RISK-<area>-<risk>[-n]` | `RISK-retention-expert-depth-churn`, `RISK-legal-gdpr-processing-basis` |
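These formats are regular enough to validate mechanically. A minimal sketch, assuming lowercase alphanumeric segments and treating the 40-character cap as applying to everything after the type prefix (the exact rules the plugin enforces may differ):

```python
import re

# Hypothetical sketch: the exact rules Context Ledger enforces may differ.
# Assumes lowercase alphanumeric segments joined by hyphens, with the
# documented 40-character cap applying to everything after the type prefix.
ID_PATTERN = re.compile(
    r"^(EV|DEC|RISK)-"                     # artifact type prefix
    r"(?P<body>[a-z0-9]+(?:-[a-z0-9]+)+)$" # area/topic/descriptor segments, optional -n
)

def is_valid_id(artifact_id: str) -> bool:
    match = ID_PATTERN.match(artifact_id)
    return bool(match) and len(match.group("body")) <= 40

print(is_valid_id("EV-market-pricing-smb-wtp"))     # True
print(is_valid_id("DEC-scope-power-users-first"))   # True
print(is_valid_id("EV-users-onboarding-dropoff-2")) # True (disambiguator)
print(is_valid_id("vibes"))                          # False
```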
Default: `~/project/ledger/` (overridable)

```
ledger/
├── 00-brief/        # 5-sentence brief + goals + constraints
├── 01-pillars/      # pillar map, scope, priorities
├── 02-evidence/     # Evidence Objects (per pillar)
│   ├── market/
│   ├── users/
│   ├── tech/
│   ├── competitors/
│   ├── design/
│   ├── legal/
│   ├── ops/
│   └── economics/
├── 03-synthesis/    # per-pillar syntheses + CROSS-SYNTHESIS.md
├── 04-decisions/    # DECISIONS.yaml
├── 05-risks/        # RISKS.yaml
├── 06-prd/          # PRD.md
├── 07-architecture/ # ARCHITECTURE.md
├── 08-plan/         # PLAN.md
├── 09-brand/        # personas, tone, UI refs, tokens
└── 10-gtm-ops/      # pricing, unit economics, launch checklist
```
Evidence objects force atomic, traceable research output.
```yaml
id: EV-market-pricing-smb-wtp
pillar: market
source:
  type: url | pdf | interview | internal-doc | experiment | dataset
  ref: "https://..."
  retrieved_at: 2026-01-21
claim: "SMB segment willingness-to-pay peaks at $29/mo."
quote: "<short excerpt or summary>"
confidence: 0.0-1.0
assumptions:
  - "Survey sample excludes enterprise accounts"
notes: "Why it matters, how it may fail, what to verify next."
tags:
  - pricing
  - smb
```

- Claims must be falsifiable (not vibes)
- Confidence is mandatory (even if subjective)
- Assumptions must be listed
- ID auto-suggested from claim text
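The mechanical parts of these rules (required fields, confidence range, listed assumptions) can be checked once an object is parsed. A minimal sketch operating on a plain dict; field names follow the example schema above, and the plugin's own validation may differ:

```python
# Hypothetical sketch of the mechanical rules above, applied to an
# Evidence Object already parsed from YAML into a plain dict. Field
# names follow the example schema; the plugin's own validation may differ.
REQUIRED_FIELDS = {"id", "pillar", "source", "claim", "confidence", "assumptions"}

def validate_evidence(obj: dict) -> list[str]:
    """Return rule violations; an empty list means the object passes."""
    errors = []
    missing = REQUIRED_FIELDS - obj.keys()
    if missing:
        errors.append(f"missing fields: {sorted(missing)}")
    conf = obj.get("confidence")
    if not isinstance(conf, (int, float)) or not 0.0 <= conf <= 1.0:
        errors.append("confidence must be a number in [0.0, 1.0]")
    if not obj.get("assumptions"):
        errors.append("assumptions must be listed")
    return errors

evidence = {
    "id": "EV-market-pricing-smb-wtp",
    "pillar": "market",
    "source": {"type": "url", "ref": "https://...", "retrieved_at": "2026-01-21"},
    "claim": "SMB segment willingness-to-pay peaks at $29/mo.",
    "confidence": 0.7,
    "assumptions": ["Survey sample excludes enterprise accounts"],
}
print(validate_evidence(evidence))  # []
```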
The heart of the system. Every decision must document:
```yaml
- id: DEC-scope-power-users-first
  decision: "Target power users before SMB"
  status: accepted | provisional | rejected
  owner: "user"
  created_at: 2026-01-21
  alternatives:
    - "SMB-first"
    - "Enterprise-first"
  evidence:
    - EV-market-pricing-smb-wtp
    - EV-users-power-user-retention
  tradeoffs:
    wins:
      - "Faster iteration cycles"
      - "Lower sales friction"
    loses:
      - "Lower initial ARPA"
      - "Potentially higher churn"
  risks:
    - RISK-retention-expert-depth-churn
  implications:
    - "MVP UX must optimize for expert workflows"
```

| Gate | Requirement |
|---|---|
| Evidence gate | Each pillar needs ≥5 Evidence Objects before synthesis |
| Decision gate | No spec generation until DECISIONS.yaml exists with ≥2 evidence IDs per decision |
| Spec gate | No PRD/architecture sections without DEC-* references |
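As a rough illustration, the three gates reduce to simple predicates over the artifacts. A sketch with assumed names and data shapes (`evidence_by_pillar`, `decisions`, `spec_sections` are illustrative, not the plugin's internals):

```python
# Hypothetical sketch of the three gates above, applied to in-memory artifacts.
# The names and data shapes are illustrative assumptions; the plugin's actual
# checks may be implemented differently.
MIN_EVIDENCE_PER_PILLAR = 5
MIN_EVIDENCE_PER_DECISION = 2

def evidence_gate(evidence_by_pillar: dict[str, list[dict]]) -> bool:
    """Each pillar needs at least 5 Evidence Objects before synthesis."""
    return all(len(objs) >= MIN_EVIDENCE_PER_PILLAR
               for objs in evidence_by_pillar.values())

def decision_gate(decisions: list[dict]) -> bool:
    """DECISIONS.yaml must exist, and each decision must cite >= 2 evidence IDs."""
    return bool(decisions) and all(
        len(d.get("evidence", [])) >= MIN_EVIDENCE_PER_DECISION for d in decisions
    )

def spec_gate(spec_sections: list[dict]) -> bool:
    """Every PRD/architecture section must reference at least one DEC-* id."""
    return all(
        any(ref.startswith("DEC-") for ref in s.get("refs", []))
        for s in spec_sections
    )

# Example: a pillar with only 4 objects fails the evidence gate.
print(evidence_gate({"market": [{}] * 4}))  # False
```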
Updates produce structured impact analysis:
```markdown
# Impact Report

## What Changed
- Evidence added/updated:
  - EV-market-pricing-enterprise-wtp (new)
  - EV-users-onboarding-dropoff (updated confidence: 0.7 → 0.9)
- Decisions updated:
  - DEC-scope-power-users-first (status: provisional → accepted)

## What Is Affected
- PRD sections: 2, 4
- Architecture sections: Data model
- Plan items: Epic 1

## Recommended Actions
- [ ] Regenerate PRD section 2
- [ ] Review RISK-retention-expert-depth-churn mitigations
```

Specs must cite decisions using parseable references:
Section headings:

```markdown
## 2. MVP scope (DEC-scope-power-users-first, DEC-ux-single-summary-box)
```

Inline:

```markdown
We will prioritize workflow X because it reduces drop-off. (DEC-scope-power-users-first)
```

Most "context" tooling is a dump of notes.
Context Ledger is:
- Structured evidence with confidence scores
- Explicit decisions with trade-offs and alternatives
- Explicit risks with triggers and mitigations
- Constrained generation (specs must cite decisions)
- Impact-aware updates (changes produce diffs)
It's a lightweight system for building products with receipts.
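As a closing illustration of "receipts": because spec citations follow the parseable format shown earlier, a few lines of code can audit which decisions a section cites. A hypothetical sketch:

```python
import re

# Hypothetical sketch: extract the DEC-* references a spec section cites.
# The regex assumes the semantic ID conventions documented above; the
# plugin's own parser may differ.
DEC_REF = re.compile(r"\bDEC-[a-z0-9]+(?:-[a-z0-9]+)+\b")

section = "## 2. MVP scope (DEC-scope-power-users-first, DEC-ux-single-summary-box)"
print(DEC_REF.findall(section))
# ['DEC-scope-power-users-first', 'DEC-ux-single-summary-box']
```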
MIT License - See LICENSE file for details.
Daniel Bentes