docs: update website with implementation benchmark findings and v0.5.0 roadmap #111

prosdev · 2025-11-29T10:34:20Z

Summary

Major documentation update based on implementation benchmark findings. This PR supersedes #110.

Key Findings from Implementation Benchmarks

Task Type	Cost Savings	Time Savings
Debugging	42%	37%
Exploration	44%	19%
Implementation	29%	22%

Key insight: Savings scale with task complexity.

Why It Saves Money

What dev-agent does	Manual equivalent	Impact
Returns code snippets in search	Read entire files	99% fewer input tokens
`dev_plan` bundles issue + code + commits	5-10 separate tool calls	29% cost reduction
Semantic search finds relevant code	grep chains + filtering	42% cost reduction

Changes

Website (`website/content/index.mdx`)

Highlight context bundling as core value prop
Add "scales with complexity" messaging
Show 99% input token reduction from code snippets
Add dev_plan context bundling comparison (before/after tabs)
Update benchmark results with task-type breakdown

Docs (`website/content/docs/index.mdx`)

Add "Why it saves money" section
Update benchmark data with task-type breakdown

PLAN.md

Add v0.5.0 roadmap with dev_context generalization
Add benchmark improvements for implementation task coverage
Update benchmark results with token analysis

AGENTS.md & CLAUDE.md

Update to show all 9 MCP tools (was missing dev_refs, dev_map, dev_history)
Improve tool descriptions

Closes

This PR supersedes #110 (which only had the AGENTS.md/CLAUDE.md/PLAN.md updates). Please close #110 after merging this.

PLAN.md: - Add v0.5.0 roadmap with dev_context generalization - Add benchmark improvements for implementation task coverage - Update timestamp AGENTS.md & CLAUDE.md: - Update to show all 9 MCP tools (was missing dev_refs, dev_map, dev_history) - Improve tool descriptions to match v0.4.2 updates

…0 roadmap Website changes: - Highlight context bundling as core value prop - Add 'scales with complexity' messaging (42% for debugging, 29% for implementation) - Show 99% input token reduction from code snippets - Add dev_plan context bundling comparison - Update benchmark results with task-type breakdown PLAN.md: - Add v0.5.0 roadmap with dev_context generalization - Add benchmark improvements for implementation task coverage - Update benchmark results with token analysis AGENTS.md & CLAUDE.md: - Update to show all 9 MCP tools (was missing dev_refs, dev_map, dev_history) - Improve tool descriptions to match v0.4.2 updates Benchmark data from studies/: - Debugging: 42% cost savings, 37% time savings - Implementation: 29% cost savings, 22% time savings - Exploration: 44% cost savings, 19% time savings

prosdev added 2 commits November 29, 2025 01:46

prosdev merged commit 903e539 into main Nov 29, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

docs: update website with implementation benchmark findings and v0.5.0 roadmap #111

docs: update website with implementation benchmark findings and v0.5.0 roadmap #111

Uh oh!

prosdev commented Nov 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

docs: update website with implementation benchmark findings and v0.5.0 roadmap #111

docs: update website with implementation benchmark findings and v0.5.0 roadmap #111

Uh oh!

Conversation

prosdev commented Nov 29, 2025

Summary

Key Findings from Implementation Benchmarks

Why It Saves Money

Changes

Website (website/content/index.mdx)

Docs (website/content/docs/index.mdx)

PLAN.md

AGENTS.md & CLAUDE.md

Closes

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Website (`website/content/index.mdx`)

Docs (`website/content/docs/index.mdx`)