Skip to content

Conversation

@prosdev
Copy link
Collaborator

@prosdev prosdev commented Nov 29, 2025

Summary

Major documentation update based on implementation benchmark findings. This PR supersedes #110.

Key Findings from Implementation Benchmarks

Task Type Cost Savings Time Savings
Debugging 42% 37%
Exploration 44% 19%
Implementation 29% 22%

Key insight: Savings scale with task complexity.

Why It Saves Money

What dev-agent does Manual equivalent Impact
Returns code snippets in search Read entire files 99% fewer input tokens
dev_plan bundles issue + code + commits 5-10 separate tool calls 29% cost reduction
Semantic search finds relevant code grep chains + filtering 42% cost reduction

Changes

Website (website/content/index.mdx)

  • Highlight context bundling as core value prop
  • Add "scales with complexity" messaging
  • Show 99% input token reduction from code snippets
  • Add dev_plan context bundling comparison (before/after tabs)
  • Update benchmark results with task-type breakdown

Docs (website/content/docs/index.mdx)

  • Add "Why it saves money" section
  • Update benchmark data with task-type breakdown

PLAN.md

  • Add v0.5.0 roadmap with dev_context generalization
  • Add benchmark improvements for implementation task coverage
  • Update benchmark results with token analysis

AGENTS.md & CLAUDE.md

  • Update to show all 9 MCP tools (was missing dev_refs, dev_map, dev_history)
  • Improve tool descriptions

Closes

This PR supersedes #110 (which only had the AGENTS.md/CLAUDE.md/PLAN.md updates). Please close #110 after merging this.

PLAN.md:
- Add v0.5.0 roadmap with dev_context generalization
- Add benchmark improvements for implementation task coverage
- Update timestamp

AGENTS.md & CLAUDE.md:
- Update to show all 9 MCP tools (was missing dev_refs, dev_map, dev_history)
- Improve tool descriptions to match v0.4.2 updates
…0 roadmap

Website changes:
- Highlight context bundling as core value prop
- Add 'scales with complexity' messaging (42% for debugging, 29% for implementation)
- Show 99% input token reduction from code snippets
- Add dev_plan context bundling comparison
- Update benchmark results with task-type breakdown

PLAN.md:
- Add v0.5.0 roadmap with dev_context generalization
- Add benchmark improvements for implementation task coverage
- Update benchmark results with token analysis

AGENTS.md & CLAUDE.md:
- Update to show all 9 MCP tools (was missing dev_refs, dev_map, dev_history)
- Improve tool descriptions to match v0.4.2 updates

Benchmark data from studies/:
- Debugging: 42% cost savings, 37% time savings
- Implementation: 29% cost savings, 22% time savings
- Exploration: 44% cost savings, 19% time savings
@prosdev prosdev merged commit 903e539 into main Nov 29, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant