Bob the Builder

Concurrent task orchestrator for Kiro specs. Takes a spec with a tasks.md, analyzes dependencies, builds a DAG, and executes tasks in parallel using git worktrees for isolation.

Each task gets its own worktree and agent process. A planner assigns models per task, a reviewer audits completed work, and everything syncs back to your main branch automatically.

Quick Start

# Clone btb anywhere on your machine
git clone https://github.com/joshuakatt/Bob-The-Builder.git

# cd into your project repo
cd your-project

# Run setup (validates prereqs, installs agent configs)
path/to/Bob-The-Builder/setup.sh

# Run btb on a spec
path/to/Bob-The-Builder/btb.sh spec-name

btb lives in its own directory and operates on whatever repo you're in.

Prerequisites

bash 3.2+ (macOS default works, on Windows use WSL or Git Bash)
kiro-cli — installed and authenticated
git — any recent version
python3 — for DAG analysis
perl — optional, for TUI keyboard input (use --no-tui without it)

Run setup.sh in your target repo to validate all prerequisites and install agent configs:

path/to/Bob-The-Builder/setup.sh

Windows note: btb is a bash tool. On Windows, run it inside WSL or Git Bash — not PowerShell or cmd.

Usage

# By spec name (looks in .kiro/specs/<name>/)
btb.sh spec-name

# By explicit path
btb.sh --spec-dir path/to/spec

# Sequential mode (one task at a time, no DAG)
btb.sh spec-name --sequential

# Dry run (analyze DAG only, no execution)
btb.sh spec-name --dry-run

# Skip the review gate
btb.sh spec-name --no-review

# Plain log output instead of TUI
btb.sh spec-name --no-tui

# Set max parallel workers
btb.sh spec-name --max-parallel 6

# Clean up all worktrees, branches, and locks
btb.sh --cleanup

Spec Structure

Your spec directory needs at minimum a tasks.md. The planner also reads design.md and requirements.md if present.

.kiro/specs/spec-name/
├── tasks.md          # required — task list with checkboxes
├── design.md         # recommended — architecture and implementation guidance
└── requirements.md   # recommended — acceptance criteria

Tasks in tasks.md use this format:

- [ ] 1. Set up project infrastructure
  - [ ] 1.1 Initialize project with TypeScript config
  - [ ] 1.2 Set up database schema
- [ ] 2. Implement core features
  - [ ] 2.1 Build the API layer
  - [ ] 2.2 Add authentication

The planner analyzes dependencies between tasks and groups them into waves for parallel execution. Subtasks (1.1, 1.2, etc.) are the units of work — parent tasks auto-complete when all their subtasks finish.

How It Works

Analysis — The planner reads your tasks and design docs, builds a dependency DAG, and assigns models per task
Execution — Tasks spawn as workers in isolated git worktrees. A task becomes ready once all its dependencies are synced to main
Sync — Completed tasks merge back to main immediately (serialized by lock). Merge conflicts are resolved by a dedicated resolver agent
Review — After a batch of tasks sync, the reviewer audits the work. If issues are found, a fix agent addresses them
Completion — Parent tasks auto-complete when all subtasks finish. The run ends when all tasks are resolved

Configuration

All settings live in config.sh. Most can be overridden via environment variables.

Concurrency

Setting	Default	Description
`MAX_PARALLEL`	`6`	Total concurrent agent processes (workers + reviewer)
`REVIEW_RESERVED_SLOTS`	`1`	Slots reserved for the reviewer agent
`MAX_ITERATIONS_PER_TASK`	`20`	Max agent iterations per task before giving up

Effective worker slots = MAX_PARALLEL - REVIEW_RESERVED_SLOTS. With defaults, that's 5 workers + 1 reviewer.

Models

Setting	Default	Description
`AVAILABLE_MODELS`	`claude-sonnet-4.5,claude-opus-4.6`	Models the planner can assign to tasks
`DEFAULT_TASK_MODEL`	`claude-opus-4.6`	Fallback when planner doesn't assign a model
`STEERING_MODEL`	`claude-opus-4.6`	Model used to generate steering docs
`REVIEWER_MODEL`	`claude-opus-4.6`	Model used for the review gate

The planner picks a model per task based on complexity. Simple tasks get cheaper models, complex ones get more capable models.

Review Gate

Setting	Default	Description
`ENABLE_REVIEW`	`true`	Master switch for post-batch quality review
`REVIEW_BATCH_SIZE`	`3`	Number of synced tasks before triggering a review
`MAX_REVIEW_RETRIES`	`2`	Review→fix cycles before accepting

The reviewer audits completed work against the spec after every batch of synced tasks. If it finds issues, the player agent gets a fix attempt. Use --no-review to skip entirely.

Retry and Safety

Setting	Default	Description
`MAX_RETRIES`	`10`	Retries per task on failure
`MAX_DAG_REPAIR_ATTEMPTS`	`3`	Planner re-prompts to patch missing DAG tasks
`STALE_THRESHOLD`	`600`	Seconds of inactivity before killing a worker
`RATE_LIMIT_PAUSE`	`3`	Seconds between spawning workers

The stale threshold is activity-based, not wall-clock. A worker is considered active if its log file is growing, it has descendant processes running anywhere in its process tree (e.g. a long build or test suite), or the process is alive.

Shared Build Cache

Setting	Default	Description
`SHARED_BUILD_CACHE_DIR`	`../.ralph-build-cache`	Shared build artifact directory across all worktrees

Parallel worktrees normally duplicate build artifacts (Rust target/, Go cache, Gradle home, etc.). The shared cache redirects these via environment variables (CARGO_TARGET_DIR, GRADLE_USER_HOME, GOPATH/GOCACHE, PIP_CACHE_DIR). Set to empty string to disable.

Steering Docs

On first run, if your repo doesn't have .kiro/steering/ docs, they're auto-generated from your spec and codebase. These give all agents persistent project context — architecture decisions, conventions, tech stack, etc.

If you already have steering docs, they're used as-is.

Logs

All logs go to .ralph-logs/ in your repo:

debug_<timestamp>.log — orchestrator debug log
task_<id>_<timestamp>.log — per-task agent output
dag_<timestamp>.json — the dependency DAG produced by the planner
dag_validation_<timestamp>.log — diagnostic info when DAG validation fails
steering_<timestamp>.log — steering doc generation output

DAG Validation

For large specs (100+ tasks), the planner might miss some tasks. btb handles this automatically:

Compares DAG task count against incomplete tasks in tasks.md
If tasks are missing → asks the planner to produce a patch for only the missing ones
Repeats up to MAX_DAG_REPAIR_ATTEMPTS times (default: 3)
Any remaining stragglers get appended as sequential waves — no task is ever lost

You can also validate manually:

./validate-dag.sh spec-name
./validate-dag.sh spec-name .ralph-logs/dag_20250208_143022.json

If you still see issues, try --sequential mode (bypasses DAG entirely), break large specs into smaller ones, or check .ralph-logs/dag_*.json to see what the planner understood.

Cloud Deployment

The deploy/ directory has everything you need to run btb as a service on a Linux server. It includes a webhook server that listens for GitHub push events and automatically runs btb on branches containing a .btb config file.

Quick setup

Provision a fresh Amazon Linux 2023 or Ubuntu instance:
```
sudo bash deploy/provision.sh
```
Copy the btb code to /opt/btb on the instance
Edit /etc/btb-service/config.env with your webhook secret, GitHub token, TLS certs, and paths (see deploy/config.env.example for all options)
Set up the GitHub webhook:
```
bash deploy/setup-webhook.sh
```
Start the service:
```
sudo systemctl start btb-service
```

The service exposes a dashboard at https://<your-host>:8443/ showing job status, queue, and logs.

What's in `deploy/`

File	Purpose
`provision.sh`	Installs all dependencies on a fresh instance
`btb-service.service`	systemd unit file (auto-restart, security hardening)
`config.env.example`	Annotated config template
`setup-webhook.sh`	Creates the GitHub webhook via API

Cleanup

If a run is interrupted or you need to reset:

./btb.sh --cleanup

This removes all worktrees, ralph branches, and lock files.

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bob the Builder

Quick Start

Prerequisites

Usage

Spec Structure

How It Works

Configuration

Concurrency

Models

Review Gate

Retry and Safety

Shared Build Cache

Steering Docs

Logs

DAG Validation

Cloud Deployment

Quick setup

What's in `deploy/`

Cleanup

License

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.kiro		.kiro
deploy		deploy
lib		lib
server		server
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
btb.sh		btb.sh
config.sh		config.sh
setup.sh		setup.sh
validate-dag.sh		validate-dag.sh

License

HautlyS/bob-the-builder

Folders and files

Latest commit

History

Repository files navigation

Bob the Builder

Quick Start

Prerequisites

Usage

Spec Structure

How It Works

Configuration

Concurrency

Models

Review Gate

Retry and Safety

Shared Build Cache

Steering Docs

Logs

DAG Validation

Cloud Deployment

Quick setup

What's in deploy/

Cleanup

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

What's in `deploy/`

Packages