bt-servant-worker

AI-powered assistance for Bible translators, at the edge.

A Cloudflare Worker that provides AI-powered assistance to Bible translators via Claude, with sandboxed code execution, dynamic MCP tool orchestration, and per-user persistent memory.

What This Project Does

bt-servant-worker is deployed on Cloudflare's edge network and provides:

Claude-powered chat with multi-turn orchestration (up to 10 tool-use iterations per request)
Dynamic MCP tool discovery — discovers and calls MCP tools from configured servers
Sandboxed code execution via QuickJS compiled to WebAssembly
Per-user state — chat history, preferences, prompt overrides, and persistent memory via Durable Objects
Request serialization — one request at a time per user, preventing race conditions
Streaming support — real-time SSE streaming and webhook progress callbacks
Dynamic prompt overrides — org and user-level customization of Claude's system prompt
User persistent memory — schema-free markdown memory that persists across conversations

Architecture Overview

┌─────────────────────────────────────────────────────────────────┐
│  Cloudflare Worker                                              │
│                                                                 │
│  POST /api/v1/chat ──► KV (org config, MCP servers, prompts)   │
│       │                                                         │
│       ▼                                                         │
│  ┌─────────────────────────────────────────────────────────┐    │
│  │ Durable Object (per-user)                               │    │
│  │ - Chat history, preferences, prompt overrides, memory   │    │
│  │ - Request serialization via storage lock                │    │
│  └──────────────┬──────────────────────────────────────────┘    │
│                 ▼                                               │
│  ┌─────────────────────────────────────────────────────────┐    │
│  │ Claude Orchestrator                                     │    │
│  │ - System prompt with tool catalog + memory TOC          │    │
│  │ - Up to 10 iterations with parallel tool execution      │    │
│  └──────────────┬──────────────────────────────────────────┘    │
│                 │                                               │
│    ┌────────────┼────────────┬──────────────┐                   │
│    ▼            ▼            ▼              ▼                   │
│  execute_code  get_tool_   read_memory   update_memory          │
│  (QuickJS)     definitions  (DO store)   (DO store)             │
└─────────────────────────────────────────────────────────────────┘
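The orchestrator box in the diagram can be pictured as a bounded loop. The following is a minimal sketch under stated assumptions, not the project's actual orchestrator: `ModelStep`, `ToolRunner`, and `runOrchestration` are hypothetical names, the model call is abstracted as an injected function, and throwing when the limit is reached is an assumption about the failure mode.

```typescript
// Hypothetical shapes: the project's real orchestrator types are not shown in this README.
interface ToolCall {
  id: string;
  tool: string;
  input: unknown;
}

interface ToolResult {
  id: string;
  output: unknown;
}

// One model turn: either a final answer or a batch of tool calls to run.
type ModelTurn =
  | { kind: 'final'; text: string }
  | { kind: 'tool_use'; calls: ToolCall[] };

type ModelStep = (previousResults: ToolResult[]) => Promise<ModelTurn>;
type ToolRunner = (call: ToolCall) => Promise<unknown>;

const MAX_ITERATIONS = 10; // the "up to 10 iterations" limit from the diagram

async function runOrchestration(step: ModelStep, runTool: ToolRunner): Promise<string> {
  let results: ToolResult[] = [];
  for (let i = 0; i < MAX_ITERATIONS; i++) {
    const turn = await step(results);
    if (turn.kind === 'final') {
      return turn.text;
    }
    // Tool calls requested in the same turn run in parallel.
    results = await Promise.all(
      turn.calls.map(async (call) => ({ id: call.id, output: await runTool(call) }))
    );
  }
  // Assumption: exceeding the limit is reported as an error.
  throw new Error(`No final answer after ${MAX_ITERATIONS} tool-use iterations`);
}
```

Injecting the model step and tool runner keeps the loop testable without network access.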

Key Components

QuickJS Sandbox — Replaces Node.js isolated-vm with QuickJS compiled to WebAssembly. Code runs in a completely isolated sandbox with no access to fetch, environment variables, or Worker APIs. Only explicitly injected MCP tool wrappers are available.

Durable Objects — Each user gets their own instance that guarantees single-threaded execution, stores chat history/preferences/memory, and persists data across requests.

MCP Budget & Health Tracking — Downstream API call budget tracking with circuit breaker pattern prevents runaway costs and blocks unhealthy servers.
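To make the budget and circuit breaker ideas concrete, here is a small sketch. All names (`ServerHealth`, `CallBudget`, `BreakerOptions`) and the threshold and cooldown parameters are assumptions for illustration; the project's real limits and state shape are not documented here.

```typescript
// Hypothetical sketch: thresholds, names, and the budget shape are assumptions.
interface BreakerOptions {
  failureThreshold: number; // consecutive failures before the circuit opens
  cooldownMs: number; // how long an open circuit rejects calls
}

class ServerHealth {
  private readonly options: BreakerOptions;
  private consecutiveFailures = 0;
  private openedAt: number | null = null;

  constructor(options: BreakerOptions) {
    this.options = options;
  }

  // True when calls to this server should be allowed at time `now` (ms).
  canCall(now: number): boolean {
    if (this.openedAt === null) return true;
    // After the cooldown, allow a trial call (half-open state).
    return now - this.openedAt >= this.options.cooldownMs;
  }

  recordSuccess(): void {
    this.consecutiveFailures = 0;
    this.openedAt = null;
  }

  recordFailure(now: number): void {
    this.consecutiveFailures += 1;
    if (this.consecutiveFailures >= this.options.failureThreshold) {
      this.openedAt = now; // open (or re-open) the circuit
    }
  }
}

class CallBudget {
  private readonly limit: number;
  private used = 0;

  constructor(limit: number) {
    this.limit = limit;
  }

  // Consumes one downstream call; returns false once the limit is reached.
  tryConsume(): boolean {
    if (this.used >= this.limit) return false;
    this.used += 1;
    return true;
  }
}
```

A caller would check both `canCall` and `tryConsume` before each downstream request, then report the outcome with `recordSuccess` or `recordFailure`.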

User Persistent Memory — Schema-free markdown document per user. A deterministic TOC is injected into the system prompt; Claude reads/writes specific sections via tools. Memory persists indefinitely across sessions. 128KB storage cap per user.
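One way to picture a deterministic TOC is a pure function over the markdown headings, so the same document always yields the same outline. This is a sketch only: `buildMemoryToc`, `renderMemoryToc`, and the `TocEntry` fields are hypothetical, and the project's actual parser may track different details (for example, section sizes).

```typescript
// Hypothetical sketch of a heading-based table of contents.
interface TocEntry {
  level: number; // heading depth: 1 for "#", 2 for "##", and so on
  title: string;
  line: number; // 1-based line where the section starts
}

function buildMemoryToc(markdown: string): TocEntry[] {
  const entries: TocEntry[] = [];
  markdown.split('\n').forEach((text, index) => {
    // Simplification: headings inside fenced code blocks are not skipped.
    const match = /^(#{1,6})\s+(.+?)\s*$/.exec(text);
    if (match) {
      entries.push({
        level: (match[1] ?? '').length,
        title: match[2] ?? '',
        line: index + 1,
      });
    }
  });
  return entries;
}

// Renders the outline as an indented list suitable for a system prompt.
function renderMemoryToc(entries: TocEntry[]): string {
  return entries
    .map((e) => `${'  '.repeat(e.level - 1)}- ${e.title} (line ${e.line})`)
    .join('\n');
}
```

Because the output depends only on the document text, the injected TOC stays stable between requests unless the memory itself changes.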

Dynamic Prompt Overrides — 6 customizable prompt slots (identity, methodology, tool_guidance, instructions, memory_instructions, closing) with 3-tier resolution: user → org → default.
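The 3-tier resolution can be sketched as a per-slot fallback chain. The slot names come from the description above; `resolvePrompts` and the override types are hypothetical, and treating a missing key as "not overridden" is an assumption (how the project handles empty strings is not stated).

```typescript
const PROMPT_SLOTS = [
  'identity',
  'methodology',
  'tool_guidance',
  'instructions',
  'memory_instructions',
  'closing',
] as const;

type PromptSlot = (typeof PROMPT_SLOTS)[number];
type PromptOverrides = Partial<Record<PromptSlot, string>>;

// Resolves each slot independently: user override, then org override, then default.
function resolvePrompts(
  defaults: Record<PromptSlot, string>,
  org: PromptOverrides,
  user: PromptOverrides
): Record<PromptSlot, string> {
  const resolved: Record<PromptSlot, string> = { ...defaults };
  for (const slot of PROMPT_SLOTS) {
    resolved[slot] = user[slot] ?? org[slot] ?? defaults[slot];
  }
  return resolved;
}
```

Resolving slot by slot means a user can override `identity` while still inheriting the org's `methodology`.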

API Endpoints

Chat

Endpoint	Method	Description
`/health`	GET	Health check
`/api/v1/chat`	POST	Chat with Claude (non-streaming)
`/api/v1/chat/stream`	POST	Chat with Claude (SSE streaming)
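A client of the streaming endpoint needs to split the response body into SSE messages. The sketch below parses the standard `event:` and `data:` fields of the SSE wire format; `parseSse` is a hypothetical helper, and the event names and payload shapes this worker actually emits are not documented here.

```typescript
interface SseMessage {
  event: string; // defaults to "message" per the SSE format
  data: string;
}

// Parses all complete messages in `buffer`; `rest` holds any trailing partial
// message and should be prepended to the next chunk.
function parseSse(buffer: string): { messages: SseMessage[]; rest: string } {
  const blocks = buffer.replace(/\r\n/g, '\n').split('\n\n');
  const rest = blocks.pop() ?? '';
  const messages: SseMessage[] = [];
  for (const block of blocks) {
    let eventName = 'message';
    const dataLines: string[] = [];
    for (const line of block.split('\n')) {
      if (line.startsWith(':')) continue; // comment or keep-alive line
      const colon = line.indexOf(':');
      const field = colon === -1 ? line : line.slice(0, colon);
      let value = colon === -1 ? '' : line.slice(colon + 1);
      if (value.startsWith(' ')) value = value.slice(1); // one optional leading space
      if (field === 'event') eventName = value;
      if (field === 'data') dataLines.push(value);
    }
    if (dataLines.length > 0) {
      messages.push({ event: eventName, data: dataLines.join('\n') });
    }
  }
  return { messages, rest };
}
```

Returning the unparsed remainder lets the caller handle messages that are split across network chunks.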

User Endpoints

Endpoint	Method	Description
`/api/v1/orgs/:org/users/:userId/preferences`	GET/PUT	User preferences
`/api/v1/orgs/:org/users/:userId/history`	GET	Chat history

Admin Endpoints

All admin endpoints require Bearer token authentication (super admin or org-specific admin key).

Endpoint	Method	Description
`/api/v1/admin/orgs/:org/mcp-servers`	GET/PUT/POST	MCP server management
`/api/v1/admin/orgs/:org/mcp-servers/:serverId`	DELETE	Remove MCP server
`/api/v1/admin/orgs/:org/config`	GET/PUT/DELETE	Org config (history limits)
`/api/v1/admin/orgs/:org/prompt-overrides`	GET/PUT/DELETE	Org-level prompt overrides
`/api/v1/admin/orgs/:org/users/:userId/prompt-overrides`	GET/PUT/DELETE	User-level prompt overrides
`/api/v1/admin/orgs/:org/users/:userId/memory`	GET/DELETE	User persistent memory
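As an illustration of calling an admin endpoint, the sketch below builds a request for the user memory route with a Bearer token. `adminUserMemoryRequest` and `AdminRequest` are hypothetical helpers; only the route and the Bearer scheme come from this README.

```typescript
interface AdminRequest {
  url: string;
  method: 'GET' | 'DELETE';
  headers: Record<string, string>;
}

// Builds (but does not send) a request for the user memory admin endpoint.
function adminUserMemoryRequest(
  baseUrl: string,
  org: string,
  userId: string,
  adminKey: string,
  method: 'GET' | 'DELETE'
): AdminRequest {
  const path =
    `/api/v1/admin/orgs/${encodeURIComponent(org)}` +
    `/users/${encodeURIComponent(userId)}/memory`;
  return {
    url: baseUrl.replace(/\/+$/, '') + path, // avoid a double slash
    method,
    headers: { Authorization: `Bearer ${adminKey}` },
  };
}
```

The result can be passed to `fetch(request.url, { method: request.method, headers: request.headers })`.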

Chat Request/Response

// Request
interface ChatRequest {
  client_id: string;
  user_id: string;
  message: string;
  message_type: 'text' | 'audio';
  org?: string;
  progress_callback_url?: string; // webhook URL for progress updates
  progress_mode?: 'full' | 'incremental';
  progress_throttle_seconds?: number;
}

// Response
interface ChatResponse {
  responses: string[];
  response_language: string;
  voice_audio_base64: string | null;
}
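A short usage sketch for these types follows. The interfaces are repeated so the example is self-contained; the field values (`web-client`, `user-123`, `example-org`) are made up, and `isChatResponse` is a hypothetical runtime check, not part of the worker.

```typescript
interface ChatRequest {
  client_id: string;
  user_id: string;
  message: string;
  message_type: 'text' | 'audio';
  org?: string;
  progress_callback_url?: string;
  progress_mode?: 'full' | 'incremental';
  progress_throttle_seconds?: number;
}

interface ChatResponse {
  responses: string[];
  response_language: string;
  voice_audio_base64: string | null;
}

// Example request with only the required fields plus an org (values are illustrative).
const exampleChatRequest: ChatRequest = {
  client_id: 'web-client',
  user_id: 'user-123',
  message: 'What does "grace" mean in Ephesians 2:8?',
  message_type: 'text',
  org: 'example-org',
};

const exampleChatBody = JSON.stringify(exampleChatRequest);

// Runtime guard for parsed JSON, since the network gives no type guarantees.
function isChatResponse(value: unknown): value is ChatResponse {
  if (typeof value !== 'object' || value === null) return false;
  const v = value as Record<string, unknown>;
  const responses = v['responses'];
  const audio = v['voice_audio_base64'];
  return (
    Array.isArray(responses) &&
    responses.every((r) => typeof r === 'string') &&
    typeof v['response_language'] === 'string' &&
    (audio === null || typeof audio === 'string')
  );
}
```

`exampleChatBody` would be sent as the JSON body of `POST /api/v1/chat`, and the parsed reply checked with `isChatResponse` before use.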

Request Serialization & Concurrency

Chat requests are processed one at a time per user to ensure conversation history integrity. Concurrent requests receive 429 Too Many Requests with a Retry-After header.

API consumers must implement retry logic for 429 responses. As a safety mechanism, a lock older than 90 seconds is treated as stale, so a request that never released its lock cannot block a user indefinitely.
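The locking rule can be sketched as a pure decision function. This is an illustration under assumptions, not the Durable Object's actual code: `SessionLock`, `tryAcquire`, and the fixed 5000 ms retry hint are hypothetical, and in the real worker the lock record would be read from and written to Durable Object storage.

```typescript
const STALE_LOCK_MS = 90 * 1000; // the 90-second stale threshold described above

interface SessionLock {
  requestId: string;
  acquiredAt: number; // epoch milliseconds
}

type LockAttempt =
  | { acquired: true; lock: SessionLock }
  | { acquired: false; retryAfterMs: number };

// Decides whether a new request may take the per-user lock at time `now`.
function tryAcquire(existing: SessionLock | null, requestId: string, now: number): LockAttempt {
  const held = existing !== null && now - existing.acquiredAt < STALE_LOCK_MS;
  if (held) {
    // Another request is in flight: the caller should answer with HTTP 429.
    return { acquired: false, retryAfterMs: 5000 };
  }
  // No lock, or the previous lock is stale and can be taken over.
  return { acquired: true, lock: { requestId, acquiredAt: now } };
}
```

Keeping the decision separate from storage access makes the stale-lock behavior easy to unit test.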

See the 429 Response Format below.

429 Response Format

{
  "error": "Request in progress",
  "code": "CONCURRENT_REQUEST_REJECTED",
  "message": "Another request for this user is currently being processed. Please retry.",
  "retry_after_ms": 5000
}
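On the client side, the retry delay can be derived from this response. The helper below is a sketch: `retryDelayMs` is a hypothetical name, preferring the body's `retry_after_ms` over the `Retry-After` header is an assumption, and the header is handled only in its numeric (seconds) form, not as an HTTP date.

```typescript
// Returns how long to wait before retrying, or null when the status is not 429.
function retryDelayMs(
  statusCode: number,
  retryAfterHeader: string | null,
  body: unknown,
  fallbackMs: number = 5000
): number | null {
  if (statusCode !== 429) return null;

  // Prefer the millisecond hint from the JSON body when it is present and valid.
  if (typeof body === 'object' && body !== null) {
    const ms = (body as Record<string, unknown>)['retry_after_ms'];
    if (typeof ms === 'number' && Number.isFinite(ms) && ms >= 0) return ms;
  }

  // Otherwise fall back to the Retry-After header, which is expressed in seconds.
  if (retryAfterHeader !== null && retryAfterHeader.trim() !== '') {
    const seconds = Number(retryAfterHeader);
    if (Number.isFinite(seconds) && seconds >= 0) return seconds * 1000;
  }

  return fallbackMs;
}
```

A caller would sleep for the returned number of milliseconds and resend the same request, ideally with a cap on the number of attempts.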

Project Structure

bt-servant-worker/
├── src/
│   ├── index.ts                         # Worker entry point + admin routes
│   ├── config/                          # Environment configuration types
│   ├── durable-objects/                 # UserSession Durable Object
│   ├── services/
│   │   ├── claude/                      # Orchestrator, system prompt, tools
│   │   ├── code-execution/             # QuickJS sandbox
│   │   ├── mcp/                        # MCP discovery, catalog, budget, health
│   │   ├── memory/                     # User persistent memory (parser, store)
│   │   └── progress/                   # Webhook progress callbacks
│   ├── types/                          # Shared TypeScript types
│   └── utils/                          # Logger, crypto, errors, validation
├── tests/
│   ├── unit/                           # Unit tests
│   └── e2e/                            # End-to-end tests
├── docs/
│   ├── implementation-plan.md          # Full implementation plan
│   └── plans/                          # Feature implementation plans
├── .github/workflows/
│   ├── ci.yml                          # CI: lint, typecheck, test, build
│   ├── deploy.yml                      # Deploy to Cloudflare (after CI passes)
│   └── claude-review.yml              # Claude PR reviews
├── wrangler.toml                       # Cloudflare Worker config
├── package.json                        # Dependencies and scripts
├── tsconfig.json                       # TypeScript config (strict mode)
├── eslint.config.js                    # ESLint with fitness functions
├── .dependency-cruiser.js              # Architecture enforcement
├── .prettierrc                         # Code formatting
└── .husky/pre-commit                   # Pre-commit hooks

Development

Prerequisites

Node.js >= 20.0.0
pnpm >= 9.0.0

Setup

pnpm install

Commands

Command	Description
`pnpm dev`	Start local development server
`pnpm build`	Build the worker
`pnpm test`	Run tests
`pnpm lint`	Run ESLint
`pnpm format`	Format code with Prettier
`pnpm check`	TypeScript type check
`pnpm architecture`	Check for circular dependencies

Local Testing

pnpm dev
# In another terminal:
curl http://localhost:8787/health

Code Quality (Fitness Functions)

This project enforces strict code quality rules via ESLint:

Rule	Limit	Purpose
`max-lines-per-function`	50 lines	Keep functions small and focused
`max-statements`	25	Limit complexity per function
`complexity`	10	Cyclomatic complexity limit
`max-depth`	4	Limit nested blocks
`max-nested-callbacks`	3	Prevent callback hell
`max-params`	5	Encourage parameter objects
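Expressed as ESLint rule settings, the table above might look like the following. The rule names and limits come from the table; the `error` severity, the `files` glob, and the variable names are assumptions, and the project's actual `eslint.config.js` may be organized differently.

```typescript
// Sketch of the fitness-function rules as an ESLint flat-config entry.
const fitnessRules = {
  'max-lines-per-function': ['error', { max: 50 }],
  'max-statements': ['error', 25],
  complexity: ['error', 10],
  'max-depth': ['error', 4],
  'max-nested-callbacks': ['error', 3],
  'max-params': ['error', 5],
} as const;

// Assumption: the rules apply to TypeScript sources under src/.
const eslintConfigSketch = [{ files: ['src/**/*.ts'], rules: fitnessRules }];
```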

Architecture Enforcement

Dependency-cruiser enforces onion architecture:

No circular dependencies allowed
types/ cannot import from routes, services, or durable-objects
services/ cannot import from routes
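These three constraints map naturally onto dependency-cruiser `forbidden` rules. The sketch below shows one possible encoding; the rule names, severities, and path patterns are assumptions, and the real `.dependency-cruiser.js` is a JavaScript file whose contents are not shown here.

```typescript
interface ForbiddenRule {
  name: string;
  severity: 'error' | 'warn' | 'info';
  from: { path?: string };
  to: { path?: string; circular?: boolean };
}

// Hypothetical rule set mirroring the three constraints listed above.
const forbiddenRules: ForbiddenRule[] = [
  { name: 'no-circular', severity: 'error', from: {}, to: { circular: true } },
  {
    name: 'types-are-innermost',
    severity: 'error',
    from: { path: '^src/types' },
    to: { path: '^src/(routes|services|durable-objects)' },
  },
  {
    name: 'services-not-to-routes',
    severity: 'error',
    from: { path: '^src/services' },
    to: { path: '^src/routes' },
  },
];

const dependencyCruiserSketch = { forbidden: forbiddenRules };
```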

Deployment

Deployments go through CI/CD (never deploy directly):

Push to a branch and create a PR
CI runs (lint, typecheck, test, build)
Claude PR Review runs automatically
On merge to main, deploy runs automatically

The worker will be available at: https://bt-servant-worker.<your-subdomain>.workers.dev

Pre-commit Hooks

Every commit runs:

lint-staged — ESLint + Prettier on staged files
Type check — tsc --noEmit
Architecture check — dependency-cruiser
Tests — vitest
Build — wrangler build

If any check fails, the commit is blocked.

Related Projects

bt-servant-engine — The Python/FastAPI predecessor
bt-servant-web-client — Next.js frontend
bt-servant-whatsapp-gateway — WhatsApp integration
lasker-api — Reference for dynamic code execution pattern

License

Private
