RouteBox-beta

One proxy. Every model. Your rules.
A macOS menu bar app that routes your LLM API calls to the best provider — by cost, speed, or quality.

Quickstart • Why I built RouteBox • Features • How it works • Providers • Installation • Contributing

What is RouteBox?

RouteBox is a native macOS menu bar app that runs a local OpenAI-compatible proxy on localhost:3001. Point any app at it instead of directly at OpenAI / Anthropic / Google, and RouteBox picks the best provider for each request based on your rules, tracks cost and latency in real time, and supports both cloud APIs and local models (Ollama / LM Studio).

Your App  →  RouteBox (localhost:3001)  →  Cloud: OpenAI / Anthropic / Google / DeepSeek / MiniMax / Kimi / FLock.io
                                         →  Local: Ollama / LM Studio

Why I built RouteBox

I work across multiple AI providers every day — testing models, switching keys, comparing outputs. What started as a simple annoyance (manually swapping base_url and api_key in every script) became a real productivity tax. I was losing track of which provider I was hitting, how much I was spending, and whether a cheaper model could've handled the same task.

RouteBox started as a personal tool: a lightweight local proxy that sits in the menu bar and handles the routing for me. I wanted something that respects the way macOS apps should feel — quiet, native, always available but never in the way. One endpoint for everything, with full visibility into what's happening under the hood.

It's MIT licensed and open source because I think developer tools should be transparent and hackable.

Features

Intelligent routing — auto-route requests by cost, speed, or quality.
Content-aware rules — detect code tasks, long context, or custom patterns and route accordingly.
Real-time dashboard — track requests, tokens, cost, and savings at a glance.
Cloud + Local — 7 cloud providers + local models via Ollama and LM Studio.
Full request logs — every call logged with model, provider, latency, and token count.
Usage analytics — cost trends, latency comparison, model usage breakdown.
Budget alerts — set monthly limits with warnings at 80% and 100%.
Native UX — Tauri v2 + React, frosted glass, SF Pro, feels like a macOS system tool.
Docker gateway — run the proxy headless for server or team use.

Coming soon

RouteBox Cloud — pay-as-you-go API access with credits. No need to configure multiple API keys — just top up and go.
Provider health monitoring — auto-detect latency spikes, rate limits, and downtime with automatic fallback.
More platforms — Windows, Linux, and iOS/Android support.

How it works

Routing Flow

flowchart LR
    subgraph App["Your App"]
        R[API Request]
    end

    subgraph RB["RouteBox (localhost:3001)"]
        direction TB
        Auth[Auth Check] --> Rules[Match Rules]
        Rules --> Strategy[Apply Strategy]
        Strategy --> Select[Select Provider]
    end

    subgraph Cloud["Cloud Providers"]
        direction TB
        OA[OpenAI]
        AN[Anthropic]
        GG[Google]
        DS[DeepSeek]
        MM[MiniMax]
        KM[Kimi]
        FL[FLock.io]
    end

    subgraph Local["Local Models"]
        direction TB
        OL[Ollama]
        LM[LM Studio]
    end

    R --> Auth
    Select --> Cloud
    Select --> Local

    classDef appNode fill:#e3f2fd,stroke:#1976d2,stroke-width:2px
    classDef rbNode fill:#e8f5e8,stroke:#4caf50,stroke-width:2px
    classDef cloudNode fill:#fff8e1,stroke:#ff9800,stroke-width:2px
    classDef localNode fill:#f3e5f5,stroke:#9c27b0,stroke-width:2px

    class R appNode
    class Auth,Rules,Strategy,Select rbNode
    class OA,AN,GG,DS,MM,KM,FL cloudNode
    class OL,LM localNode

Routing Strategies

Strategy	Behavior
Smart Auto	AI picks the best route per request based on content analysis
Cost First	Always pick the cheapest available provider
Speed First	Always pick the lowest-latency provider
Quality First	Always pick the best available model tier
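
As a rough illustration of the three deterministic strategies in the table, the sketch below picks one provider from a list of candidates. It is not RouteBox's actual implementation: the `Candidate` fields, the strategy identifiers, and every number in `CANDIDATES` are hypothetical and exist only for this example. Smart Auto is left out because it depends on content analysis that this README does not specify.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """One available provider for a request (all fields are hypothetical)."""
    provider: str
    cost_per_1k_tokens: float  # USD, made-up figure
    avg_latency_ms: float      # made-up figure
    quality_tier: int          # higher means a better model tier


def pick_provider(candidates, strategy):
    """Return the candidate that the given strategy prefers."""
    if not candidates:
        raise ValueError("no available providers")
    if strategy == "cost_first":
        return min(candidates, key=lambda c: c.cost_per_1k_tokens)
    if strategy == "speed_first":
        return min(candidates, key=lambda c: c.avg_latency_ms)
    if strategy == "quality_first":
        return max(candidates, key=lambda c: c.quality_tier)
    raise ValueError(f"unknown strategy: {strategy}")


# Illustrative data only; these are not real prices or latencies.
CANDIDATES = [
    Candidate("provider-a", 0.010, 420.0, 3),
    Candidate("provider-b", 0.002, 650.0, 1),
    Candidate("provider-c", 0.005, 180.0, 2),
]

for name in ("cost_first", "speed_first", "quality_first"):
    print(name, "->", pick_provider(CANDIDATES, name).provider)
```

Each strategy reduces to a single `min` or `max` over one metric, which is why the table can describe them in one line each.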

Content-Aware Rules

Rule Type	Triggers when...	Example
Alias	Model name matches a virtual name you define	`route-code` → `deepseek-coder`
Code	Request contains ≥3 code markers	Auto-route code tasks to DeepSeek
Long	Message ≥8,000 characters	Auto-route long context to Gemini
General	Catch-all fallback	Default model for everything else
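
The rule types above could be evaluated roughly as follows. This is a sketch under several assumptions, not the gateway's real logic: the list of code markers, the order in which rules are checked (taken from the table order), and the reading of "Message ≥8,000 characters" as "any single message" are all guesses made for illustration. Only the two thresholds (3 markers, 8,000 characters) come from the table.

```python
# Hypothetical marker list; the real set of code markers is not documented here.
CODE_MARKERS = ("```", "def ", "function ", "class ", "import ", "#include", "return ", "=> ")
CODE_MARKER_THRESHOLD = 3   # from the table: at least 3 code markers
LONG_MESSAGE_CHARS = 8000   # from the table: at least 8,000 characters


def count_code_markers(text):
    """Count how many distinct markers appear in the text."""
    return sum(1 for marker in CODE_MARKERS if marker in text)


def match_rule(model, messages, aliases):
    """Return (rule_type, target_model_or_None) for a chat request."""
    # Alias: the requested model name is a virtual name defined by the user.
    if model in aliases:
        return "alias", aliases[model]
    contents = [m["content"] for m in messages if isinstance(m.get("content"), str)]
    # Code: enough code markers across the request.
    if count_code_markers("\n".join(contents)) >= CODE_MARKER_THRESHOLD:
        return "code", None
    # Long: assumed to mean any single message at or above the limit.
    if any(len(c) >= LONG_MESSAGE_CHARS for c in contents):
        return "long", None
    # General: catch-all fallback.
    return "general", None


aliases = {"route-code": "deepseek-coder"}
print(match_rule("route-code", [{"role": "user", "content": "Hi"}], aliases))
print(match_rule("gpt-4o", [{"role": "user", "content": "Hello!"}], aliases))
```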

Model Preferences

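Pin: force a model to a single provider. For example, pin gpt-4o to OpenAI and RouteBox always uses OpenAI for it and never falls back.
Exclude: block a provider for a model. For example, never route gpt-4o through provider X.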
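
One way to picture pin and exclude is as a filter applied to the provider list before a strategy runs. The function below is a minimal sketch with hypothetical names and data shapes (`pins` maps a model to one provider, `excludes` maps a model to a set of providers); it is not taken from the RouteBox source.

```python
def eligible_providers(model, available, pins, excludes):
    """Filter the available providers for a model using pin/exclude preferences."""
    if model in pins:
        pinned = pins[model]
        # A pin never falls back: if the pinned provider is unavailable,
        # there are no eligible providers at all.
        return [pinned] if pinned in available else []
    blocked = excludes.get(model, set())
    return [p for p in available if p not in blocked]


available = ["openai", "provider-x"]
print(eligible_providers("gpt-4o", available, {"gpt-4o": "openai"}, {}))
print(eligible_providers("gpt-4o", available, {}, {"gpt-4o": {"provider-x"}}))
```

Returning an empty list for an unavailable pinned provider reflects the "never fall back" wording: the request would fail instead of silently going elsewhere.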

Supported Providers

Cloud

Provider	Models	API Key
OpenAI	GPT-5.4, GPT-5	platform.openai.com
Anthropic	Claude Opus 4.6, Claude Sonnet 4.6, Claude Haiku 4.5	console.anthropic.com
Google	Gemini 3.1 Pro, Gemini 3.1 Flash	aistudio.google.com
DeepSeek	DeepSeek-V3.2, DeepSeek-R1	platform.deepseek.com
MiniMax	MiniMax-M2.5, MiniMax-M2.1	platform.minimaxi.com
Kimi	Kimi K2.5, Kimi K2, Moonshot	platform.moonshot.ai
FLock.io	Qwen3-235B, Qwen3.5, DeepSeek-V3.2, Kimi K2.5	platform.flock.io

Local

Provider	Setup
Ollama	Install Ollama and pull any model — RouteBox auto-detects it
LM Studio	Run LM Studio's local server — RouteBox connects automatically

Tip: FLock API Platform provides access to open-source models at competitive rates — a good option for cost-effective routing.

Quickstart

Download (recommended)

Grab the latest RouteBox.dmg from Releases.
Drag RouteBox into Applications and launch it.
The app appears in your menu bar. Press ⌘⇧R to toggle the panel.
Go to Settings → Providers and add your API keys.
Point any OpenAI-compatible client at http://localhost:3001/v1:

from openai import OpenAI

# Point the client at the local RouteBox proxy instead of api.openai.com.
client = OpenAI(
    base_url="http://localhost:3001/v1",
    api_key="YOUR_ROUTEBOX_TOKEN",  # shown in Settings → Authentication
)

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

That's it. RouteBox handles the rest.
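
If you would rather not install the openai package, the same call can be made with the Python standard library. This sketch assumes the proxy exposes the usual OpenAI-style `/chat/completions` path under `/v1` and accepts the RouteBox token as a Bearer token, which is what the OpenAI client above sends. The helper names `build_chat_request` and `send` are made up for this example.

```python
import json
import urllib.request


def build_chat_request(token, model, prompt, base_url="http://localhost:3001/v1"):
    """Build (but do not send) a chat completion request for the local proxy."""
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/chat/completions",
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )


def send(request, timeout=60):
    """Send the request and return the decoded JSON response."""
    with urllib.request.urlopen(request, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


# Usage (requires RouteBox to be running locally):
# reply = send(build_chat_request("YOUR_ROUTEBOX_TOKEN", "gpt-4o", "Hello!"))
# print(reply["choices"][0]["message"]["content"])
```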

Installation

From Releases (recommended)

Download RouteBox.dmg from Releases.
Drag RouteBox into Applications.
Launch and start adding provider keys.

From Source

Prerequisites: macOS 12+, Node.js 20+, pnpm 10+, Bun 1.x, Rust (stable), Xcode CLT (xcode-select --install).

git clone https://github.com/createpjf/RouteBox.git
cd RouteBox
pnpm install
cd apps/desktop
pnpm tauri dev

Docker (Gateway Only)

Run the routing gateway headless — no macOS desktop UI, ideal for servers or team use:

cd apps/gateway
docker build -t routebox-gateway .
docker run -p 3001:3001 \
  -e OPENAI_API_KEY=sk-... \
  -e ANTHROPIC_API_KEY=sk-ant-... \
  -v routebox-data:/data \
  routebox-gateway

See apps/gateway/.env.example for all environment variables.

Build DMG

cd apps/desktop
pnpm tauri build

# With updater signing
TAURI_SIGNING_PRIVATE_KEY="$(cat src-tauri/routebox-signer.key)" \
TAURI_SIGNING_PRIVATE_KEY_PASSWORD="routebox" \
pnpm tauri build

App Overview

Tab	What it shows
Dashboard	Requests, tokens, cost, savings, traffic sparkline, provider status
My Usage	Usage analytics with cost trends, provider latency, model breakdown
Routing	Strategy selector, model preferences (pin/exclude), content-aware rules
Logs	Full request history with model, provider, latency, token count per request
Account	API key management, local model connections, RouteBox Cloud

Settings

Setting	Location	Notes
Provider API Keys	Account → Providers	Cloud + local provider keys
Monthly Budget	Settings → Budget	Alerts at 80% and 100%
Gateway URL	Settings → Connection	Default `http://localhost:3001`, customizable
Auth Token	Settings → Authentication	Auto-generated for proxy access
Auto-start Gateway	Settings → Gateway	On/off toggle
Check for Updates	Settings → About	Downloads and installs automatically
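
The Monthly Budget thresholds in the table can be expressed as a simple check. This is only a sketch of the 80% and 100% cut-offs; the function name, the status strings, and the use of US dollars are assumptions made for the example.

```python
WARNING_RATIO = 0.8  # alert at 80% of the monthly budget
LIMIT_RATIO = 1.0    # alert at 100% of the monthly budget


def budget_status(spent_usd, monthly_limit_usd):
    """Classify current spend against the monthly budget."""
    if monthly_limit_usd <= 0:
        raise ValueError("monthly limit must be positive")
    ratio = spent_usd / monthly_limit_usd
    if ratio >= LIMIT_RATIO:
        return "limit_reached"
    if ratio >= WARNING_RATIO:
        return "warning"
    return "ok"


for spent in (42.0, 85.0, 100.0):
    print(spent, "->", budget_status(spent, 100.0))
```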

Keyboard Shortcuts

Shortcut	Action
`⌘⇧R`	Toggle panel (global)
`⌘C`	Copy API key
`⌘P`	Pause/resume traffic

Tech Stack

Desktop: Tauri v2 (Rust) + React 19 + TypeScript + Tailwind CSS v4
Gateway: Bun + Hono + bun:sqlite
Design: SF Pro, frosted glass (macOS native feel)

Contributing

Contributions are welcome! Here's how to get started:

Fork the repo and create a branch from main.
Make your changes and test locally with pnpm tauri dev.
Open a PR with a clear description of what you changed and why.

If you find a bug or have a feature idea, open an issue.

License

MIT

Built with ☕ and too many API keys.
GitHub · Download · Issues
