Sona 🎧

Sona is a local transcription runner built on top of whisper.cpp.

It runs as a standalone process and exposes an OpenAI-compatible HTTP API, so other apps (desktop apps, Python scripts, etc.) can spawn it and talk to it easily.

Think of it as a small, fast, local Whisper server you control.

Why Sona ✨

Fully local transcription
Cross-platform single binary
GPU-accelerated by default
OpenAI-compatible API
Designed to be spawned and owned by another process
No heavy setup, no long-running system service

Platform & GPU Support 🚀

Sona ships prebuilt binaries for:

macOS (x86_64 and arm64)
Linux (x86_64 and arm64)
Windows (x86_64)

GPU acceleration is enabled by default:

macOS: CoreML / Metal
Linux: Vulkan
Windows: Vulkan

Sona automatically uses the best available backend for the platform.

How It Works 🧠

Your app starts the Sona process
Sona binds to a local port (optionally chosen by the OS)
Your app talks to Sona over HTTP
Models are loaded and unloaded at runtime
Transcription requests go through an OpenAI-compatible endpoint

Sona is intentionally simple and predictable, so it can be embedded into larger systems.

Quick Start ⚡

1. Download a release binary

https://github.com/thewh1teagle/sona/releases

2. Download a model

./sona pull https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-base.bin

3. Start Sona

./sona serve --port 0

Using port 0 lets the OS assign a free port automatically.

When ready, Sona prints a single machine-readable line to stdout:

{"status":"ready","port":52341}

This is intended for parent processes to detect readiness and discover the bound port.

Using Sona 🔌

Sona exposes an OpenAI-compatible transcription API.

This means:

You can use existing OpenAI SDKs
You don’t need custom client code
Switching between local (Sona) and remote (OpenAI) is trivial

See the full API reference here:

API docs: /docs
OpenAPI spec: /openapi.json

Notes & Limitations ⚠️

One transcription runs at a time per process
concurrent requests return 429
Non-WAV audio is automatically converted using ffmpeg
- system ffmpeg or a bundled binary next to sona

When to Use Sona 🎯

Sona is a good fit if you want:

Local or offline transcription
Low-latency transcription in desktop apps
Full control over models and hardware
An OpenAI-like API without the OpenAI dependency

Documentation 📚

API reference: /docs
OpenAPI schema: /openapi.json
Releases: https://github.com/thewh1teagle/sona/releases

Name		Name	Last commit message	Last commit date
Latest commit History 99 Commits
.github/workflows		.github/workflows
cmd/sona		cmd/sona
diarize		diarize
docs		docs
internal		internal
plans		plans
scripts		scripts
sonapy		sonapy
tests/manual		tests/manual
third_party/include		third_party/include
.gitignore		.gitignore
.python-version		.python-version
.whisper.cpp-commit		.whisper.cpp-commit
BUILDING.md		BUILDING.md
CLAUDE.md		CLAUDE.md
README.md		README.md
go.mod		go.mod
go.sum		go.sum
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sona 🎧

Why Sona ✨

Platform & GPU Support 🚀

How It Works 🧠

Quick Start ⚡

1. Download a release binary

2. Download a model

3. Start Sona

Using Sona 🔌

Notes & Limitations ⚠️

When to Use Sona 🎯

Documentation 📚

About

Uh oh!

Releases 9

Packages

Contributors 2

Uh oh!

Languages

thewh1teagle/sona

Folders and files

Latest commit

History

Repository files navigation

Sona 🎧

Why Sona ✨

Platform & GPU Support 🚀

How It Works 🧠

Quick Start ⚡

1. Download a release binary

2. Download a model

3. Start Sona

Using Sona 🔌

Notes & Limitations ⚠️

When to Use Sona 🎯

Documentation 📚

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 9

Packages 0

Contributors 2

Uh oh!

Languages

Packages