Karaoke from any song in your music library, powered by neural networks.
Nightingale scans your music folder, separates lead vocals from instrumentals using the UVR Karaoke model (or Demucs), transcribes lyrics with word-level timestamps via WhisperX, and plays it all back with synchronized highlighting, pitch scoring, profiles, and dynamic backgrounds.
Ships as a single binary. No manual installation of Python, ffmpeg, or ML models required โ everything is downloaded and bootstrapped automatically on first launch.
๐ค Stem Separation โ isolates lead vocals from instrumentals using the UVR Karaoke model (default) or Demucs, with adjustable guide vocal volume. The karaoke model preserves backing vocals in the instrumental for a more natural sound
๐ Word-Level Lyrics โ automatic transcription with alignment, or fetched from LRCLIB when available
๐ฏ Pitch Scoring โ real-time microphone input with pitch detection, star ratings, and per-song scoreboards
๐ค Profiles โ create and switch between player profiles; scores are tracked per profile
๐ฌ Video Files โ drop video files (.mp4, .mkv, etc.) into your music folder; vocals are separated from the audio track and the original video plays as a synchronized background
๐ 7 Background Themes โ 5 GPU shader backgrounds (Plasma, Aurora, Waves, Nebula, Starfield), Pixabay video backgrounds with 5 flavors (Nature, Underwater, Space, City, Countryside), plus automatic source video playback for video files
๐ฎ Gamepad Support โ full navigation and control via gamepad (D-pad, sticks, face buttons)
๐บ Adaptive UI Scaling โ scales to any resolution including 4K TVs
๐ฆ Self-Contained โ ffmpeg, uv, Python, PyTorch, and ML packages are all downloaded to ~/.nightingale/vendor/ on first run. Video backgrounds are pre-downloaded during setup so the first session is ready to go
Download the latest release for your platform from the Releases page and run it. On first launch, Nightingale will set up its Python environment and download ML models โ this takes a few minutes and shows a progress screen.
macOS quarantines files downloaded from the internet. Since Nightingale isn't signed with an Apple Developer ID, Gatekeeper will block it with a message like "app is damaged and can't be opened". To fix this, remove the quarantine attribute after extracting:
xattr -cr Nightingale.appAudio: .mp3, .flac, .ogg, .wav, .m4a, .aac, .wma. Video: .mp4, .mkv, .avi, .webm, .mov, .m4v.
| Action | Keyboard | Gamepad |
|---|---|---|
| Move | Arrow keys | D-pad / Left stick |
| Confirm / Select | Enter | A (South) |
| Back / Cancel | Escape | B (East) / Start |
| Switch panel | Tab | โ |
| Search songs | Type to filter | โ |
| Action | Keyboard | Gamepad |
|---|---|---|
| Pause / Resume | Space | Start |
| Exit to menu | Escape | B (East) |
| Toggle guide vocals | G | โ |
| Guide volume up/down | + / - | โ |
| Cycle background theme | T | โ |
| Cycle video flavor | F | โ |
| Toggle microphone | M | โ |
| Next microphone | N | โ |
| Toggle fullscreen | F11 | โ |
| Skip Intro / Skip Outro | On-screen buttons | A (South) |
Audio or video file
โ
โผ
โโโโโโโโโโโโโโโโโโโ
โ UVR Karaoke / โ โโโถ vocals.ogg + instrumental.ogg
โ Demucs โ (extracts audio track from videos)
โโโโโโโโโโโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโ
โ LRCLIB โ โโโถ Fetches synced lyrics if available
โโโโโโโโโโโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโ
โ WhisperX โ โโโถ Transcription + word-level alignment
โ (large-v3) โ
โโโโโโโโโโโโโโโโโโโ
โ
โผ
โโโโโโโโโโโโโโโโโโโ
โ Bevy App โ โโโถ Plays instrumental + synced lyrics
โ (Rust) โ with pitch scoring & backgrounds
โโโโโโโโโโโโโโโโโโโ (video files use source video as background)
Analysis results are cached at ~/.nightingale/cache/ using blake3 file hashes. Re-analysis only happens if the source file changes or is manually triggered.
The Python analyzer uses PyTorch and auto-detects the best backend:
| Backend | Device | Notes |
|---|---|---|
| CUDA | NVIDIA GPU | Fastest |
| MPS | Apple Silicon | macOS; WhisperX alignment falls back to CPU |
| CPU | Any | Slowest but always works |
The UVR Karaoke model uses ONNX Runtime and enables CUDA acceleration automatically on NVIDIA GPUs, or CoreML on Apple Silicon.
A song typically takes 2โ5 minutes on GPU, 10โ20 minutes on CPU.
Everything lives under ~/.nightingale/:
~/.nightingale/
โโโ cache/ # Stems, transcripts, lyrics per song
โโโ config.json # App settings
โโโ profiles.json # Player profiles and scores
โโโ videos/ # Cached Pixabay video backgrounds
โโโ sounds/ # Sound effects (celebration)
โโโ vendor/
โ โโโ ffmpeg # Downloaded ffmpeg binary
โ โโโ uv # Downloaded uv binary
โ โโโ python/ # Python 3.10 installed via uv
โ โโโ venv/ # Virtual environment with ML packages
โ โโโ analyzer/ # Extracted analyzer Python scripts
โ โโโ .ready # Marker indicating setup is complete
โโโ models/
โโโ torch/ # Demucs model cache
โโโ huggingface/ # WhisperX model cache
โโโ audio_separator/ # UVR Karaoke model cache
Pixabay video backgrounds use the Pixabay API. The API key is embedded in release builds. For development, create a .env file at the project root:
PIXABAY_API_KEY=your_key_here
The release script (make-release.sh) sources .env automatically.
| Tool | Version |
|---|---|
| Rust | 1.85+ (edition 2024) |
| Linux only | libasound2-dev, libudev-dev, libwayland-dev, libxkbcommon-dev |
git clone <repo-url> nightingale
cd nightingale
cargo build --releaseLinux / macOS:
scripts/make-release.shBuilds the release binary and packages it into nightingale-<target>.tar.gz.
Windows (PowerShell):
powershell -ExecutionPolicy Bypass -File scripts/make-release.ps1Builds the release binary and packages it into nightingale-x86_64-pc-windows-msvc.zip.
| Flag | Description |
|---|---|
--setup |
Force re-run of the first-launch bootstrap |
| Platform | Target |
|---|---|
| Linux x86_64 | x86_64-unknown-linux-gnu |
| Linux aarch64 | aarch64-unknown-linux-gnu |
| macOS ARM | aarch64-apple-darwin |
| macOS Intel | x86_64-apple-darwin |
| Windows x86_64 | x86_64-pc-windows-msvc |
GPL-3.0-or-later โ see LICENSE.
