GitHub - rzru/nightingale: Machine learning powered Karaoke app (with scores!)

Karaoke from any song in your music library, powered by neural networks.

Nightingale scans your music folder, separates lead vocals from instrumentals using the UVR Karaoke model (or Demucs), transcribes lyrics with word-level timestamps via WhisperX, and plays it all back with synchronized highlighting, pitch scoring, profiles, and dynamic backgrounds.

Ships as a single binary. No manual installation of Python, ffmpeg, or ML models required — everything is downloaded and bootstrapped automatically on first launch.

Features

🎤 Stem Separation — isolates lead vocals from instrumentals using the UVR Karaoke model (default) or Demucs, with adjustable guide vocal volume. The karaoke model preserves backing vocals in the instrumental for a more natural sound

📝 Word-Level Lyrics — automatic transcription with alignment, or fetched from LRCLIB when available

🎯 Pitch Scoring — real-time microphone input with pitch detection, star ratings, and per-song scoreboards

👤 Profiles — create and switch between player profiles; scores are tracked per profile

🎬 Video Files — drop video files (.mp4, .mkv, etc.) into your music folder; vocals are separated from the audio track and the original video plays as a synchronized background

🌌 7 Background Themes — 5 GPU shader backgrounds (Plasma, Aurora, Waves, Nebula, Starfield), Pixabay video backgrounds with 5 flavors (Nature, Underwater, Space, City, Countryside), plus automatic source video playback for video files

🎮 Gamepad Support — full navigation and control via gamepad (D-pad, sticks, face buttons)

📺 Adaptive UI Scaling — scales to any resolution including 4K TVs

📦 Self-Contained — ffmpeg, uv, Python, PyTorch, and ML packages are all downloaded to ~/.nightingale/vendor/ on first run. Video backgrounds are pre-downloaded during setup so the first session is ready to go

Quick start

Download the latest release for your platform from the Releases page and run it. On first launch, Nightingale will set up its Python environment and download ML models — this takes a few minutes and shows a progress screen.

macOS

macOS quarantines files downloaded from the internet. Since Nightingale isn't signed with an Apple Developer ID, Gatekeeper will block it with a message like "app is damaged and can't be opened". To fix this, remove the quarantine attribute after extracting:

xattr -cr Nightingale.app

Supported formats

Audio: .mp3, .flac, .ogg, .wav, .m4a, .aac, .wma. Video: .mp4, .mkv, .avi, .webm, .mov, .m4v.

Controls

Navigation

Action	Keyboard	Gamepad
Move	Arrow keys	D-pad / Left stick
Confirm / Select	Enter	A (South)
Back / Cancel	Escape	B (East) / Start
Switch panel	Tab	—
Search songs	Type to filter	—

Playback

Action	Keyboard	Gamepad
Pause / Resume	Space	Start
Exit to menu	Escape	B (East)
Toggle guide vocals	G	—
Guide volume up/down	+ / -	—
Cycle background theme	T	—
Cycle video flavor	F	—
Toggle microphone	M	—
Next microphone	N	—
Toggle fullscreen	F11	—
Skip Intro / Skip Outro	On-screen buttons	A (South)

How it works

Audio or video file
        │
        ▼
  ┌─────────────────┐
  │  UVR Karaoke /   │  ──▶  vocals.ogg + instrumental.ogg
  │  Demucs          │       (extracts audio track from videos)
  └─────────────────┘
        │
        ▼
  ┌─────────────────┐
  │  LRCLIB          │  ──▶  Fetches synced lyrics if available
  └─────────────────┘
        │
        ▼
  ┌─────────────────┐
  │  WhisperX        │  ──▶  Transcription + word-level alignment
  │  (large-v3)      │
  └─────────────────┘
        │
        ▼
  ┌─────────────────┐
  │  Bevy App        │  ──▶  Plays instrumental + synced lyrics
  │  (Rust)          │       with pitch scoring & backgrounds
  └─────────────────┘       (video files use source video as background)

Analysis results are cached at ~/.nightingale/cache/ using blake3 file hashes. Re-analysis only happens if the source file changes or is manually triggered.

Hardware

The Python analyzer uses PyTorch and auto-detects the best backend:

Backend	Device	Notes
CUDA	NVIDIA GPU	Fastest
MPS	Apple Silicon	macOS; WhisperX alignment falls back to CPU
CPU	Any	Slowest but always works

The UVR Karaoke model uses ONNX Runtime and enables CUDA acceleration automatically on NVIDIA GPUs, or CoreML on Apple Silicon.

A song typically takes 2–5 minutes on GPU, 10–20 minutes on CPU.

Data storage

Everything lives under ~/.nightingale/:

~/.nightingale/
├── cache/              # Stems, transcripts, lyrics per song
├── config.json         # App settings
├── profiles.json       # Player profiles and scores
├── videos/             # Cached Pixabay video backgrounds
├── sounds/             # Sound effects (celebration)
├── vendor/
│   ├── ffmpeg          # Downloaded ffmpeg binary
│   ├── uv              # Downloaded uv binary
│   ├── python/         # Python 3.10 installed via uv
│   ├── venv/           # Virtual environment with ML packages
│   ├── analyzer/       # Extracted analyzer Python scripts
│   └── .ready          # Marker indicating setup is complete
└── models/
    ├── torch/          # Demucs model cache
    ├── huggingface/    # WhisperX model cache
    └── audio_separator/ # UVR Karaoke model cache

Video backgrounds

Pixabay video backgrounds use the Pixabay API. The API key is embedded in release builds. For development, create a .env file at the project root:

PIXABAY_API_KEY=your_key_here

The release script (make-release.sh) sources .env automatically.

Building from source

Prerequisites

Tool	Version
Rust	1.85+ (edition 2024)
Linux only	`libasound2-dev`, `libudev-dev`, `libwayland-dev`, `libxkbcommon-dev`

Development build

git clone <repo-url> nightingale
cd nightingale
cargo build --release

Local release

Linux / macOS:

scripts/make-release.sh

Builds the release binary and packages it into nightingale-<target>.tar.gz.

Windows (PowerShell):

powershell -ExecutionPolicy Bypass -File scripts/make-release.ps1

Builds the release binary and packages it into nightingale-x86_64-pc-windows-msvc.zip.

CLI flags

Flag	Description
`--setup`	Force re-run of the first-launch bootstrap

Supported platforms

Platform	Target
Linux x86_64	`x86_64-unknown-linux-gnu`
Linux aarch64	`aarch64-unknown-linux-gnu`
macOS ARM	`aarch64-apple-darwin`
macOS Intel	`x86_64-apple-darwin`
Windows x86_64	`x86_64-pc-windows-msvc`

License

GPL-3.0-or-later — see LICENSE.

Name		Name	Last commit message	Last commit date
Latest commit History 132 Commits
.cargo		.cargo
analyzer		analyzer
assets		assets
packaging		packaging
scripts		scripts
site		site
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Cross.toml		Cross.toml
LICENSE		LICENSE
README.md		README.md
build.rs		build.rs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Features

Quick start

macOS

Supported formats

Controls

Navigation

Playback

How it works

Hardware

Data storage

Video backgrounds

Building from source

Prerequisites

Development build

Local release

CLI flags

Supported platforms

License

About

Uh oh!

Releases 3

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Features

Quick start

macOS

Supported formats

Controls

Navigation

Playback

How it works

Hardware

Data storage

Video backgrounds

Building from source

Prerequisites

Development build

Local release

CLI flags

Supported platforms

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages