Hibiki-Zero

Hibiki-Zero is a real-time, multilingual speech translation model. It translates French, Spanish, Portuguese, and German speech into English accurately, with low latency, high audio quality, and voice transfer.

🤗 Hugging Face Model Card | ⚙️ Tech report | 📄 Paper | 🎧 More samples

Requirements

Hibiki-Zero is a 3B-parameter model and requires an NVIDIA GPU to run: 8 GB VRAM should work, 12 GB is safe.
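
To check a machine against these numbers before downloading anything, a small script along the following lines can help. It is only a sketch: it assumes nvidia-smi is on the PATH, the helper names (parse_vram_mib, classify_vram, query_gpus) are made up for illustration and are not part of hibiki-zero, and the thresholds are simply the 8 GB and 12 GB figures quoted above.

```python
import subprocess

MIN_VRAM_GB = 8    # "should work" according to the requirements above
SAFE_VRAM_GB = 12  # "safe" according to the requirements above


def parse_vram_mib(nvidia_smi_output):
    """Turn nvidia-smi CSV output (one MiB value per line) into a list of ints."""
    return [int(line.strip()) for line in nvidia_smi_output.splitlines() if line.strip()]


def classify_vram(total_mib):
    """Map a GPU's total memory onto the 8 GB / 12 GB guidance.

    Cards often report slightly less than their nominal size (for example
    8188 MiB on an "8 GB" card), so the value is rounded to the nearest GB.
    This is a heuristic, not a guarantee that the model will fit.
    """
    total_gb = round(total_mib / 1024)
    if total_gb >= SAFE_VRAM_GB:
        return "safe"
    if total_gb >= MIN_VRAM_GB:
        return "should work"
    return "likely insufficient"


def query_gpus():
    """Ask nvidia-smi for the total memory of each GPU, in MiB."""
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.total", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_vram_mib(result.stdout)
```

Calling classify_vram on each value returned by query_gpus() gives one verdict per GPU. Free memory at launch time can be lower than the total if other processes are using the card.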

Run the server

Hibiki-Zero comes with a server you can run to interact with Hibiki in real time. To run it, just use:

uvx -p 3.13 hibiki-zero serve [--gradio-tunnel]

Then go to the URL displayed to try out Hibiki-Zero. The --gradio-tunnel flag will forward the server to a public URL that you can access from anywhere.

If you don't have uv, you must first install hibiki-zero with pip install hibiki-zero and then run the server with hibiki-zero serve [--gradio-tunnel].
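
The choice between the two launch paths above can also be scripted. The sketch below only builds the command line; build_serve_command is a hypothetical helper, not part of hibiki-zero, and the pip branch assumes pip install hibiki-zero has already been run.

```python
import shutil
from typing import List, Optional


def build_serve_command(gradio_tunnel: bool = False,
                        uvx_available: Optional[bool] = None) -> List[str]:
    """Return the server command, preferring uvx when it is installed."""
    if uvx_available is None:
        # Detect uvx on the PATH unless the caller overrides the check.
        uvx_available = shutil.which("uvx") is not None
    if uvx_available:
        cmd = ["uvx", "-p", "3.13", "hibiki-zero", "serve"]
    else:
        # Assumes hibiki-zero was installed beforehand with pip.
        cmd = ["hibiki-zero", "serve"]
    if gradio_tunnel:
        cmd.append("--gradio-tunnel")
    return cmd
```

The returned list can be passed directly to subprocess.run, which will block for as long as the server is running.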

Run inference

If you'd like to run Hibiki-Zero on existing audio files, run:

uvx -p 3.13 hibiki-zero generate [--file /path/to/my/audio.wav --file /path/to/another/audio.mp3]

Batch inference is supported, meaning you can run the model on multiple audio files at the same time.
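
For a larger batch, typing one --file flag per input gets tedious. As a rough sketch, the command can be assembled from a list of paths; build_generate_command is a hypothetical helper, and the only CLI pieces it relies on are the generate subcommand and the repeated --file flag shown above.

```python
import shlex
from typing import Iterable, List


def build_generate_command(audio_paths: Iterable) -> List[str]:
    """Build a generate command with one --file flag per input path."""
    cmd = ["uvx", "-p", "3.13", "hibiki-zero", "generate"]
    for path in audio_paths:
        # str() lets callers pass pathlib.Path objects as well as strings.
        cmd += ["--file", str(path)]
    return cmd


def as_shell_string(cmd: List[str]) -> str:
    """Quote the command so it can be pasted into a shell."""
    return shlex.join(cmd)
```

For example, build_generate_command(sorted(pathlib.Path("recordings").glob("*.wav"))) would cover every WAV file in a hypothetical recordings directory, and the result can be handed to subprocess.run.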

Local development

We recommend using uv: run anything in this repository with uv run. For example:

uv run some_file.py

or

uv run hibiki-zero serve

If you use pip, run pip install -e . before executing Python commands.
