sinapsis-speech v0.4.0

@Natalia-OsorioClavijo released this 05 Jun 21:26

Sinapsis-speech V0.4.0

This release of the sinapsis-speech package adds two new integrations, Sinapsis Orpheus-CPP for text-to-speech (TTS) and Sinapsis Parakeet-TDT for speech-to-text (STT), and extends the ElevenLabs package to support ElevenLabs 2.0+.

🚀 New Integrations
Sinapsis Orpheus-CPP
Enables text-to-speech (TTS) using the Orpheus-TTS engine, providing high-quality neural voice synthesis.

OrpheusTTS

Converts text to speech using Orpheus.

Accepts text packets from an input container and returns synthesized audio.

Includes memory-safe error handling for GPU-intensive workloads.
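
As a rough illustration of the flow described above (text packets in, audio packets out, with memory errors handled per packet), here is a minimal Python sketch. It does not use the sinapsis or Orpheus APIs: `TextPacket`, `AudioPacket`, `Container`, `fake_synthesize`, and `run_tts` are hypothetical names introduced only for this example, and the synthesizer is a stand-in that returns silence.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class TextPacket:
    """Hypothetical text packet; not the sinapsis data structure."""
    content: str


@dataclass
class AudioPacket:
    """Hypothetical audio packet holding raw samples."""
    samples: List[float]
    sample_rate: int


@dataclass
class Container:
    """Hypothetical container carrying text in and audio out."""
    texts: List[TextPacket] = field(default_factory=list)
    audios: List[AudioPacket] = field(default_factory=list)


def fake_synthesize(text: str, sample_rate: int = 24000) -> List[float]:
    """Stand-in for a TTS engine: 10 ms of silence per character.

    Very long inputs raise MemoryError to simulate an out-of-memory
    condition on a GPU-intensive workload.
    """
    if len(text) > 1000:
        raise MemoryError("simulated out-of-memory on very long input")
    return [0.0] * (len(text) * sample_rate // 100)


def run_tts(
    container: Container,
    synthesize: Callable[[str, int], List[float]] = fake_synthesize,
    sample_rate: int = 24000,
) -> Container:
    """Synthesize audio for every text packet in the container.

    A packet that triggers a memory error is skipped, so one oversized
    input does not abort the whole run.
    """
    for packet in container.texts:
        try:
            samples = synthesize(packet.content, sample_rate)
        except MemoryError:
            # Assumption: a real backend would also release cached
            # buffers here before moving on to the next packet.
            continue
        container.audios.append(AudioPacket(samples, sample_rate))
    return container
```

In this sketch a failed packet is dropped silently. Whether the actual template skips, retries, or reports the failure is not stated in these notes, so check the package README for its behavior.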

📄 See the full setup in the README.

Sinapsis Parakeet-TDT
Brings speech-to-text (STT) capabilities using NVIDIA’s Parakeet TDT 0.6B model.

ParakeetTDTInference

Transcribes audio input from containers or files.

Supports timestamp prediction.

Adds the resulting text packets back into the container.
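
The transcription flow above (audio in, timestamped text packets added back to the container) might look roughly like the following Python sketch. It does not call the Parakeet model or the sinapsis API: `WordStamp`, `TextPacket`, `Container`, `fake_transcribe`, and `run_stt` are hypothetical names, and the transcriber is a stand-in that emits one placeholder word per second of audio.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class WordStamp:
    """Hypothetical word-level timestamp, in seconds."""
    word: str
    start: float
    end: float


@dataclass
class TextPacket:
    """Hypothetical text packet with segment-level timestamps."""
    content: str
    start: float
    end: float


@dataclass
class Container:
    """Hypothetical container: raw audio in, text packets out."""
    audios: List[List[float]] = field(default_factory=list)
    texts: List[TextPacket] = field(default_factory=list)


def fake_transcribe(samples: List[float], sample_rate: int = 16000) -> List[WordStamp]:
    """Stand-in for an STT model: one placeholder word per full second."""
    seconds = len(samples) // sample_rate
    return [WordStamp(f"word{i}", float(i), float(i + 1)) for i in range(seconds)]


def run_stt(
    container: Container,
    transcribe: Callable[[List[float]], List[WordStamp]] = fake_transcribe,
) -> Container:
    """Transcribe each audio entry and append a text packet per entry."""
    for samples in container.audios:
        words = transcribe(samples)
        if not words:
            # Nothing recognized (for example, empty audio): add no packet.
            continue
        text = " ".join(w.word for w in words)
        # The segment spans from the first word's start to the last word's end.
        container.texts.append(TextPacket(text, words[0].start, words[-1].end))
    return container
```

Here the segment timestamps are derived from the first and last word. The real template may expose timestamps at a different granularity, which these notes do not specify.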

📄 See the full setup in the README.

ElevenLabs 2.0+ Support
The ElevenLabs templates are now compatible with ElevenLabs v2.0 and above, which brings improved voice fidelity and additional model options.

ElevenLabsTTS

Converts text to speech using ElevenLabs voice models.

ElevenLabsVoiceGeneration

Generates synthetic voices based on descriptions.

📄 Setup instructions available in the package README.
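
To check whether an environment meets the 2.0+ requirement before running the ElevenLabs templates, something like the following Python sketch could be used. It assumes that "ElevenLabs v2.0" refers to the version of the `elevenlabs` Python package on PyPI, which these notes do not state explicitly. The helper names `parse_major_minor`, `meets_minimum`, and `installed_sdk_is_supported` are hypothetical.

```python
import re
from importlib import metadata
from typing import Tuple


def parse_major_minor(version: str) -> Tuple[int, int]:
    """Extract (major, minor) from a version string such as '2.3.1'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def meets_minimum(version: str, minimum: Tuple[int, int] = (2, 0)) -> bool:
    """Return True if the version is at least the given (major, minor)."""
    return parse_major_minor(version) >= minimum


def installed_sdk_is_supported(package: str = "elevenlabs") -> bool:
    """Check the installed package version; False if it is not installed.

    Assumption: the distribution name is 'elevenlabs'.
    """
    try:
        return meets_minimum(metadata.version(package))
    except metadata.PackageNotFoundError:
        return False
```

Comparing integer tuples rather than raw strings avoids the pitfall where `"10.0"` sorts before `"2.0"` lexicographically.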

🔧 Full Package Overview
Sinapsis ElevenLabs – TTS + voice generation via ElevenLabs

Sinapsis F5-TTS – TTS with voice cloning

Sinapsis Kokoro – TTS with Kokoro 82M

Sinapsis Zonos – TTS and voice cloning using Zonos

Sinapsis Orpheus-CPP – NEW: TTS via Orpheus

Sinapsis Parakeet-TDT – NEW: STT via Parakeet TDT