Sinapsis-speech V0.4.0
The sinapsis-speech package continues to evolve, bringing broader compatibility and more powerful speech capabilities. This release introduces new integrations and support for cutting-edge TTS and STT models, along with extended support for ElevenLabs 2.0+.
🚀 New Integrations
Sinapsis Orpheus-CPP
Enables text-to-speech (TTS) using the Orpheus-TTS engine, providing high-quality neural voice synthesis.
OrpheusTTS
Converts text to speech using Orpheus.
Accepts text packets from an input container and returns synthesized audio.
Includes memory-safe error handling for GPU-intensive workloads.
📄 See the full setup in the README.
Sinapsis Parakeet-TDT
Brings speech-to-text (STT) capabilities using NVIDIA’s Parakeet TDT 0.6B model.
ParakeetTDTInference
Transcribes audio input from containers or files.
Supports timestamp prediction.
Adds the resulting text packets back into the container.
📄 See the full setup in the README.
ElevenLabs 2.0+ Support
We now offer seamless compatibility with ElevenLabs v2.0 and above, unlocking improved voice fidelity and additional model options.
ElevenLabsTTS
Text-to-speech using ElevenLabs voice models.
ElevenLabsVoiceGeneration
Generate synthetic voices based on descriptions.
📄 Setup instructions available in the package README.
🔧 Full Package Overview
Sinapsis ElevenLabs – TTS + voice generation via ElevenLabs
Sinapsis F5-TTS – TTS with voice cloning
Sinapsis Kokoro – TTS with Kokoro 82M
Sinapsis Zonos – TTS and voice cloning using Zonos
Sinapsis Orpheus-CPP – NEW: TTS via Orpheus
Sinapsis Parakeet-TDT – NEW: STT via Parakeet TDT