This guide covers the installation process for TTS Studio Python library.
- Python 3.10 or 3.11 (3.10 recommended)
- pip package manager
- Git
- 8GB+ RAM (16GB recommended for M1 Pro)
- GPU optional (MPS for Apple Silicon, CUDA for NVIDIA)
- Linux (Ubuntu 20.04+, Debian 11+)
- macOS (10.15+, M1/M2 recommended)
- Windows 10/11 (with WSL2 recommended)
- CPU: Modern multi-core processor
- RAM: Minimum 8GB, 16GB+ recommended
- Storage: 10GB+ free space (for models and cache)
- GPU:
- Apple Silicon M1/M2 with MPS (recommended for macOS)
- NVIDIA GPU with CUDA support (optional for Linux/Windows)
-
Clone the repository:
git clone https://github.com/bryanstevensacosta/tts-studio.git cd tts-studio -
Navigate to core library:
cd apps/core -
Run the automated setup script:
./setup.sh
This script will:
- Check Python version compatibility
- Create a virtual environment
- Install all dependencies
- Set up pre-commit hooks (for development)
- Verify the installation
-
Activate the virtual environment:
source venv/bin/activate -
Verify installation:
python -c "from api.studio import TTSStudio; print('TTS Studio installed successfully!')"
-
Clone the repository:
git clone https://github.com/bryanstevensacosta/tts-studio.git cd tts-studio/apps/core -
Create a virtual environment:
python3.10 -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate
-
Upgrade pip:
pip install --upgrade pip
-
Install the package:
pip install -e . -
Verify installation:
python -c "from api.studio import TTSStudio; print('Installation successful!')"
For contributors and developers:
# Clone and navigate to core library
git clone https://github.com/bryanstevensacosta/tts-studio.git
cd tts-studio/apps/core
# Create and activate virtual environment
python3.10 -m venv venv
source venv/bin/activate
# Install with development dependencies
pip install -e ".[dev]"
# Install pre-commit hooks
pre-commit install
pre-commit install --hook-type commit-msg
pre-commit install --hook-type pre-push
# Verify installation
make testFor Apple Silicon Macs, MPS (Metal Performance Shaders) is automatically supported:
-
Verify MPS availability:
import torch print(f"MPS available: {torch.backends.mps.is_available()}")
-
Configure for MPS in
apps/core/config/config.yaml:model: device: "mps" # or "auto" for automatic detection dtype: "float32" # Required for MPS stability
-
Performance: M1 Pro generates ~15-25 seconds per minute of audio
To enable GPU acceleration with CUDA (Linux/Windows):
-
Install CUDA Toolkit (11.8 or compatible version):
- Download from NVIDIA CUDA Toolkit
- Follow platform-specific installation instructions
-
Install PyTorch with CUDA support:
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
-
Verify GPU availability:
import torch print(f"CUDA available: {torch.cuda.is_available()}") print(f"GPU: {torch.cuda.get_device_name(0)}")
-
Configure for CUDA in
apps/core/config/config.yaml:model: device: "cuda" # or "auto" dtype: "float32"
If no GPU is available, TTS Studio will automatically use CPU:
# apps/core/config/config.yaml
model:
device: "cpu"
dtype: "float32"Note: CPU-only mode is significantly slower (~2-3 minutes per minute of audio).
The Qwen3-TTS model (~3.4GB) downloads automatically on first use:
from api.studio import TTSStudio
# First initialization downloads the model
studio = TTSStudio() # Downloads model to apps/core/data/models/
# Subsequent initializations use cached model
studio = TTSStudio() # Fast, uses cached modelDownload location: apps/core/data/models/Qwen3-TTS-12Hz-1.7B-Base/
Note: Ensure you have:
- Stable internet connection
- At least 10GB free disk space
- Patience (first download takes 10-15 minutes)
A native desktop application with modern UI is in development. The desktop app will:
- Download models on-demand (not included in installer)
- Store models in OS-specific user directories:
- macOS:
~/Library/Application Support/TTS Studio/models/ - Windows:
%LOCALAPPDATA%\TTS Studio\models\ - Linux:
~/.local/share/tts-studio/models/
- macOS:
- Allow users to manage models (download, delete, re-download)
- Provide visual interface for all features
Stay tuned for updates!
Issue: python3.10: command not found
- Solution: Install Python 3.10 using your system package manager or from python.org
Issue: pip install fails with permission errors
- Solution: Use a virtual environment (recommended) or add
--userflag to pip install
Issue: Model download fails
- Solution: Ensure stable internet connection and at least 10GB free disk space. The model (~3.4GB) downloads automatically on first use.
Issue: MPS/CUDA out of memory errors
- Solution: Use CPU mode by setting
device: "cpu"in config.yaml, or close other applications
Issue: Audio library errors (librosa, soundfile)
- Solution: Install system audio libraries:
- Ubuntu/Debian:
sudo apt-get install libsndfile1 ffmpeg - macOS:
brew install libsndfile ffmpeg
- Ubuntu/Debian:
Issue: Import errors after installation
- Solution: Ensure virtual environment is activated:
source venv/bin/activate
- Check the FAQ in the README
- Search existing issues
- Open a new issue with details
After installation, proceed to:
- Quick Start - Get started quickly
- Configuration - Configure TTS Studio
- API Documentation - Explore the API reference
To remove TTS Studio:
# Deactivate virtual environment
deactivate
# Remove the project directory
cd ../..
rm -rf tts-studioOr if you want to keep the repository but remove the virtual environment:
cd apps/core
rm -rf venv