Egyptian-Text-To-Speech

EGTTS V0.1

EGTTS V0.1 is a cutting-edge text-to-speech (TTS) model specifically designed for Egyptian Arabic. Built on the XTTS v2 architecture, it transforms written Egyptian Arabic text into natural-sounding speech, enabling seamless communication in various applications such as voice assistants, educational tools, and chatbots.

Try It Out

✨ Experience the magic of EGTTS V0.1 live! Try the model directly through this HuggingFace Space.

Download The Model

📥 Get the model now! Download EGTTS V0.1 directly from HuggingFace.

Quick Start

Dependencies to install

pip install git+https://github.com/coqui-ai/TTS

pip install transformers

pip install deepspeed

Inference

Load the model

import os
import torch
import torchaudio
from TTS.tts.configs.xtts_config import XttsConfig
from TTS.tts.models.xtts import Xtts

CONFIG_FILE_PATH = 'path/to/config.json'
VOCAB_FILE_PATH = 'path/to/vocab.json'
MODEL_PATH = 'path/to/model'
SPEAKER_AUDIO_PATH = 'path/to/speaker.wav'

print("Loading model...")
config = XttsConfig()
config.load_json(CONFIG_FILE_PATH)
model = Xtts.init_from_config(config)
model.load_checkpoint(config, checkpoint_dir=MODEL_PATH, use_deepspeed=True, vocab_path=VOCAB_FILE_PATH)
model.cuda()

print("Computing speaker latents...")
gpt_cond_latent, speaker_embedding = model.get_conditioning_latents(audio_path=[SPEAKER_AUDIO_PATH])

Run the model

from IPython.display import Audio, display

text = "صباح الخير"
print("Inference...")
out = model.inference(
    text,
    "ar",
    gpt_cond_latent,
    speaker_embedding,
    temperature=0.75,
)

AUDIO_OUTPUT_PATH = "path/to/output_audio.wav"
torchaudio.save("xtts_audio.wav", torch.tensor(out["wav"]).unsqueeze(0), 24000)
display(Audio(AUDIO_OUTPUT_PATH, autoplay=True))

Citation

@misc{omarsamir,
      author = {Omar Samir, Youssef Waleed, Youssef Tamer ,and Amir Mohamed},
      title = {Fine-Tuning XTTS V2 for Egyptian Arabic},
      year = {2024},
      url = {https://github.com/joejoe03/Egyptian-Text-To-Speech},
}

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
egtts_v0_1		egtts_v0_1
modules		modules
LICENSE		LICENSE
README.md		README.md
main.py		main.py
utils.py		utils.py
xtts_inference.ipynb		xtts_inference.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Egyptian-Text-To-Speech

EGTTS V0.1

Try It Out

Download The Model

Quick Start

Dependencies to install

Inference

Load the model

Run the model

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Egyptian-Text-To-Speech

EGTTS V0.1

Try It Out

Download The Model

Quick Start

Dependencies to install

Inference

Load the model

Run the model

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages