FastSpeech2

Project on text-to-speech. This rep contains my implementation of FastSpeech2 model and all the steps to reimplement the pipeline

How to install?

Make sure to follow this guide

git clone https://github.com/aizamaksutova/FastSpeech2.git
cd FastSpeech2
pip install -r requirements.txt

How to inference?

First, you should create a file (e.g. texts.txt) with all the phrases you want to reproduce as wavs. The format of .txt file is the same as the file inference.txt

Then perform these:

chmod a+x prepare_inf.sh
./prepare_inf.sh   #downloading all the needed models
python3 test.py -c config.json -r final_model.pth -f inference.txt

Afterwards, you will see the results in the results/ directory

How to train the model by yourself?

In order to train the model you would need to perform simple steps, but wait for a long time for them to actually download all the data + manually perform all the pitch and energy in advance

chmod a+x prepare_data.sh
./prepare_data.sh
python3 prepare_pitch_energy.py #prepare pitch and energy
python3 train.py -c hw_tts/configs/config.json

Wandb report

Here is the link to my wandb report with all the architecture explanation and wavs + graphs

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
__pycache__		__pycache__
_assets		_assets
hw_tts		hw_tts
test_data		test_data
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
config.json		config.json
glow.py		glow.py
inference.txt		inference.txt
prepare_data.sh		prepare_data.sh
prepare_inf.sh		prepare_inf.sh
prepare_pitch_energy.py		prepare_pitch_energy.py
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FastSpeech2

How to install?

How to inference?

How to train the model by yourself?

Wandb report

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

FastSpeech2

How to install?

How to inference?

How to train the model by yourself?

Wandb report

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages