FastSpeech-Pytorch

The Implementation of FastSpeech Based on Pytorch.

Update (2020/07/20)

Optimize the training process.
Optimize the implementation of length regulator.
Use the same hyper parameter as FastSpeech2.
The measures of the 1, 2 and 3 make the training process 3 times faster than before.
Better speech quality.

Model

My Blog

Prepare Dataset

Download and extract LJSpeech dataset.
Put LJSpeech dataset in data.
Unzip alignments.zip.
Put Nvidia pretrained waveglow model in the waveglow/pretrained_model and rename as waveglow_256channels.pt;
Run python3 preprocess.py.

Training

Run python3 train.py.

Evaluation

Run python3 eval.py.

Notes

In the paper of FastSpeech, authors use pre-trained Transformer-TTS model to provide the target of alignment. I didn't have a well-trained Transformer-TTS model so I use Tacotron2 instead.
I use the same hyper-parameter as FastSpeech2.
The examples of audio are in sample.
pretrained model.

Name		Name	Last commit message	Last commit date
Latest commit History 272 Commits
audio		audio
data		data
img		img
sample		sample
text		text
transformer		transformer
waveglow		waveglow
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
alignments.zip		alignments.zip
dataset.py		dataset.py
eval.py		eval.py
glow.py		glow.py
hparams.py		hparams.py
loss.py		loss.py
model.py		model.py
modules.py		modules.py
optimizer.py		optimizer.py
preprocess.py		preprocess.py
requirements.txt		requirements.txt
train.py		train.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FastSpeech-Pytorch

Update (2020/07/20)

Model

My Blog

Prepare Dataset

Training

Evaluation

Notes

Reference

Repository

Paper

About

Releases

Packages

Contributors 3

Languages

License

xcmyz/FastSpeech

Folders and files

Latest commit

History

Repository files navigation

FastSpeech-Pytorch

Update (2020/07/20)

Model

My Blog

Prepare Dataset

Training

Evaluation

Notes

Reference

Repository

Paper

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages