The initial, regular Tacotron model was trained first on LJSpeech, and then on a heavily modified version of the Ellen McClain dataset (all non-Portal 2 voice lines removed, punctuation added). The Forward Tacotron model was only trained on about 600 voice lines. The HiFiGAN model was generated through transfer learning from the sample. All models have been optimized and quantized.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Files

README.md

Latest commit

History

README.md

File metadata and controls