
Finetuning trained model #75

Closed · aleksas opened this issue May 19, 2018 · 3 comments

aleksas (Contributor) commented May 19, 2018

With reference to the recommendation, how should training be run a second time?
Should the checkpoint path point to checkpoints/checkpoint_step...._ema.pth or to the non-EMA checkpoint? And what does "ema" stand for?

r9y9 (Owner) commented May 19, 2018

EMA stands for Exponential Moving Average; see the Tacotron 2 paper for details: https://arxiv.org/abs/1712.05884. There's no clear answer as to which one to use for fine-tuning, but I usually used the EMA version of the checkpoint when I had trained a model for a sufficient time (e.g., over 2 days in the MoL case).
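
Roughly speaking, EMA keeps a "shadow" copy of the model weights that is updated as a running average after each optimizer step, and the _ema.pth checkpoints store those averaged weights. A minimal sketch of the idea in PyTorch (illustrative only; the class and method names here are hypothetical, not this repository's actual implementation):

import torch

class ExponentialMovingAverage:
    def __init__(self, model, decay=0.9999):
        self.decay = decay
        # Shadow parameters start as a copy of the current weights.
        self.shadow = {name: p.detach().clone()
                       for name, p in model.named_parameters()
                       if p.requires_grad}

    @torch.no_grad()
    def update(self, model):
        # Called after each optimizer step:
        # shadow <- decay * shadow + (1 - decay) * current
        for name, p in model.named_parameters():
            if name in self.shadow:
                self.shadow[name].mul_(self.decay).add_(
                    p.detach(), alpha=1 - self.decay)

    @torch.no_grad()
    def copy_to(self, model):
        # Load the averaged weights into a model before saving an EMA checkpoint.
        for name, p in model.named_parameters():
            if name in self.shadow:
                p.copy_(self.shadow[name])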

Here are some commands I used for the LJSpeech experiments, which might help:

  • Fine-tuning a trained model:
python train.py --data-root=./data/ljspeech/ \
  --checkpoint-dir=./checkpoints_mixture \
  --log-event-path=./log/wavenet_mixture \
  --restore-parts=./pretrained_models/20180127_mixture_lj_checkpoint_step000410000_ema.pth
  • Resuming after training was accidentally aborted (my PC rebooted):
python train.py --data-root=./data/ljspeech/ \
  --checkpoint-dir=./checkpoints_mixture \
  --log-event-path=./log/wavenet_mixture \
  --checkpoint=./checkpoints_mixture/checkpoint_step000256701.pth

The two commands above are what I actually used to train the pre-trained model.
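
Roughly, --checkpoint resumes the full training state (optimizer and global step included), whereas --restore-parts starts a fresh run initialized from the matching model weights in the given checkpoint, which is the fine-tuning case. A minimal sketch of that kind of partial weight loading (illustrative only; the function name is hypothetical, and it assumes a checkpoint dict with a "state_dict" key, which may not match this repository exactly):

import torch

def restore_parts(checkpoint_path, model):
    checkpoint = torch.load(checkpoint_path, map_location="cpu")
    state = checkpoint["state_dict"]
    model_dict = model.state_dict()
    # Keep only entries whose names and shapes match the current model,
    # so slightly different architectures can still share weights.
    filtered = {k: v for k, v in state.items()
                if k in model_dict and v.shape == model_dict[k].shape}
    model_dict.update(filtered)
    model.load_state_dict(model_dict)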

aleksas (Contributor, Author) commented May 19, 2018

Thanks! I'll try to fine-tune my model now.

d2sys commented Aug 8, 2018

Can anyone share the minimum number of iterations that is enough for fine-tuning?
