Towards New Trainer API #458

ppwwyyxx · 2017-10-28T10:25:27Z

New trainer API is pushed to master and examples are updated. I expect this to be the final API and stay stable, and if thing goes well we will go to tensorpack 1.0. This is an issue to track related problems.

What's new

New docs about this change are in Trainer, Training Interface, and Write a Trainer. Also API docs are changed.

Why

New trainer APIs are isolated from ModelDesc and TrainConfig, which arguably packs arbitrary training options together and is therefore not a good design. Some related discussion in #318 (comment) .
Now ModelDesc and TrainConfig are only used in wrappers on top of trainers, in order to keep trainer interface clean.

What will happen

Use export TENSORPACK_TRAIN_API=v2 to use the new API.
For backwards-compatibility, we will gradually go towards the new API.

(now ~ +1 month) v1 is still the default. Users should set v2 option manually in .bashrc, etc. All old code should run the same, because you'll import the old trainer. But all examples set the envvar to v2 and use v2 API.
(+1 month ~ +6 months) v2 will be the default. Old code can still run, due to some hacks to maintain compatibility.
(+6 months ~ ) Old trainer code may be cleaned up.

Also, new features such as easier Keras model training, horovod, will be only in v2.

What to do

Use v2 today for new code!

~~export TENSORPACK_TRAIN_API=v2 (not doing this and use v2 API directly should also work for single-cost training, but with warnings)~~
If you use old trainers, replace SomeTrainer(config, ...).train() with launch_train_with_config(config, SomeTrainer(...)).
If you use custom trainer, checkout the new docs, as well as the GAN trainer

The text was updated successfully, but these errors were encountered:

ppwwyyxx added the enhancement feature or enhancement label Nov 4, 2017

ppwwyyxx added a commit that referenced this issue Nov 29, 2017

Switch to trainer v2 by default. (#458)

11932e6

rizasif mentioned this issue Mar 17, 2018

Tensorpack Update and OpenCv dependency YixuanLi/densenet-tensorflow#16

Merged

ppwwyyxx closed this as completed May 5, 2018

jackylee1 mentioned this issue Nov 15, 2018

python train1.py timit -gpu 0,how do i do this without gpu? andabi/deep-voice-conversion#75

Open

YashBangera7 mentioned this issue Jan 30, 2019

Help on train2.py andabi/deep-voice-conversion#89

Closed

sallyjoy mentioned this issue Apr 28, 2019

STUCK at Train1.py", Line 60 : launch_train_with_config(train_conf, trainer=trainer) andabi/deep-voice-conversion#103

Open

vijaykumarvg1 mentioned this issue Oct 2, 2019

Error while using halfsqeezenet fpgasystems/spooNN#28

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Towards New Trainer API #458

Towards New Trainer API #458

ppwwyyxx commented Oct 28, 2017 •

edited

Loading

Towards New Trainer API #458

Towards New Trainer API #458

Comments

ppwwyyxx commented Oct 28, 2017 • edited Loading

What's new

Why

What will happen

What to do

ppwwyyxx commented Oct 28, 2017 •

edited

Loading