Add declarative trainer #924

Minyus · 2020-04-15T01:41:44Z

Fixes # 912

Description:
Higher API for training will improve usability of Ignite.

Check list:

New tests are added (if a new feature is added)
[x ] New doc strings: description and/or example code are in RST format
Documentation is updated (if required)

sdesrozis · 2020-04-15T08:23:23Z

@Minyus Thank you very much for this PR !!

I make you my feedback on your proposal quickly :)

vfdev-5 · 2020-04-15T15:24:16Z

@Minyus thanks a lot for the PR ! I really appreciate your contribution, however it would be great to discuss more about the API. We already stated some general remarks on the API in the related issue #912

In my opinion, class NetworkTrain has very specific integrations with MLflow which can be a limitation for some users. Another point is about exposing trainer in the higher API in case of adding more user-specific handlers.

please, let's discuss more about the API in #912.

vfdev-5 · 2020-04-17T13:21:47Z

@Minyus can we work out this at first, please ?

class TimeLimit:
    def __init__(self, limit_sec=3600):
        self.limit_sec = limit_sec
        self.start_time = time.time()

    def __call__(self, engine):
        elapsed_time = time.time() - self.start_time
        if elapsed_time > self.limit_sec:
            log.warning("Reached the time limit: {} sec. Stop training".format(self.limit_sec))
            engine.terminate()

This looks good enough. Maybe, in addition, we can set Checkpoint or ModelCheckpoint to save the work before terminating ?

Idea is to put it directly into ignite/handlers/time_limit.py

Minyus · 2020-04-17T13:39:48Z

@vfdev-5
OK, I'll move TimeLimit to ignite/handlers/time_limit.py.
Let's add saving before terminating to ToDo list as it needs further consideration (e.g. what if model checkpoint path is not specified, etc.)

vfdev-5 · 2020-04-17T15:00:40Z

@vfdev-5
OK, I'll move TimeLimit to ignite/handlers/time_limit.py.
Let's add saving before terminating to ToDo list as it needs further consideration (e.g. what if model checkpoint path is not specified, etc.)

@Minyus thanks, but it would be much better to split this PR into several ones. One PR by feature.
So, please, send the code of TimeLimit into a new PR. It would be nice to follow https://github.com/pytorch/ignite/blob/master/CONTRIBUTING.md
Thanks

Minyus · 2020-04-19T03:23:09Z

As discussed in Slack, this PR is suspended.

Meanwhile, please feel free to send PR (including adding documentation, test, example, etc.) to PipelineX so that I can send a complete PR to Ignite later on.

higher API for training
https://github.com/Minyus/pipelinex/blob/master/src/pipelinex/ops/ignite/declaratives/declarative_trainer.py

MNIST example to use higher API
https://github.com/Minyus/pipelinex/blob/master/examples/mnist/mnist_with_declarative_trainer.py

TimeLimit handler
https://github.com/Minyus/pipelinex/blob/master/src/pipelinex/ops/ignite/handlers/time_limit.py

Cohen Kappa Score metric
https://github.com/Minyus/pipelinex/blob/master/src/pipelinex/ops/ignite/metrics/cohen_kappa_score.py

Please also feel free to copy any code in PipelineX to use in PyTorch Ignite.
I'm happy to see Ignite to be enhanced in any way.

sdesrozis · 2020-04-19T10:30:18Z

Thank you again and see you soon !!

Add declarative trainer

1b3e37b

Format comments

ae126be

Minyus added 3 commits April 16, 2020 23:46

Remove FlexibleModelCheckpoint

0763db0

Split train_params

4c780d7

Reorder parameters

ab56e8c

vfdev-5 marked this pull request as draft April 17, 2020 13:19

Minyus added 4 commits April 17, 2020 14:01

Add TimeLimit handler

db53076

Import TimeLimit from ignite.handlers

24ef2b5

Add train_dataset_size_limit and val_dataset_size_limit

b805152

Add torch.backends.cudnn parameters

9a5ba4e

Minyus and others added 2 commits April 17, 2020 15:08

Remove unused kwarg

112d059

Merge branch 'master' into master

82dfa1f

vfdev-5 closed this Apr 19, 2020

vfdev-5 mentioned this pull request Feb 22, 2021

add cohen kappa in contrib.metrics module #1673

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add declarative trainer #924

Add declarative trainer #924

Minyus commented Apr 15, 2020

sdesrozis commented Apr 15, 2020

vfdev-5 commented Apr 15, 2020

vfdev-5 commented Apr 17, 2020 •

edited

Loading

Minyus commented Apr 17, 2020

vfdev-5 commented Apr 17, 2020 •

edited

Loading

Minyus commented Apr 19, 2020

sdesrozis commented Apr 19, 2020

Add declarative trainer #924

Add declarative trainer #924

Conversation

Minyus commented Apr 15, 2020

sdesrozis commented Apr 15, 2020

vfdev-5 commented Apr 15, 2020

vfdev-5 commented Apr 17, 2020 • edited Loading

Minyus commented Apr 17, 2020

vfdev-5 commented Apr 17, 2020 • edited Loading

Minyus commented Apr 19, 2020

sdesrozis commented Apr 19, 2020

vfdev-5 commented Apr 17, 2020 •

edited

Loading

vfdev-5 commented Apr 17, 2020 •

edited

Loading