# OneCycleCosine

Implements, for PyTorch, the modified version of Leslie Smith's 1cycle policy described by Sylvain Gugger and Jeremy Howard. Like the FastAI version it uses cosine annealing, but it has three phases instead of two:

| phase | default | description |
| --- | --- | --- |
| warmup | 30% | `lr_min -> lr_max`, `momentum_max -> momentum_min` |
| plateau | 0% | holds `lr_max`, `momentum_min`; spends more time looking for an optimal minimum |
| winddown | 70% | `lr_max -> lr_max / 24e4`, `momentum_min -> momentum_max` |

Phases 1 and 3 are the same as phases 1 and 2 of the FastAI 1cycle policy. Phase 2 is described in the FastAI blog post.
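As a sketch of how the three-phase schedule above can be computed: the phase fractions and the final `lr_max / 24e4` divisor come from the table, while the function name and signature here are illustrative, not the library's API.

```python
import math

def one_cycle_cosine_lr(t, lr_min, lr_max, warmup=0.3, plateau=0.0):
    """Learning rate at training fraction t in [0, 1] under a
    three-phase 1cycle schedule with cosine annealing.

    Illustrative sketch only; the real scheduler steps a torch optimizer.
    """
    winddown = 1.0 - warmup - plateau
    if t < warmup:
        # Phase 1: cosine-anneal lr_min -> lr_max
        p = t / warmup
        return lr_max + (lr_min - lr_max) * (1 + math.cos(math.pi * p)) / 2
    if t < warmup + plateau:
        # Phase 2: hold lr_max
        return lr_max
    # Phase 3: cosine-anneal lr_max -> lr_max / 24e4
    p = (t - warmup - plateau) / winddown
    lr_end = lr_max / 24e4
    return lr_end + (lr_max - lr_end) * (1 + math.cos(math.pi * p)) / 2
```

The momentum schedule mirrors this shape in the opposite direction, annealing `momentum_max -> momentum_min` during warmup and back up during winddown.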

## Usage

- `OneCycleCosine` should be used with optimizers that expose a `momentum` parameter (e.g. SGD)
- `OneCycleCosineAdam` should be used with Adam-based optimizers that expose a `betas` parameter tuple
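A minimal sketch of that dispatch rule, using plain dicts as stand-ins for a torch optimizer's `param_groups`; the helper `pick_scheduler` is hypothetical, not part of the package.

```python
def pick_scheduler(optimizer):
    """Return the scheduler class name matching the optimizer's hyperparameters.

    `optimizer` is a stand-in dict shaped like a torch optimizer: it carries
    a "param_groups" list whose entries hold the tunable hyperparameters.
    """
    groups = optimizer["param_groups"]
    if all("betas" in g for g in groups):     # Adam-style: (beta1, beta2) tuple
        return "OneCycleCosineAdam"
    if all("momentum" in g for g in groups):  # SGD-style: scalar momentum
        return "OneCycleCosine"
    raise ValueError("optimizer exposes neither 'betas' nor 'momentum'")

# Stand-ins mimicking SGD and Adam param groups
sgd_like = {"param_groups": [{"lr": 0.1, "momentum": 0.9}]}
adam_like = {"param_groups": [{"lr": 1e-3, "betas": (0.9, 0.999)}]}
```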

## References