We currently also have the issue that if a rollout experiment has to restart from the latest checkpoint within an epoch, it goes through all samples again instead of only the remaining samples / iterations of that epoch. "Remaining samples" is of course not well defined because we shuffle, unless we save the random states and restore them.
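To make the "remaining samples" idea concrete, here is a minimal sketch of one possible approach. Instead of saving and restoring the random states, it derives the shuffle order deterministically from a base seed and the epoch number, so a restart can rebuild the same order and skip what was already consumed. The names `epoch_order`, `remaining_samples`, `base_seed` and `samples_seen` are hypothetical and not part of the current codebase, and the checkpoint is assumed to store the number of samples already seen.

```python
import random


def epoch_order(num_samples: int, base_seed: int, epoch: int) -> list[int]:
    """Return the shuffled sample order for one epoch.

    The order depends only on (base_seed, epoch), so it can be rebuilt
    after a restart without saving the RNG state itself.
    """
    # Hypothetical seeding scheme: any deterministic mix of seed and epoch works.
    rng = random.Random(base_seed * 1_000_003 + epoch)
    order = list(range(num_samples))
    rng.shuffle(order)
    return order


def remaining_samples(
    num_samples: int, base_seed: int, epoch: int, samples_seen: int
) -> list[int]:
    """Return the samples of this epoch that were not yet consumed.

    `samples_seen` is assumed to be stored in the checkpoint.
    """
    return epoch_order(num_samples, base_seed, epoch)[samples_seen:]


# Usage: a run that stopped after 4 of 10 samples in epoch 3 resumes with the other 6.
full = epoch_order(10, base_seed=42, epoch=3)
rest = remaining_samples(10, base_seed=42, epoch=3, samples_seen=4)
```

With several dataloader workers or model-parallel groups the bookkeeping would be more involved, so this only illustrates the principle.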
-
It would be nice to think about how the dataloader object can be adapted when the rollout changes. Currently, for example, the maximum rollout determines how much data is fetched, so excess data is loaded whenever the current rollout is shorter.
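As a rough illustration of the saving, the sketch below computes which time indices a sample would need for a given rollout. The function `fetch_indices` and its parameters `multistep_input` and `timeincrement` are hypothetical names chosen for this example and are not taken from the existing dataloader.

```python
def fetch_indices(
    start: int, multistep_input: int, rollout: int, timeincrement: int = 1
) -> list[int]:
    """Return the time indices needed for one sample.

    A sample needs `multistep_input` input steps followed by `rollout`
    target steps, each `timeincrement` apart.
    """
    n_steps = multistep_input + rollout
    return [start + i * timeincrement for i in range(n_steps)]


# Fetching for the maximum rollout (12) versus the current rollout (1):
max_fetch = fetch_indices(start=0, multistep_input=2, rollout=12)
current_fetch = fetch_indices(start=0, multistep_input=2, rollout=1)
excess = len(max_fetch) - len(current_fetch)  # time steps loaded but never used
```

If the dataloader could be told the current rollout, it would only need the shorter list. How to propagate that value to already running workers is the open design question.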
-
Our current rollout implementation is very focused on sequential epoch increments. It would be good to generalise this by providing schedulers to control rollout.
Work was done in aifs-mono to enable this (here). I think this can be generalised to provide more general applicability.
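To give the discussion something concrete to react to, below is a minimal sketch of what such a scheduler could look like. It is an assumption for illustration only, not the aifs-mono implementation: the class name `SteppedRollout` and its fields are hypothetical. The idea is that the trainer asks the scheduler for the rollout given the current epoch and step, rather than incrementing it itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SteppedRollout:
    """Hypothetical scheduler: raise rollout by `increment` every `every_n` units."""

    start: int = 1
    maximum: int = 12
    every_n: int = 1
    increment: int = 1
    unit: str = "epoch"  # count in "epoch" or in "step"

    def __post_init__(self) -> None:
        if self.unit not in ("epoch", "step"):
            raise ValueError(f"unit must be 'epoch' or 'step', got {self.unit!r}")
        if self.every_n < 1:
            raise ValueError("every_n must be at least 1")

    def rollout(self, epoch: int, step: int) -> int:
        """Return the rollout for the given training position, capped at `maximum`."""
        count = epoch if self.unit == "epoch" else step
        value = self.start + (count // self.every_n) * self.increment
        return min(value, self.maximum)


# Usage: the current behaviour (one more rollout step per epoch) is one special case,
# and a step-based schedule is another.
per_epoch = SteppedRollout(start=1, maximum=12, every_n=1, unit="epoch")
per_step = SteppedRollout(start=1, maximum=12, every_n=100, unit="step")
```

Because the rollout is a pure function of epoch and step, it can be recomputed after a restart without storing extra state. Other schedules (random, or driven by a validation metric) would need a richer interface.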
Features
Below is a list of features and requirements as I see them
Questions
What other features may be needed?
Discussion of #145
Points raised in Issue