Add ability to register algorithm passes #1377

hanlint · 2022-08-06T20:55:48Z

The order in which algorithms are run matters significantly during composition. For example, FusedLayerNorm must run after GatedLinearUnits (which adds layer norms), to ensure that all layer norms are converted into fused versions. To enforce these, we use algorithm passes, which operate on lists of algorithms.

This PR refactors these algorithm passes into its own passes module, and allows the user to register custom passes (for custom algorithms) into the Engine.

Also coming along for the ride are some readability and code quality improvements to the engine.

todos:

add tests

ravi-mosaicml

Thanks for putting this together. I like what this API is trying to accomplish -- agree we need a better API to sort the ordering of algorithms -- but wonder if it would be helpful to work through the design a bit more? My main questions with this API are:

Should an algorithm register a pass, or do we want this functionality to live outside of the algorithm? I was thinking for modularity it could be helpful for each algorithm to self-contain this information. But, this would not be possible via this API (nor via the status quo), since algorithms do not have access to the engine.
Would it be confusing to set the ordering of passes (via the index argument correctly? Mainly thinking if multiple (different) sources are both inserting passes -- then the order in which engine.register_pass is called is important.
Do we want a full function for algorithm sorting that would be opaque to the engine? Or would it be helpful to give the engine more visibility into the scheduling requirements with something like a DAG , where each algorithm could have a run_before(self, event) -> Sequence[Type[Algorithm]] and a run_after(self, event) -> Sequence[Type[Algorithm]] method. The engine could then rearrange algorithms (so long as it satisfies the DAG requirements) for optimal performance (e.g. when running with XLA and lazy execution).

I didn't review the code in detail; happy to do that if we would like to go with this design.

hanlint · 2022-08-08T16:13:05Z

Some passes may be cross-algorithms (for example, see the one warning about multiple interpolate_loss), so they should live outside of algorithms. This API actually does support an extension in the future. We could eventually add a register_pass method to the Algorithm that the Trainer uses this API to register passes.
The default behavior is to append at the end. index is for power users, as we are all consenting adults.
This sounds like pre-mature narrowing / redesign of the API. Let's retain flexibility until we know what kind of passes we want to do.

I dont think this warrants a full design discussion, this is refactoring the existing design for better extensibility and readability (rather than hiding all the algorithm passes inside _compile).

mvpatel2000

Interactions are messy between algorithms -- I don't think you can include per algorithm
Also don't love weird indexing, but fine leaving as a super user feature
We likely will need to rewrite this with a more complicated scheduling algorithm -- agree with many of the points Ravi said. With that said, I'm fine with this refactor (which is much better than current code imo) because I don't think we're at the point where we need to do a full design on algorithm DAGs -- Id punt this to later

composer/core/engine.py

hanlint added 2 commits July 26, 2022 09:50

use algorithm passes

cef255a

add docs

ed9f2bb

hanlint requested review from a team as code owners August 6, 2022 20:55

hanlint requested a review from mvpatel2000 August 6, 2022 21:10

add tests; change to instance variable

fb25957

hanlint changed the title ~~[WIP] Add ability to register algorithm passes~~ Add ability to register algorithm passes Aug 7, 2022

fix doctests

bf118c0

ravi-mosaicml reviewed Aug 8, 2022

View reviewed changes

mvpatel2000 approved these changes Aug 8, 2022

View reviewed changes

composer/core/engine.py Show resolved Hide resolved

composer/core/engine.py Show resolved Hide resolved

composer/core/engine.py Show resolved Hide resolved

hanlint merged commit af8bb17 into mosaicml:dev Aug 8, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ability to register algorithm passes #1377

Add ability to register algorithm passes #1377

hanlint commented Aug 6, 2022 •

edited

Loading

ravi-mosaicml left a comment

hanlint commented Aug 8, 2022 •

edited

Loading

mvpatel2000 left a comment

Add ability to register algorithm passes #1377

Add ability to register algorithm passes #1377

Conversation

hanlint commented Aug 6, 2022 • edited Loading

ravi-mosaicml left a comment

Choose a reason for hiding this comment

hanlint commented Aug 8, 2022 • edited Loading

mvpatel2000 left a comment

Choose a reason for hiding this comment

hanlint commented Aug 6, 2022 •

edited

Loading

hanlint commented Aug 8, 2022 •

edited

Loading