Low precision groupnorm #1976

mvpatel2000 · 2023-02-16T22:37:01Z

What does this PR do?

Adds low precision groupnorm. This is modeled after low precision layernorm and is useful for Stable Diffusion. We've seen about +5% throughput and memory savings from this, with no significant impact on loss curves.
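For readers unfamiliar with the idea, here is a minimal sketch of running GroupNorm in a reduced-precision dtype. It is an illustration only, not the code added in this PR: the class name `LowPrecisionGroupNormSketch` and the `compute_dtype` argument are hypothetical, and the choice of `torch.bfloat16` is an assumption made so the example runs on CPU.

```python
import torch
import torch.nn.functional as F


class LowPrecisionGroupNormSketch(torch.nn.GroupNorm):
    """Hypothetical illustration of a low precision GroupNorm (not the PR's class)."""

    def __init__(self, num_groups, num_channels, compute_dtype=torch.bfloat16, **kwargs):
        super().__init__(num_groups, num_channels, **kwargs)
        # Assumption: the caller picks the low-precision dtype explicitly.
        self.compute_dtype = compute_dtype

    def forward(self, x):
        # Cast activations and affine parameters down to the low-precision dtype.
        x_lp = x.to(self.compute_dtype)
        weight = self.weight.to(self.compute_dtype) if self.weight is not None else None
        bias = self.bias.to(self.compute_dtype) if self.bias is not None else None
        # Disable autocast locally so the inputs keep the dtype chosen above.
        with torch.autocast(device_type=x.device.type, enabled=False):
            return F.group_norm(x_lp, self.num_groups, weight, bias, self.eps)


torch.manual_seed(0)
norm = LowPrecisionGroupNormSketch(num_groups=2, num_channels=4)
x = torch.randn(2, 4, 8, 8)
y = norm(x)  # output stays in the low-precision dtype
```

The sketch keeps the parameters stored in float32 and only casts them on the fly, so optimizer state is unaffected. Whether this is numerically safe for a given model is exactly the convergence question raised in the discussion below.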

What issue(s) does this change relate to?

CO-1794

dskhudia · 2023-02-16T22:40:07Z

Thoughts on combining this with lowprecisionlayernorm and calling that lowPrecisionNorms?

mvpatel2000 · 2023-02-16T22:43:54Z

> Thoughts on combining this with lowprecisionlayernorm and calling that lowPrecisionNorms?

I would prefer to keep them separate because we don't have convergence guarantees for this, and there might be a case where one applies but the other does not. While we have never observed issues with this, problems are possible, given that AMP does not do this by default.

dskhudia · 2023-02-16T22:50:34Z

We can allow turning on/off each type based on an option.

nik-mosaic · 2023-02-16T23:00:15Z

Is this scriptable? Can you export a model with LowPrecisionGroupnorm using torch.jit.script?

dblalock

I have no context on this PR, so I just went through and found ways to polish it.

tests/common/models.py

tests/algorithms/test_low_precision_groupnorm.py

composer/algorithms/low_precision_groupnorm/low_precision_groupnorm.py

composer/algorithms/low_precision_groupnorm/README.md

composer/algorithms/low_precision_groupnorm/__init__.py


mvpatel2000 · 2023-02-24T00:42:05Z

Will switch to SimpleConvModel once #1991 merges

nik-mosaic

Two comments about the arguments and export. Once those are changed, I will approve.

composer/algorithms/low_precision_groupnorm/low_precision_groupnorm.py

mvpatel2000 · 2023-02-24T23:19:01Z

I need a different logo... any suggestions for an icon?

dblalock

basically LGTM, except maybe one test needs to say "GroupNorm" instead of "LayerNorm"

composer/algorithms/low_precision_groupnorm/README.md

composer/algorithms/low_precision_groupnorm/low_precision_groupnorm.py

composer/algorithms/low_precision_layernorm/README.md

composer/algorithms/low_precision_layernorm/low_precision_layernorm.py

tests/algorithms/test_low_precision_groupnorm.py

mvpatel2000 added 4 commits February 16, 2023 11:08

add lowprecisiongroupnorm

714ffda

fix headers

2864c68

add functional

dced13a

add prints

9de2b5e

mvpatel2000 requested review from dblalock, dskhudia and nik-mosaic as code owners February 16, 2023 22:37

add test settings

ded750b

dblalock reviewed Feb 17, 2023


mvpatel2000 added 5 commits February 22, 2023 20:17

add kwargs

6164bea

resolve 1/n davis comments

3cb870a

lots of comments

6d15a84

deprecated arg

f27ec7d

more comment fixes

c87bcec

mvpatel2000 requested a review from dblalock February 23, 2023 19:53

mvpatel2000 added 6 commits February 23, 2023 13:58

fix metadata

3bdcf91

Merge branch 'dev' into mvpatel2000/low-precision-groupnorm

c8b2ba6

add md file

c1e3bed

Merge branch 'mvpatel2000/low-precision-groupnorm' of github.com:mvpatel2000/composer into mvpatel2000/low-precision-groupnorm

3016193

update docs

6961595

tweaks

d57fa6a

mvpatel2000 added 4 commits February 23, 2023 16:48

reset

c059bf6

update readme

2954e3c

patch

8d3f556

fix change

b419c15

dskhudia approved these changes Feb 24, 2023


nik-mosaic suggested changes Feb 24, 2023


composer/algorithms/low_precision_groupnorm/low_precision_groupnorm.py (outdated)

composer/algorithms/low_precision_groupnorm/low_precision_groupnorm.py (outdated)

mvpatel2000 added 4 commits February 23, 2023 20:57

add export test

015d3d3

fix test

9be2d5a

add args

8792b38

remove test

5dec0b4

nik-mosaic approved these changes Feb 24, 2023


mvpatel2000 added 3 commits February 24, 2023 11:13

Merge branch 'dev' into mvpatel2000/low-precision-groupnorm

25c3dd7

Merge branch 'dev' into mvpatel2000/low-precision-groupnorm

09cceed

Merge branch 'dev' into mvpatel2000/low-precision-groupnorm

c09d6d1

update logo

1e25dca

dblalock approved these changes Feb 25, 2023


update based on comments

e812cd7

mvpatel2000 merged commit 228efab into mosaicml:dev Feb 25, 2023

mvpatel2000 deleted the mvpatel2000/low-precision-groupnorm branch February 25, 2023 05:28
