Add NLP Algorithms Tests #1839

nik-mosaic · 2022-12-19T17:42:21Z

What does this PR do?

This PR updates many algorithms tests to run on NLP models, as guided by our NLP testing plan: https://www.notion.so/NLP-74e31fb2d8f0472e9fcd6fd98dd67686

The testing plan states that all algorithms that support NLP models should have tests that run on a HuggingFace BERT model and a Simple Transformer model.

Note: Tests for Stochastic Weight Averaging do not exist, but should be implemented by the original author. That is outside the scope of this PR (JIRA: CO-1611)

All other algorithms NLP algorithms are included in this PR. However, test_factorize_algorithm.py fails on some SimpleTransformer model tests. The Factorize algorithm needs to be improved to account for this. Included in this PR is updated documentation making clear its shortcoming (JIRA: CO-1612)

Additional PRs will include tests for exporting NLP models.

dakinggg

Mostly looks good, thanks! Left a bunch of minor comments. Also, for any of the follow up work you mention in the PR description, if you could make a JIRA and link it that would be great. Lastly, I think we still need to add some stuff for Alibi and SeqLengthWarmup in tests/algorithms/algorithm_settings.py. These are used for the tests that test all algorithms. Maybe this is coming in your export PR/commits though

tests/conftest.py

tests/algorithms/test_seq_length_warmup.py

tests/algorithms/test_gradient_clipping.py

…nto nikhil/nlp-testing

dakinggg

LGTM, left a couple small comments. Thanks!

tests/common/datasets.py

tests/algorithms/test_factorize_algorithm.py

nik-mosaic added 10 commits November 16, 2022 07:35

Add weight copying for fused layernorm

ca9fde7

Change from weight copy to zero/one init

fce8533

Merge branch 'mosaicml:dev' into dev

d5c0ba8

Merge branch 'mosaicml:dev' into dev

9e04286

Merge branch 'mosaicml:dev' into dev

5517a09

Revert FLN Update

1ceead1

Merge branch 'dev' of https://github.com/nik-mosaic/composer into dev

0b377ae

Add space

f14eefc

Merge branch 'mosaicml:dev' into dev

b71375a

Add NLP Algorithms tests

c1f0171

nik-mosaic requested a review from dakinggg December 19, 2022 17:42

nik-mosaic changed the title ~~Nikhil/nlp testing~~ Add NLP Algorithms Tests Dec 19, 2022

nik-mosaic added 3 commits December 19, 2022 09:45

Add HF model to more grad clipping tests

a747e46

Add HF model to more grad clipping tests

3883c3b

Add Gated Linear Units non-HF tests

9677939

dakinggg reviewed Dec 20, 2022

View reviewed changes

dakinggg and others added 14 commits December 23, 2022 15:55

merge

2db4755

move non fixtures out of fixtures section

b06e80d

move around fixtures pytest configure for hf stuff

b5f293f

add alibi and seqlengthwarmup settings

97de740

add GyroDropout test settings

032cb0d

tiny_bert -> configure_tiny_bert_hf_model

0820253

switch to 3 class

cfdf4d5

move xfail out of test

b9dc8cf

add missing importorskip

4a9ecf5

rename test

7885952

Merge branch 'dev' into nikhil/nlp-testing

68c9be9

Remove unused model_params arg

a97a095

fix linting

ddf4fe6

Add FLN test for non-HF models

e69b236

nik-mosaic and others added 18 commits January 4, 2023 03:50

Add device to arg

2ae76d5

Use device, add parameterize

6af2b08

Add LPLN non-HuggingFaceModel tests

f88b54d

Merge branch 'dev' into nikhil/nlp-testing

ed9f50e

Merge branch 'dev' into nikhil/nlp-testing

11f3ccd

Merge branch 'dev' into nikhil/nlp-testing

75d2c79

Merge branch 'dev' into nikhil/nlp-testing

bd8f4a9

Update NLP algorithms tests

a719d1d

update Factorize README

0d42f33

Removed modified test_inference from PR

3491bdf

Fix new algorithm settings

b545cc1

pyright

afff347

Merge branch 'dev' into nikhil/nlp-testing

f2711af

Fix some FSDP issues; still erroring on SimpleTransformerClassifier

b14fad5

Merge branch 'dev' into nikhil/nlp-testing

4d67573

Xfail transformer fsdp test

79b5eaa

Merge branch 'nikhil/nlp-testing' of github.com:nik-mosaic/composer i…

47ad788

…nto nikhil/nlp-testing

Merge branch 'dev' into nikhil/nlp-testing

88c3545

dakinggg approved these changes Jan 12, 2023

View reviewed changes

tests/common/datasets.py Outdated Show resolved Hide resolved

tests/algorithms/test_factorize_algorithm.py Show resolved Hide resolved

nik-mosaic added 4 commits January 17, 2023 10:09

Merge branch 'dev' into nikhil/nlp-testing

e7ed17d

One batch datasets, add comment

726580c

Rerun tests

44ce006

Merge branch 'dev' into nikhil/nlp-testing

a9cc978

mvpatel2000 approved these changes Jan 17, 2023

View reviewed changes

nik-mosaic marked this pull request as ready for review January 17, 2023 23:05

nik-mosaic requested a review from dskhudia as a code owner January 17, 2023 23:05

nik-mosaic merged commit c992304 into mosaicml:dev Jan 17, 2023

nik-mosaic deleted the nikhil/nlp-testing branch January 17, 2023 23:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add NLP Algorithms Tests #1839

Add NLP Algorithms Tests #1839

nik-mosaic commented Dec 19, 2022 •

edited

Loading

dakinggg left a comment

dakinggg left a comment

Add NLP Algorithms Tests #1839

Add NLP Algorithms Tests #1839

Conversation

nik-mosaic commented Dec 19, 2022 • edited Loading

What does this PR do?

dakinggg left a comment

Choose a reason for hiding this comment

dakinggg left a comment

Choose a reason for hiding this comment

nik-mosaic commented Dec 19, 2022 •

edited

Loading