Add logic for shifting labels before computing metrics #1913

alextrott16 · 2023-01-26T20:54:49Z

What does this PR do?

Adds a shift_labels init argument to HuggingFaceModel class. This instructs the model whether to shift labels by one token before computing metrics, which mimics the way HF Causal LM classes handle labels when computing loss. This fixes the current implementation, which never does this shifting and produces incorrect metric results for Causal LMs.

If shift_labels is not specified, HuggingFaceModel will try to infer the correct behavior based on whether the model is an instance of a registered HF Causal LM class (or a subclass of one).

What issue(s) does this change relate to?

https://mosaicml.atlassian.net/browse/CO-1691

Before submitting

Have you read the contributor guidelines?
Is this change a documentation change or typo fix? If so, skip the rest of this checklist.
Was this change discussed/approved in a GitHub issue first? It is much more likely to be merged if so.
Did you update any related docs and document your change?
Did you update any related tests and add any new tests related to your change? (see testing)
Did you run the tests locally to make sure they pass?
Did you run pre-commit on your change? (see the pre-commit section of prerequisites)

dakinggg

Can you add a test please? Otherwise LGTM. Will approve after test

composer/models/huggingface.py

…mposer into alex/causal-roll-labels-1691

Typo fix Co-authored-by: Daniel King <43149077+dakinggg@users.noreply.github.com>

dakinggg

LGTM, left one comment to cleanup the fixture usage a bit (sorry, i pointed you to the wrong place for the usage you wanted)

tests/models/test_hf_model.py

…mposer into alex/causal-roll-labels-1691

tests/common/models.py

alextrott16 added 3 commits January 26, 2023 12:32

Add logic for shifting labels before computing metrics

c272bd5

Minor tweak

6a4cd32

Minor tweak

026fbfe

alextrott16 requested review from dakinggg and bmosaicml January 26, 2023 20:54

alextrott16 requested a review from a team as a code owner January 26, 2023 20:54

Merge branch 'dev' into alex/causal-roll-labels-1691

bd0746d

dakinggg reviewed Jan 26, 2023

View reviewed changes

composer/models/huggingface.py Outdated Show resolved Hide resolved

alextrott16 and others added 4 commits January 26, 2023 14:05

Add tests

baf361b

Merge branch 'alex/causal-roll-labels-1691' of github.com:mosaicml/co…

443e1ac

…mposer into alex/causal-roll-labels-1691

Update composer/models/huggingface.py

5637803

Typo fix Co-authored-by: Daniel King <43149077+dakinggg@users.noreply.github.com>

Merge branch 'dev' into alex/causal-roll-labels-1691

4d38521

dakinggg approved these changes Jan 26, 2023

View reviewed changes

tests/models/test_hf_model.py Outdated Show resolved Hide resolved

alextrott16 added 2 commits January 26, 2023 16:17

Add test changes and clean up hf_model test

6464fd4

Merge branch 'alex/causal-roll-labels-1691' of github.com:mosaicml/co…

d067955

…mposer into alex/causal-roll-labels-1691

dakinggg reviewed Jan 27, 2023

View reviewed changes

tests/common/models.py Outdated Show resolved Hide resolved

Add missing try/excepts

e8edda4

alextrott16 merged commit b33c381 into dev Jan 27, 2023

alextrott16 deleted the alex/causal-roll-labels-1691 branch January 27, 2023 00:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add logic for shifting labels before computing metrics #1913

Add logic for shifting labels before computing metrics #1913

alextrott16 commented Jan 26, 2023 •

edited

Loading

dakinggg left a comment

dakinggg left a comment

Add logic for shifting labels before computing metrics #1913

Add logic for shifting labels before computing metrics #1913

Conversation

alextrott16 commented Jan 26, 2023 • edited Loading

What does this PR do?

What issue(s) does this change relate to?

Before submitting

dakinggg left a comment

Choose a reason for hiding this comment

dakinggg left a comment

Choose a reason for hiding this comment

alextrott16 commented Jan 26, 2023 •

edited

Loading