
Correct attention mask dtype for Flax GPT2 #25636

Merged (6 commits into huggingface:main, Aug 25, 2023)

Conversation

@liutianlin0121 (Contributor) commented on Aug 21, 2023

What does this PR do?

Fixes #25634
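
For context, the Flax GPT2 generation path builds its extended attention mask as int32 ("i4"), so a user-supplied boolean mask has to be cast before being merged in with lax.dynamic_update_slice, which requires matching dtypes. Below is a minimal sketch of the idea behind the fix (illustrative only, not the verbatim diff from the modelling file):

```python
# Sketch of the dtype handling in prepare_inputs_for_generation (illustrative; the real
# change lives in modeling_flax_gpt2.py). The extended mask is int32, so a boolean
# attention_mask must be cast before lax.dynamic_update_slice, which rejects mixed dtypes.
import jax.numpy as jnp
from jax import lax

def build_extended_attention_mask(attention_mask, batch_size, max_length):
    extended_attention_mask = jnp.ones((batch_size, max_length), dtype="i4")
    if attention_mask is not None:
        # Casting to "i4" is the essence of the fix: without it, a bool mask makes
        # dynamic_update_slice fail with a dtype mismatch during generate().
        extended_attention_mask = lax.dynamic_update_slice(
            extended_attention_mask, attention_mask.astype("i4"), (0, 0)
        )
    return extended_attention_mask
```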

Before submitting

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@sanchit-gandhi
@ArthurZucker

@liutianlin0121 changed the title from "Correct attention mask dtype" to "Correct attention mask dtype for Flax GPT2" on Aug 21, 2023
@ArthurZucker (Collaborator) left a comment


Nice catch! Could you maybe add a test in the test_modelling_flax_gpt2.py to make sure this is tested? 😉 (taking inspiration from your minimal reproducer!)
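
For reference, the reproducer would look roughly like the following (a hypothetical snippet assuming the public gpt2 checkpoint, not copied verbatim from the issue):

```python
# Hypothetical reproducer: a boolean attention mask passed to generate() on Flax GPT2.
# Before this fix, the call fails inside prepare_inputs_for_generation with a dtype
# mismatch; after it, bool and integer masks behave the same.
import jax.numpy as jnp
from transformers import AutoTokenizer, FlaxGPT2LMHeadModel

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = FlaxGPT2LMHeadModel.from_pretrained("gpt2")

inputs = tokenizer("Hello there", return_tensors="np")
input_ids = jnp.asarray(inputs["input_ids"])
bool_mask = jnp.asarray(inputs["attention_mask"], dtype=bool)  # bool dtype triggers the bug

outputs = model.generate(input_ids, attention_mask=bool_mask, max_new_tokens=5)
print(tokenizer.batch_decode(outputs.sequences, skip_special_tokens=True))
```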

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@liutianlin0121 (Contributor, Author)

@ArthurZucker Sure! I added a test :-)

@sanchit-gandhi (Contributor) left a comment


Modelling code changes LGTM - would be great to make the test a fast test by defining a tester function in the model tester, and then executing it in the model test

Review comment on tests/models/gpt2/test_modeling_flax_gpt2.py (outdated, resolved)
@liutianlin0121 (Contributor, Author) commented on Aug 22, 2023

would be great to make the test a fast test by defining a tester function in the model tester, and then executing it in the model test

@sanchit-gandhi Good point! Done. Let me know if you have further suggestions. :-)
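
The pattern being referred to looks roughly like the skeleton below: the check lives on the model tester and the unittest case just calls it, so it runs on a tiny random-weight config instead of a pretrained checkpoint. Class and method names here are simplified assumptions, not the merged test.

```python
# Skeleton of the "fast test" pattern (simplified; not the merged code). The tester
# holds a tiny config and the assertion logic; the test class only wires it up.
import unittest

import jax.numpy as jnp
from transformers import GPT2Config, FlaxGPT2LMHeadModel


class FlaxGPT2ModelTester:
    def __init__(self, parent):
        self.parent = parent
        # Tiny random-weight config so the test stays fast.
        self.config = GPT2Config(
            vocab_size=99, n_positions=64, n_embd=32, n_layer=2, n_head=2,
            bos_token_id=0, eos_token_id=2, pad_token_id=1,
        )

    def prepare_config_and_inputs(self):
        input_ids = jnp.array([[5, 6, 7, 8, 1, 1]], dtype="i4")
        attention_mask = jnp.array([[1, 1, 1, 1, 0, 0]], dtype="i4")
        return self.config, input_ids, attention_mask

    def check_bool_attention_mask_in_generation(self, config, input_ids, attention_mask):
        model = FlaxGPT2LMHeadModel(config)
        int_out = model.generate(input_ids, attention_mask=attention_mask, max_new_tokens=3).sequences
        bool_out = model.generate(input_ids, attention_mask=attention_mask.astype(bool), max_new_tokens=3).sequences
        # A boolean mask must generate exactly the same tokens as the equivalent integer mask.
        self.parent.assertTrue((int_out == bool_out).all())


class FlaxGPT2ModelTest(unittest.TestCase):
    def setUp(self):
        self.model_tester = FlaxGPT2ModelTester(self)

    def test_bool_attention_mask_in_generation(self):
        config, input_ids, attention_mask = self.model_tester.prepare_config_and_inputs()
        self.model_tester.check_bool_attention_mask_in_generation(config, input_ids, attention_mask)
```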

@ArthurZucker (Collaborator) left a comment


Thanks for adding a test! 🤗

@ArthurZucker (Collaborator)

cc @sanchit-gandhi feel free to merge if it's alright with you!

@liutianlin0121 (Contributor, Author)

@sanchit-gandhi Hey, thanks! I changed it to assertTrue.

@sanchit-gandhi (Contributor) left a comment


Awesome - thanks @liutianlin0121!

@liutianlin0121 (Contributor, Author)

No problem! Feel free to merge it (it seems that I can't).

@ArthurZucker merged commit 0040469 into huggingface:main on Aug 25, 2023
@liutianlin0121 deleted the attention_mask branch on August 25, 2023 at 15:44
parambharat pushed a commit to parambharat/transformers that referenced this pull request Sep 26, 2023
* Correct attention mask dtype

* reformat code

* add a test for boolean mask

* convert test to fast test

* delete unwanted print

* use assertTrue for testing
blbadger pushed a commit to blbadger/transformers that referenced this pull request Nov 8, 2023
EduardoPach pushed a commit to EduardoPach/transformers that referenced this pull request Nov 18, 2023
Successfully merging this pull request may close these issues.

Problem caused by boolean attention mask in pretrained_model.generate of Flax GPT2