Use our own Linear layer for easier tie_weights; Fix resize_token_embeddings API #5623
Conversation
Thanks for your contribution!
@@ -94,12 +94,14 @@ def __init__(
        layer_norm_epsilon: float = 1e-05,
        initializer_range: float = 0.02,
        n_inner: int = None,
+       tie_word_embeddings: bool = False,
Most models default tie_weights to True, but CodeGen's tie_word_embeddings is False, so it has to be specified explicitly here; otherwise resize_word_embeddings runs into problems.
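For context, a minimal sketch of why the flag matters when resizing. It assumes Paddle as the framework and uses hypothetical attribute names (`model.embeddings`, `model.lm_head`); it is not the implementation added by this PR. With tied embeddings the resized table must stay shared with the output projection; with untied embeddings, as in CodeGen, the output projection has to be resized separately.

```python
import paddle
import paddle.nn as nn

def resize_token_embeddings(model, new_num_tokens, tie_word_embeddings):
    # Sketch only: model.embeddings / model.lm_head are illustrative names.
    old_weight = model.embeddings.weight              # shape [old_vocab, hidden]
    old_vocab, hidden = old_weight.shape

    new_embeddings = nn.Embedding(new_num_tokens, hidden)
    with paddle.no_grad():
        # keep the rows that already exist; extra rows keep their fresh init
        new_embeddings.weight[:old_vocab] = old_weight
    model.embeddings = new_embeddings

    if tie_word_embeddings:
        # Tied case: the LM head must keep pointing at the *same* parameter,
        # which is simple when its weight uses the embedding's [vocab, hidden]
        # layout (what a project-owned Linear layer makes possible).
        model.lm_head.weight = new_embeddings.weight
    else:
        # Untied case (CodeGen): the LM head owns a separate weight of shape
        # [hidden, vocab] (paddle.nn.Linear layout) and must be resized too.
        new_head = nn.Linear(hidden, new_num_tokens)
        with paddle.no_grad():
            new_head.weight[:, :old_vocab] = model.lm_head.weight
        model.lm_head = new_head
    return model
```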
Codecov Report
@@            Coverage Diff             @@
##           develop    #5623      +/-   ##
===========================================
- Coverage    59.47%    59.45%    -0.03%
===========================================
  Files          482       483        +1
  Lines        68105     68187       +82
===========================================
+ Hits         40506     40541       +35
- Misses       27599     27646       +47
... and 10 files with indirect coverage changes
LGTM
PR types
PR changes
Description
Use our own Linear layer so tie_weights can share the embedding weight directly, and fix the resize_token_embeddings API so it respects tie_word_embeddings (which CodeGen sets to False).
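As a rough illustration of the "own Linear layer" idea (a sketch assuming Paddle; the class name and signatures are hypothetical, not the code in this PR): storing the weight in the same [vocab_size, hidden_size] layout as nn.Embedding lets tie_weights share a single parameter object instead of transposing or copying it.

```python
import paddle
import paddle.nn as nn

class TransposedLinear(nn.Layer):
    """Sketch of a project-owned Linear whose weight is stored like an
    embedding table ([vocab_size, hidden_size]), so weight tying is a plain
    parameter assignment."""

    def __init__(self, hidden_size, vocab_size):
        super().__init__()
        self.weight = self.create_parameter(shape=[vocab_size, hidden_size])
        self.bias = self.create_parameter(shape=[vocab_size], is_bias=True)

    def forward(self, hidden_states):
        # logits = hidden_states @ weight^T + bias
        return paddle.matmul(hidden_states, self.weight, transpose_y=True) + self.bias


# Tying then becomes sharing one parameter object:
vocab_size, hidden_size = 32000, 1024          # illustrative sizes only
embedding = nn.Embedding(vocab_size, hidden_size)
lm_head = TransposedLinear(hidden_size, vocab_size)
lm_head.weight = embedding.weight              # tie_weights in one assignment
```

With this layout, resize_token_embeddings only needs to grow the embedding table and re-point the head at it, rather than rebuilding a transposed copy.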