[LoRA] add quick_lora #8106

JunnYu · 2024-03-13T03:13:59Z

PR types

New features

PR changes

APIs

Description

优化lora的前向和反向计算。
非动态padding、短文本的情况下，提速约为3%。

已知缺陷：

lora dropout 必须设置为0

paddle-bot · 2024-03-13T03:14:04Z

Thanks for your contribution!

codecov · 2024-03-13T03:42:22Z

Codecov Report

Attention: Patch coverage is 29.93197% with 103 lines in your changes are missing coverage. Please review.

Project coverage is 55.41%. Comparing base (f005084) to head (749b419).

Files	Patch %	Lines
paddlenlp/peft/lora/lora_quick_layers.py	28.12%	69 Missing ⚠️
paddlenlp/peft/lora/lora_layers.py	28.88%	32 Missing ⚠️
paddlenlp/peft/lora/lora_config.py	66.66%	2 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #8106      +/-   ##
===========================================
- Coverage    55.44%   55.41%   -0.03%     
===========================================
  Files          596      597       +1     
  Lines        91464    91587     +123     
===========================================
+ Hits         50713    50754      +41     
- Misses       40751    40833      +82

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

…t be set to True to prevent any potential errors from occurring.

gongel · 2024-03-19T05:37:21Z

冲突了

paddlenlp/peft/lora/lora_quick_layers.py

lugimzzz · 2024-03-21T05:02:13Z

paddlenlp/peft/lora/lora_quick_layers.py

+        input_grad = None
+
+        if not input.stop_gradient:
+            input_grad = paddle.addmm(


求input_grad是不是可以考虑使用merged_weight，input_grad= paddle.matmul(grad_output, merged_weight, transpose_y=True)

merged_weight这个东西没法从前向复用，复用会占用很大的显存。
然后如果合并计算的话，就无法复用 lora_B_input_grad = paddle.matmul(grad_output, lora_B, transpose_y=True)。需要重新计算一次

lugimzzz

lgtm

quick_lora

122ae02

JunnYu requested review from lugimzzz and gongel March 13, 2024 03:14

JunnYu added 5 commits March 13, 2024 11:52

add use_quick_lora

9ad51be

update

7e2ecb7

update

67991ba

update

f2c8553

update

353595e

gongel previously approved these changes Mar 14, 2024

View reviewed changes

When lora and use_quick_lora are enabled, recompute_use_reentrant mus…

f6094f8

…t be set to True to prevent any potential errors from occurring.

JunnYu dismissed gongel’s stale review via f6094f8 March 18, 2024 02:59

Merge branch 'develop' into add_quick_lora

7ac87a6

lugimzzz reviewed Mar 21, 2024

View reviewed changes

JunnYu added 3 commits March 21, 2024 15:54

update code

7dc0d98

update

2699423

Merge branch 'develop' into add_quick_lora

7cc348f

JunnYu requested review from lugimzzz and gongel March 21, 2024 09:03

lugimzzz approved these changes Mar 21, 2024

View reviewed changes

gongel approved these changes Mar 21, 2024

View reviewed changes

Merge branch 'PaddlePaddle:develop' into add_quick_lora

749b419

wawltor merged commit d577e19 into PaddlePaddle:develop Mar 25, 2024
7 of 10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LoRA] add quick_lora #8106

[LoRA] add quick_lora #8106

JunnYu commented Mar 13, 2024 •

edited

Loading

paddle-bot bot commented Mar 13, 2024

codecov bot commented Mar 13, 2024 •

edited

Loading

gongel commented Mar 19, 2024

lugimzzz Mar 21, 2024

JunnYu Mar 21, 2024

lugimzzz left a comment

[LoRA] add quick_lora #8106

[LoRA] add quick_lora #8106

Conversation

JunnYu commented Mar 13, 2024 • edited Loading

PR types

PR changes

Description

paddle-bot bot commented Mar 13, 2024

codecov bot commented Mar 13, 2024 • edited Loading

Codecov Report

gongel commented Mar 19, 2024

lugimzzz Mar 21, 2024

Choose a reason for hiding this comment

JunnYu Mar 21, 2024

Choose a reason for hiding this comment

lugimzzz left a comment

Choose a reason for hiding this comment

JunnYu commented Mar 13, 2024 •

edited

Loading

codecov bot commented Mar 13, 2024 •

edited

Loading