
[NPU]Custom fusion operator unification #8431

Merged: 34 commits merged into develop on May 14, 2024

Conversation

Galaxy1458 (Contributor) commented on May 13, 2024

PR types

Others

PR changes

Models

Description

Custom fusion operator unification
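The unification described here moves device-specific fused kernels behind shared fusion_* helpers. As a rough illustration only, below is a minimal sketch of that dispatch pattern using an RMSNorm-style op as the example; the registry, the import path for get_env_device, and the fallback math are my assumptions, not the merged code.

```python
# Sketch (assumed, not the PR's code) of the unification pattern: every
# fusion_* helper lives in one shared module and picks a device-specific
# fused kernel at call time, falling back to plain Paddle ops elsewhere.
import paddle
from paddlenlp.utils.tools import get_env_device  # assumed import path

# Registry of device -> fused kernel; entries are hypothetical placeholders
# that an NPU build would fill in once its custom ops have been loaded.
_FUSED_RMS_NORM = {}


def fusion_rms_norm(hidden_states, weight, variance_epsilon):
    fused = _FUSED_RMS_NORM.get(get_env_device())
    if fused is not None:
        # Device-specific fused kernel (e.g. an NPU custom op).
        return fused(hidden_states, weight, variance_epsilon)
    # Portable fallback built from standard Paddle ops.
    variance = hidden_states.astype("float32").pow(2).mean(-1, keepdim=True)
    normed = paddle.rsqrt(variance + variance_epsilon) * hidden_states.astype("float32")
    return (normed * weight.astype("float32")).astype(hidden_states.dtype)
```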

paddle-bot bot commented May 13, 2024

Thanks for your contribution!

codecov bot commented May 13, 2024

Codecov Report

Attention: Patch coverage is 23.52941%, with 65 lines in your changes missing coverage. Please review.

Project coverage is 55.42%. Comparing base (17fb497) to head (0a6d6b8).
Report is 1 commit behind head on develop.

Files                                        Patch %   Missing lines
paddlenlp/transformers/llama/fusion_ops.py   22.50%    62 ⚠️
paddlenlp/transformers/llama/modeling.py     40.00%    3 ⚠️
Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #8431      +/-   ##
===========================================
- Coverage    55.43%   55.42%   -0.01%     
===========================================
  Files          616      617       +1     
  Lines        96243    96281      +38     
===========================================
+ Hits         53348    53366      +18     
- Misses       42895    42915      +20     


Galaxy1458 changed the title from "fix" to "[NPU]Custom fusion operator unification" on May 13, 2024
SylarTiaNII (Contributor) left a comment

LGTM

wawltor (Collaborator) previously approved these changes on May 14, 2024 and left a comment

LGTM

flash_attention = None


def fusion_rope(query_states, key_states, value_states, hidden_states, position_ids, past_key_value, rotary_emb):
Collaborator commented:

For long functions like fusion_rope and fusion_flash_attention, I would not recommend extracting them.

Galaxy1458 (Contributor, Author) replied:

paddlenlp/transformers/fusion_ops.py has been moved to paddlenlp/transformers/llama/fusion_ops.py.
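For reference, here is a rough sketch of how the fusion_rope signature quoted above could dispatch between a fused NPU rotary kernel and a portable rotate-half fallback. Everything beyond the parameter list is assumed: the (cos, sin) interface of rotary_emb, the extra npu_fused_rope handle, and the fallback math are illustrative, not the merged implementation.

```python
# Sketch only: device dispatch for rotary position embedding inside
# llama/fusion_ops.py, with names beyond the quoted signature assumed.
import paddle
from paddlenlp.utils.tools import get_env_device  # assumed import path


def _rotate_half(x):
    # Standard LLaMA rotary helper: rotate the two halves of the last dimension.
    x1, x2 = paddle.chunk(x, 2, axis=-1)
    return paddle.concat([-x2, x1], axis=-1)


def fusion_rope(query_states, key_states, value_states, hidden_states,
                position_ids, past_key_value, rotary_emb, npu_fused_rope=None):
    # rotary_emb is assumed to behave like a LLaMA rotary-embedding module and
    # return (cos, sin) caches; position_ids and past_key_value are kept only
    # to mirror the quoted signature.
    cos, sin = rotary_emb(value_states, seq_len=hidden_states.shape[1])
    if get_env_device() == "npu" and npu_fused_rope is not None:
        # npu_fused_rope stands in for a fused rotary custom op registered from
        # CUSTOM_DEVICE_ROOT; its name and calling convention are assumptions.
        return npu_fused_rope(query_states, key_states, cos, sin)
    # Portable fallback: plain Paddle ops, assuming cos/sin broadcast against
    # [batch, seq, heads, head_dim] states.
    query_states = query_states * cos + _rotate_half(query_states) * sin
    key_states = key_states * cos + _rotate_half(key_states) * sin
    return query_states, key_states
```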

@@ -81,14 +80,16 @@ def swiglu(x, y=None):

try:
    if get_env_device() == "npu":
        from paddle.base import core

        for lib in os.listdir(os.getenv("CUSTOM_DEVICE_ROOT")):
            if lib.endswith(".so"):
                paddle.utils.cpp_extension.extension_utils.load_op_meta_info_and_register_op(lib)
Collaborator commented:

Take a careful look at whether any of this code is unneeded, and make sure to delete it.
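Relatedly, here is a hedged sketch of a slightly more defensive version of the loading loop from the hunk above: it guards against CUSTOM_DEVICE_ROOT being unset on non-NPU machines and joins the directory back onto the bare file names that os.listdir returns. This is one possible tightening, not the code that was merged.

```python
# Sketch: register NPU custom fused operators shipped as .so files under
# CUSTOM_DEVICE_ROOT, skipping cleanly when the variable is not set.
import os

import paddle

custom_device_root = os.getenv("CUSTOM_DEVICE_ROOT")
if custom_device_root and os.path.isdir(custom_device_root):
    for lib in os.listdir(custom_device_root):
        if lib.endswith(".so"):
            # Pass the full path rather than the bare file name.
            paddle.utils.cpp_extension.extension_utils.load_op_meta_info_and_register_op(
                os.path.join(custom_device_root, lib)
            )
```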

wawltor merged commit 05acad5 into PaddlePaddle:develop on May 14, 2024
8 of 11 checks passed