[intel_hpu] initial commit for intel_hpu support #9273
Conversation
…t_intel_hpu_backend
Thanks for your contribution!
Codecov Report
Attention: Patch coverage is
Additional details and impacted files:

@@            Coverage Diff            @@
##           develop    #9273     +/-  ##
===========================================
+ Coverage    52.81%   52.91%    +0.09%
===========================================
  Files          673      673
  Lines       107657   107687       +30
===========================================
+ Hits         56857    56980      +123
+ Misses       50800    50707       -93

View full report in Codecov by Sentry.
bcaf5f3 to d9a3ad6
You could create a new llm/intel_hpu directory to hold the corresponding run examples and documentation.
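A minimal sketch of what a run example under llm/intel_hpu could look like. This is an illustration only: the script name, model id, set_device() target string, and generate() arguments are assumptions on my part, not code from this PR.

# llm/intel_hpu/run_generation.py -- hypothetical example script
import paddle
from paddlenlp.transformers import AutoModelForCausalLM, AutoTokenizer

# Select the Intel HPU custom device (assumes the device plugin is installed
# and registered under the name "intel_hpu").
paddle.set_device("intel_hpu")

model_name = "facebook/llama-7b"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

inputs = tokenizer("Hello from Intel HPU!", return_tensors="pd")
output_ids, _ = model.generate(**inputs, max_length=64)
print(tokenizer.batch_decode(output_ids, skip_special_tokens=True))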
@@ -248,7 +248,11 @@ def scaled_dot_product_attention(
    value_states = paddle.transpose(value_states, [0, 2, 1, 3])

    # matmul and divide by sqrt(head_dim)
    attn_weights = paddle.matmul(query_states / math.sqrt(head_dim), key_states.transpose([0, 1, 3, 2]))
    if get_env_device() == "intel_hpu":
        attn_weights = paddle.matmul(query_states * (1 / math.sqrt(head_dim)), key_states.transpose([0, 1, 3, 2]))
It would be best to add a comment here explaining why 1 / math.sqrt(head_dim) is needed.
Multiplication by an immediate constant performs better than division.
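A sketch of how the requested comment could look next to the intel_hpu branch; this mirrors the diff quoted above and is a suggestion, not the merged code (the helper name is mine).

import math
import paddle

def scaled_qk(query_states, key_states, head_dim, use_intel_hpu):
    # Both branches compute Q @ K^T scaled by 1/sqrt(head_dim); they differ
    # only in how the scale is applied.
    if use_intel_hpu:
        # On intel_hpu, multiplying by the constant 1/sqrt(head_dim) is faster
        # than dividing, so apply the scale as a multiplication.
        return paddle.matmul(query_states * (1 / math.sqrt(head_dim)), key_states.transpose([0, 1, 3, 2]))
    return paddle.matmul(query_states / math.sqrt(head_dim), key_states.transpose([0, 1, 3, 2]))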
if config.context_parallel_degree > 1:
    raise ValueError("Context parallel is not implemented for intel_hpu")
scaling_factor = query_states.shape[3] ** -0.5
attention_mask = attention_mask.astype("bfloat16")
Does intel_hpu support float16? facebook/llama-7b uses float16, while llama2 and llama3 use bfloat16.
This has been changed here to use q's dtype.
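If I read the follow-up correctly, the hard-coded bfloat16 cast becomes a cast to the query dtype, roughly as sketched below (the helper name is mine, not from the PR).

import paddle

def cast_mask_to_query_dtype(attention_mask, query_states):
    # Cast the mask to whatever dtype the query tensor uses, so float16
    # checkpoints (e.g. facebook/llama-7b) and bfloat16 checkpoints
    # (llama2/llama3) both work without a hard-coded bfloat16 cast.
    return attention_mask.astype(query_states.dtype)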
You can refer to:
LGTM
PR types
New features
PR changes
Models
PR Category
Custom Device
Description
Add the intel_hpu device to PaddleNLP, with support for fused RoPE, fused RMSNorm, fused SDPA, and device-specific dtype handling.
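As a rough sketch of what the device-specific dispatch looks like in practice: the pattern below routes RMSNorm through get_env_device(), with the intel_hpu fused kernel left as a placeholder because this conversation does not show the actual custom-op entry point (the import path of get_env_device is also an assumption).

import paddle
from paddlenlp.utils.tools import get_env_device  # import path assumed

def rms_norm_ref(hidden_states, weight, eps=1e-6):
    # Reference RMSNorm, computed in float32 for numerical stability.
    variance = hidden_states.astype("float32").pow(2).mean(-1, keepdim=True)
    normed = paddle.rsqrt(variance + eps) * hidden_states.astype("float32")
    return (normed * weight.astype("float32")).astype(hidden_states.dtype)

def rms_norm(hidden_states, weight, eps=1e-6):
    # Device-dispatch sketch: route to the backend's fused kernel when running
    # on intel_hpu, otherwise fall back to the reference implementation.
    if get_env_device() == "intel_hpu":
        # The PR wires in the intel_hpu fused RMSNorm here; kept as a placeholder
        # because the exact custom op is not reproduced in this thread.
        raise NotImplementedError("plug in the intel_hpu fused RMSNorm custom op")
    return rms_norm_ref(hidden_states, weight, eps)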