[bf16] add bf16 kernel: layer_norm p_norm reduce_sum #39843
Conversation
Thanks for your contribution!
LGTM
LGTM for op benchmark
LGTM
PR types
New features
PR changes
OPs
Describe
Add bf16 kernels for:
layer_norm
p_norm
reduce_sum
A performance test was run on LayerNorm: in the embed_dim=1024 scenario, the compute performance of call_1024_kernel with the bf16 data type was analyzed (see PR39247 for the call_1024_kernel optimization strategy).
layer_norm forward and backward time cost:
cost_time fp32: 0.01763439178466797s
cost_time bf16 use 1024 kernel: 0.007885456085205078s
cost_time bf16 no use 1024 kernel: 0.008244991302490234s
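For reference, the kind of measurement reported above can be sketched with a minimal timing harness. This is an assumption, not the PR's actual benchmark script: it implements a plain layer_norm forward pass in NumPy and times repeated runs (NumPy has no native bf16 dtype, so only the fp32 path is shown; the real benchmark would call Paddle's layer_norm with bfloat16 tensors on GPU).

```python
# Hypothetical benchmark sketch -- NOT the PR's actual test script.
import time
import numpy as np

def layer_norm(x, gamma, beta, eps=1e-5):
    """Normalize over the last axis (embed_dim), then scale/shift."""
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return gamma * (x - mean) / np.sqrt(var + eps) + beta

def time_layer_norm(dtype=np.float32, batch=4096, embed_dim=1024, iters=10):
    """Return total wall-clock time for `iters` forward passes."""
    x = np.random.rand(batch, embed_dim).astype(dtype)
    gamma = np.ones(embed_dim, dtype=dtype)
    beta = np.zeros(embed_dim, dtype=dtype)
    start = time.time()
    for _ in range(iters):
        layer_norm(x, gamma, beta)
    return time.time() - start

if __name__ == "__main__":
    print(f"cost_time fp32: {time_layer_norm(np.float32)}s")
```

The real comparison in this PR additionally covers the backward pass and the specialized call_1024_kernel code path, which this sketch does not model.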