[XPU] AdamW: fp16 for moment1/moment2 #62688

houj04 · 2024-03-13T08:14:32Z

PR types

New features

PR changes

OPs

Description

类似于 #57077 ，增加对“用fp16来存moment1和moment2”的支持。

使用方式和之前一样：目前仅在XPU下生效，通过export xpu_adamw_moment_dtype="fp16"来打开此功能。

paddle-bot · 2024-03-13T08:14:37Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

paddle-ci-bot · 2024-03-21T03:09:33Z

Sorry to inform you that 125b6a1's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.

HarperCy · 2024-03-27T04:51:41Z

paddle/phi/kernels/xpu/adamw_kernel.cc

+                                           moment1_output_for_xdnn,
+                                           buffer_for_findmax,
+                                           moment1_out->numel());
+    float moment1_scale_value = 65504.0f / moment1_max / 2.0f;


请教一下珏爷，这里固定的这个值不会有问题嘛

参考per tensor scale设计。内部文档就不往外面贴了。

lj970926

LGTM

* [XPU] AdamW: fp16 for moment1/moment2 on KL3 * fix function name typo.

[XPU] AdamW: fp16 for moment1/moment2 on KL3

125b6a1

houj04 added 3 commits March 25, 2024 13:10

Merge branch 'develop' into 20240313-adamw-fp16

996f362

Merge branch 'develop' into 20240313-adamw-fp16

9665a77

fix function name typo.

96c608f

HarperCy reviewed Mar 27, 2024

View reviewed changes

HarperCy approved these changes Mar 27, 2024

View reviewed changes

lj970926 approved these changes Mar 27, 2024

View reviewed changes

houj04 changed the title ~~[XPU] AdamW: fp16 for moment1/moment2 on KL3~~ [XPU] AdamW: fp16 for moment1/moment2 Mar 27, 2024

QingshuChen approved these changes Mar 28, 2024

View reviewed changes

QingshuChen merged commit d5863bf into PaddlePaddle:develop Mar 28, 2024
30 checks passed

co63oc pushed a commit to co63oc/Paddle that referenced this pull request Mar 29, 2024

[XPU] AdamW: fp16 for moment1/moment2 (PaddlePaddle#62688)

8f14fbf

* [XPU] AdamW: fp16 for moment1/moment2 on KL3 * fix function name typo.

houj04 mentioned this pull request Aug 6, 2024

[XPU] only save adamw scale_value when flag is on #67077

Merged

houj04 added the XPU label Sep 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[XPU] AdamW: fp16 for moment1/moment2 #62688

[XPU] AdamW: fp16 for moment1/moment2 #62688

houj04 commented Mar 13, 2024 •

edited

Loading

paddle-bot bot commented Mar 13, 2024

paddle-ci-bot bot commented Mar 21, 2024

HarperCy Mar 27, 2024

houj04 Mar 27, 2024

lj970926 left a comment

[XPU] AdamW: fp16 for moment1/moment2 #62688

[XPU] AdamW: fp16 for moment1/moment2 #62688

Conversation

houj04 commented Mar 13, 2024 • edited Loading

PR types

PR changes

Description

paddle-bot bot commented Mar 13, 2024

paddle-ci-bot bot commented Mar 21, 2024

HarperCy Mar 27, 2024

Choose a reason for hiding this comment

houj04 Mar 27, 2024

Choose a reason for hiding this comment

lj970926 left a comment

Choose a reason for hiding this comment

houj04 commented Mar 13, 2024 •

edited

Loading