Fix LayerNorm Problem #33420
Conversation
Thanks for your contribution!
…hile large data input
@@ -952,7 +977,7 @@ class LayerNormGradKernel<platform::CUDADeviceContext, T>
  const auto begin_norm_axis = ctx.Attr<int>("begin_norm_axis");
  auto matrix_dim = framework::flatten_to_2d(x_dims, begin_norm_axis);
  int batch_size = static_cast<int>(matrix_dim[0]);
Does batch_size also need to be changed to int64_t?
done
Suggestion: the file permissions should still be changed back to 644.
@@ -898,7 +923,7 @@ class LayerNormKernel<platform::CUDADeviceContext, T>
  auto matrix_dim = framework::flatten_to_2d(x_dims, begin_norm_axis);
  int batch_size = static_cast<int>(matrix_dim[0]);
Does batch_size also need to be changed to int64_t?
done
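
For context, a minimal standalone sketch of the index-width issue behind this change (illustrative only, not PaddlePaddle code; the sizes are hypothetical). flatten_to_2d collapses the input to [batch_size, feature_size], and the product of the two dimensions can exceed INT_MAX even when each dimension fits comfortably in an int:

#include <cstdint>
#include <cstdio>

int main() {
  // Hypothetical dims: each fits in int, but their product is 2^32.
  int64_t batch_size = 65536;
  int64_t feature_size = 65536;

  int64_t wide = batch_size * feature_size;  // 4294967296, exact in int64_t
  int narrow = static_cast<int>(wide);       // truncates to 0 on two's-complement

  // Any element offset or loop bound computed in 32 bits silently wraps here,
  // which is the motivation for widening the kernel sizes/indices to int64_t.
  printf("int64_t product: %lld\n", static_cast<long long>(wide));
  printf("int product:     %d\n", narrow);
  return 0;
}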
@@ -952,7 +977,7 @@ class LayerNormGradKernel<platform::CUDADeviceContext, T>
  const auto begin_norm_axis = ctx.Attr<int>("begin_norm_axis");
  auto matrix_dim = framework::flatten_to_2d(x_dims, begin_norm_axis);
  int batch_size = static_cast<int>(matrix_dim[0]);
- int feature_size = static_cast<int>(matrix_dim[1]);
+ int64_t feature_size = static_cast<int64_t>(matrix_dim[1]);
  LayerNormBackward<T, U>(x_data, d_y_data, scale_data, mean_data, var_data,
This PR mainly optimizes the forward computation; optimizing the backward computation can be considered in the future.
OK
LGTM
LGTM
PR types
Bug fixes
PR changes
OPs
Describe
Optimize the LayerNorm computation.
Also fix the following issues:
DLTP-25782 [Bug]
DLTP-26400 [Bug]
Fix the bug where LayerNorm outputs NaN for large-magnitude inputs (a sketch of the underlying numerical issue follows this list).
DLTP-28680 [Bug]
Fix the bug where LayerNorm computes the BlockDim incorrectly when the normalized dimension length is < kMaxBlockDim.
Fix the bug where LayerNorm outputs 0 for large input shapes.
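
On the NaN fix: computing the variance as E[x^2] - (E[x])^2 loses all precision once the inputs are large, and the subtraction can come out wildly wrong or slightly negative, after which 1/sqrt(var) produces Inf/NaN. Below is a minimal sketch of that failure mode and of a numerically stable alternative (Welford's one-pass update); it is illustrative only and is not claimed to be the exact algorithm this PR uses:

#include <cmath>
#include <cstdint>
#include <cstdio>

// Naive variance: var = E[x^2] - E[x]^2. In float, the two terms are huge and
// nearly equal for large inputs, so the subtraction cancels catastrophically.
float naive_var(const float* x, int64_t n) {
  float sum = 0.f, sq_sum = 0.f;
  for (int64_t i = 0; i < n; ++i) {
    sum += x[i];
    sq_sum += x[i] * x[i];
  }
  float mean = sum / n;
  return sq_sum / n - mean * mean;  // catastrophic cancellation here
}

// Welford's one-pass update: accumulates squared deviations from the running
// mean, so no two large, nearly equal terms are ever subtracted.
float welford_var(const float* x, int64_t n) {
  float mean = 0.f, m2 = 0.f;
  for (int64_t i = 0; i < n; ++i) {
    float delta = x[i] - mean;
    mean += delta / (i + 1);
    m2 += delta * (x[i] - mean);  // uses the updated mean
  }
  return m2 / n;  // population variance, as LayerNorm uses
}

int main() {
  const int64_t n = 4;
  float x[n] = {1e6f, 1e6f + 1, 1e6f + 2, 1e6f + 3};  // large-magnitude inputs
  float v1 = naive_var(x, n), v2 = welford_var(x, n);
  // The naive result is far from the true 1.25 and may be <= 0, in which
  // case 1/sqrt(var) in a normalization kernel becomes Inf/NaN.
  printf("naive:   var=%f -> rsqrt=%f\n", v1, 1.f / sqrtf(v1));
  printf("welford: var=%f -> rsqrt=%f\n", v2, 1.f / sqrtf(v2));
  return 0;
}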