Fix sequence expand op #11618

ktlichkid · 2018-06-21T02:48:27Z

Sequence expand op's GPU grad kernel implementation is not robust enough if memory optimizer is on.

The GPU kernel directly computed the sum of gradient without checking the initial value in d_x tensor.

In this PR, I moved the "set zero" function outside the functor to guarantee d_x is set to zero both on CPU and GPU.

kuke · 2018-06-26T12:47:47Z

paddle/fluid/operators/sequence_expand_op.h

@@ -151,8 +151,8 @@ struct SequenceExpandGradFunctor<platform::CPUDeviceContext, T> {
      const framework::Vector<size_t>& x_lod,   /*expand source lod*/
      const framework::Vector<size_t>& ref_lod, /*expand referenced lod*/
      LoDTensor* dx) {
-    math::SetConstant<platform::CPUDeviceContext, T> set_zero;
-    set_zero(context, dx, static_cast<T>(0));
+    // math::SetConstant<platform::CPUDeviceContext, T> set_zero;


Please remove these two lines

wanghaoshuang

Great job! Actually, the sequence expand op may also give wrong gradient value even memory optimizer is off.

kuke · 2018-06-27T02:43:30Z

@wanghaoshuang Please have a test on the attention-based OCR model to make sure that this change solves the problem.

reyoung

Excellent! thanks!

ktlichkid added 12 commits June 21, 2018 02:42

Alloc memory for output grad

1381f9a

Merge branch 'develop' into fix-seqexp

b561ab8

Try tensor copy to fix

5b92a63

Merge remote-tracking branch 'upstream/develop' into fix-seqexp

e40babd

Set temp tensor to zero

b10f958

Merge branch 'fix-seqexp' of github.com:ktlichkid/Paddle into fix-seqexp

6ce2e0c

param fix

3527d30

clean up

16629be

rm temp tensor, set g_x to 0

2b09a6e

Merge branch 'fix-seqexp' of github.com:ktlichkid/Paddle into fix-seqexp

5883078

clean up

ecca7a9

Set zero outside functor

2f79823

ktlichkid requested review from reyoung, kuke and wanghaoshuang June 26, 2018 12:08

ktlichkid changed the title ~~[WIP] Fix sequence expand op~~ Fix sequence expand op Jun 26, 2018

kuke reviewed Jun 26, 2018

View reviewed changes

Remove comment

8cea236

wanghaoshuang approved these changes Jun 27, 2018

View reviewed changes

Merge remote-tracking branch 'upstream/develop' into fix-seqexp

bd0e414

reyoung approved these changes Jun 27, 2018

View reviewed changes

ktlichkid merged commit 8630ba2 into PaddlePaddle:develop Jun 27, 2018

ktlichkid deleted the fix-seqexp branch June 27, 2018 05:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix sequence expand op #11618

Fix sequence expand op #11618

ktlichkid commented Jun 21, 2018 •

edited

Loading

kuke Jun 26, 2018

ktlichkid Jun 27, 2018

wanghaoshuang left a comment

kuke commented Jun 27, 2018

reyoung left a comment

Fix sequence expand op #11618

Fix sequence expand op #11618

Conversation

ktlichkid commented Jun 21, 2018 • edited Loading

kuke Jun 26, 2018

Choose a reason for hiding this comment

ktlichkid Jun 27, 2018

Choose a reason for hiding this comment

wanghaoshuang left a comment

Choose a reason for hiding this comment

kuke commented Jun 27, 2018

reyoung left a comment

Choose a reason for hiding this comment

ktlichkid commented Jun 21, 2018 •

edited

Loading