[oneDNN] lookup_table op with support for BF16 data type. #31558

arogowie-intel · 2021-03-11T13:40:51Z

PR types

New features

PR changes

OPs

Describe

This PR adds support for BF16 data type in lookup_table op for both forward and backward pass.

…okup

arogowie-intel · 2021-03-11T13:41:38Z

@jczaja @wozna @wojtuss @arlesniak @lidanqing-intel Please start your review.

python/paddle/fluid/tests/unittests/test_lookup_table_bf16_op.py

…okup

wozna

Well done python tests :)
LGTM

luotao1 · 2021-03-16T02:44:30Z

paddle/fluid/operators/lookup_table_op.h

@@ -102,7 +102,8 @@ class LookupTableKernel : public framework::OpKernel<T> {
            auto id_index = table_t.GetIndexFromId(ids[i]);

            if (id_index != -1) {
-              if (input_data_type == framework::proto::VarType::INT8) {
+              if (input_data_type == framework::proto::VarType::INT8 ||
+                  input_data_type == framework::proto::VarType::BF16) {


I wonder why do you change lookup_table_op but not lookup_table_v2_op?

That's because in the model "word2vec" there is lookup_table op used. Moreover paddle.fluid.layers.embedding Py API creates lookup_table op.

luotao1 · 2021-03-16T02:56:30Z

python/paddle/fluid/tests/unittests/op_test.py

@@ -33,10 +33,19 @@
 from paddle.fluid.op import Operator
 from paddle.fluid.executor import Executor
 from paddle.fluid.framework import Program, OpProtoHolder, Variable
-from testsuite import create_op, set_input, append_input_output, append_loss_ops
+from paddle.fluid.tests.unittests.testsuite import (


why change this file?

Since this is the recommended Python style for importing multiple modules from single package PEP328 and generally it looks more readable. Moreover this way you can have multiline import statement.

arogowie-intel · 2021-03-16T15:41:32Z

@luotao1 regarding issues rised in PR-CI-APPROVAL:

@unittest.skipIf(not core.supports_bfloat16(),

I need to use this decorator, since otherwise the CI configurations with GPU will throw errors about no kernel registered for this datatype for GPU.
| self.check_output_with_place(core.CPUPlace(), check_dygraph=False)

The check_dygraph=False is needed to pass tests. The BF16 data type is not supported in dygraph mode.
| max_relative_error=1.5e-2,

Usage of BF16 data type might incur slightly lower accuracy in comparison to FP32.
| @skip_check_grad_ci(

This is analogous as in tests here - citing: "gradient of paddings makes no sense."

…okup

arlesniak

Good job!

arogowie-intel added 14 commits February 25, 2021 10:09

Add CBlas VCOPY specialization for bfloat16.

ae78893

Enable bfloat16 kernel for lookup_table and grad.

dee2551

Add first UT for lookup_table bf16

d92b60b

Merge remote-tracking branch 'upstream/develop' into aosewski/bf16_lo…

e53b6c5

…okup

Merge remote-tracking branch 'upstream/develop' into aosewski/bf16_lo…

c6e8522

…okup

Add missing header.

eacc266

Fix typo.

47ebd1a

Handle bfloat16 while checking gradients.

dca8af1

UT with Ids as 4D tensor.

9b541f1

Add UT with W as selected rows.

465e31a

Refactor UT with selected rows.

ba3aca1

Skip test if no support for BF16

28e0410

Add UT with padding index.

af57d37

Merge remote-tracking branch 'upstream/develop' into aosewski/bf16_lo…

226aa33

…okup

arogowie-intel mentioned this pull request Mar 11, 2021

Unit test for lookup_table OP with SelectedRows in grad. #31559

Closed

Fix for Python2

0b3964d

arlesniak suggested changes Mar 11, 2021

View reviewed changes

python/paddle/fluid/tests/unittests/test_lookup_table_bf16_op.py Outdated Show resolved Hide resolved

python/paddle/fluid/tests/unittests/test_lookup_table_bf16_op.py Outdated Show resolved Hide resolved

arogowie-intel added 4 commits March 12, 2021 16:08

Review comments: refactoring.

5aa76da

Call check functions with place explicitly.

b1b5240

Merge remote-tracking branch 'upstream/develop' into aosewski/bf16_lo…

e54606d

…okup

Fix for old and EOL Python2.

d3ec88a

wozna added BF16 Intel labels Mar 15, 2021

wozna approved these changes Mar 15, 2021

View reviewed changes

luotao1 reviewed Mar 16, 2021

View reviewed changes

Merge remote-tracking branch 'upstream/develop' into aosewski/bf16_lo…

d699e85

…okup

arlesniak approved these changes Mar 18, 2021

View reviewed changes

luotao1 approved these changes Mar 19, 2021

View reviewed changes

luotao1 merged commit a4a2b77 into PaddlePaddle:develop Mar 19, 2021

luotao1 mentioned this pull request Mar 19, 2021

[oneDNN] Initial bf16 amp integration #31093

Merged

arogowie-intel deleted the aosewski/bf16_lookup branch March 19, 2021 08:44

lidanqing-intel mentioned this pull request Apr 14, 2021

Enable BF16 on Paddle Parameter Server Distributed Training #30560

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[oneDNN] lookup_table op with support for BF16 data type. #31558

[oneDNN] lookup_table op with support for BF16 data type. #31558

arogowie-intel commented Mar 11, 2021 •

edited by luotao1

Loading

arogowie-intel commented Mar 11, 2021

wozna left a comment

luotao1 Mar 16, 2021

arogowie-intel Mar 16, 2021

luotao1 Mar 16, 2021

arogowie-intel Mar 16, 2021 •

edited

Loading

arogowie-intel commented Mar 16, 2021

arlesniak left a comment

[oneDNN] lookup_table op with support for BF16 data type. #31558

[oneDNN] lookup_table op with support for BF16 data type. #31558

Conversation

arogowie-intel commented Mar 11, 2021 • edited by luotao1 Loading

PR types

PR changes

Describe

arogowie-intel commented Mar 11, 2021

wozna left a comment

Choose a reason for hiding this comment

luotao1 Mar 16, 2021

Choose a reason for hiding this comment

arogowie-intel Mar 16, 2021

Choose a reason for hiding this comment

luotao1 Mar 16, 2021

Choose a reason for hiding this comment

arogowie-intel Mar 16, 2021 • edited Loading

Choose a reason for hiding this comment

arogowie-intel commented Mar 16, 2021

arlesniak left a comment

Choose a reason for hiding this comment

arogowie-intel commented Mar 11, 2021 •

edited by luotao1

Loading

arogowie-intel Mar 16, 2021 •

edited

Loading