
support inplace in dygraph eager_fluid state #40400

Merged · 61 commits · Mar 18, 2022

Conversation

pangyoki
Contributor

@pangyoki pangyoki commented Mar 10, 2022

PR types

New features

PR changes

Others

Describe

Add the inplace strategy to the dygraph intermediate (eager_fluid) state.

Python-C layer

Key implementation points:

  • When returning the inplace output tensor, the PyObject of the corresponding inplace input tensor should be returned directly, rather than creating a new PyObject; otherwise the ids of the inplace input and output on the Python side would differ.
static PyObject * eager_api_exp_(PyObject *self, PyObject *args, PyObject *kwargs)
{
  PyThreadState *tstate = nullptr;
  try
  {
    
    auto& X = GetTensorFromArgs("exp", "X", args, 0, false);
    framework::AttributeMap attrs;
    ConstructAttrMapFromPyArgs("exp", args, 1, PyTuple_GET_SIZE(args) , attrs);
    tstate = PyEval_SaveThread();
    auto out = exp__dygraph_function(X, attrs);
    PyEval_RestoreThread(tstate);
    tstate = nullptr;
    ssize_t arg_id = GetIdxFromCoreOpsInfoMap(core_ops_args_info, "exp", "X");
    ssize_t return_id = GetIdxFromCoreOpsInfoMap(core_ops_returns_info, "exp", "Out");
    return ToPyObject(out, return_id, args, arg_id);
  }
  catch(...) {
    if (tstate) {
      PyEval_RestoreThread(tstate);
    }
    ThrowExceptionToPython(std::current_exception());
    return nullptr;
  }
}
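The identity requirement above can be illustrated with a minimal pure-Python sketch (the Tensor class and exp_ method here are illustrative stand-ins, not Paddle's real API): an inplace method must hand back the very same Python object it received, so id(input) == id(output) holds on the Python side.

```python
import math

class Tensor:
    """Minimal stand-in for a framework tensor (illustration only)."""
    def __init__(self, data):
        self.data = list(data)

    def exp_(self):
        # Mutate the payload in place and return self; never wrap the
        # result in a freshly created object, or id() would change.
        self.data = [math.exp(v) for v in self.data]
        return self

x = Tensor([0.0, 1.0])
y = x.exp_()
assert y is x                      # the identity the Python-C binding must preserve
assert abs(x.data[1] - math.e) < 1e-12
```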

Dygraph layer

Key implementation points:

  • The auto-generated dygraph execution flow changes (functionality is unaffected):
    • Old flow: run TraceOp first, then create the autograd meta of the inputs and outputs, then build the backward graph.
    • New flow: inplace ops need a check_inplace check, which is best performed before TraceOp runs; moreover, check_inplace needs the input's autograd meta. So the flow becomes: first create the input autograd meta, then run check_inplace, then run TraceOp to produce the output, then create the output autograd meta, then build the backward graph.
  • Implementing inplace: when TraceOp executes, the input's EagerVariable is used directly as the output. No new output Tensor is created afterwards; the input Tensor serves as the output.
  • An inplace issue: EagerVariable does not modify the input Tensor's meta information (so inplace reshape could not change the ddim), so ModifyInplaceInput was added to update the inplace tensor's meta information.
paddle::experimental::Tensor exp__dygraph_function(paddle::experimental::Tensor& X, const paddle::framework::AttributeMap& attr_map) {

  paddle::platform::RecordEvent dygraph_entrance_record_event("exp dygraph", paddle::platform::TracerEventType::Operator, 1);
  VLOG(3) << "Running Eager Forward Op: exp";
  // Dygraph Forward Pass

  std::map<std::string, std::vector<std::shared_ptr<egr::EagerVariable>>> ins = { { "X", egr::EagerUtils::TrySyncToVars(X) } };

  std::map<std::string, std::vector<std::shared_ptr<egr::EagerVariable>>> outs = { { "Out", ins["X"] } };


  // Prepare Autograd Meta 
  egr::AutogradMeta* p_autograd_X = egr::EagerUtils::nullable_autograd_meta(X);

  bool trace_backward = egr::Controller::Instance().HasGrad();

  bool require_any_grad = egr::EagerUtils::ComputeRequireGrad(trace_backward, p_autograd_X);
  // Check Inplace
  egr::EagerUtils::CheckInplace(X, p_autograd_X, require_any_grad);

  paddle::framework::AttributeMap attrs = attr_map;
  paddle::framework::AttributeMap default_attrs;
  egr::Controller::Instance().GetCurrentTracer()->TraceOp("exp", ins, outs, attrs, 
     egr::Controller::Instance().GetExpectedPlace(),
     &default_attrs, true, {{"X", "Out"}});

  egr::EagerUtils::ModifyInplaceInput(outs["Out"][0], &X);
  X.bump_inplace_version();
  VLOG(3) << "Tensor(" << X.name() << ") uses Inplace Strategy.";

  {
    paddle::platform::RecordEvent node_creation_record_event("exp node_creation", paddle::platform::TracerEventType::Operator, 1);
    p_autograd_X = egr::EagerUtils::autograd_meta(&X);
    if(require_any_grad) {
      VLOG(6) << " Construct Grad for exp "; 
      egr::EagerUtils::PassStopGradient(false, p_autograd_X);
      // Create GradOpNode
      auto grad_node = std::make_shared<GradNodeexp>(1, 1);

      // Set Attributes
      grad_node->SetAttrMap(std::move(attrs));
      grad_node->SetDefaultAttrMap(std::move(default_attrs));

      // Set Tensor Wrappers
      grad_node->SetTensorWrapperOut(X, false);

      grad_node->SetGradOutMeta(p_autograd_X, 0);
      if(p_autograd_X) grad_node->AddEdges(p_autograd_X, 0);
      egr::EagerUtils::SetOutRankWithSlot(p_autograd_X, 0);
      egr::EagerUtils::SetHistory(p_autograd_X, grad_node);
      grad_node->SetGradInMeta(p_autograd_X, 0);
      egr::EagerUtils::CheckAndRetainGrad(X);

    }
  }

  return X;

}
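The ordering in the generated function above can be sketched in Python. This is a hypothetical analogue, not Paddle's real API (Var, check_inplace, and inplace_op are invented names): check_inplace runs before the op is traced, rejects inplace on a leaf that requires grad, and the traced op returns the input object itself with its inplace version bumped.

```python
class Var:
    """Illustrative stand-in for a dygraph variable."""
    def __init__(self, stop_gradient=True, is_leaf=True):
        self.stop_gradient = stop_gradient
        self.is_leaf = is_leaf
        self.inplace_version = 0

def check_inplace(var, require_any_grad):
    # A leaf variable that does not stop gradient may not be modified in place.
    if require_any_grad and var.is_leaf and not var.stop_gradient:
        raise ValueError(
            "Leaf Var that doesn't stop gradient can't use inplace strategy.")

def inplace_op(var):
    require_any_grad = not var.stop_gradient
    check_inplace(var, require_any_grad)   # checked before tracing the op
    var.inplace_version += 1               # TraceOp reuses the input as the output
    return var                             # same object is returned

# A grad-requiring leaf is rejected; a non-leaf passes and keeps its identity.
leaf = Var(stop_gradient=False, is_leaf=True)
try:
    inplace_op(leaf)
except ValueError as e:
    print("rejected:", e)

mid = Var(stop_gradient=False, is_leaf=False)
out = inplace_op(mid)
assert out is mid and mid.inplace_version == 1
```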

Backward detection

  • Add a snapshot_inplace_version_ snapshot field to TensorWrapper.
  • When a backward GradNode executes and the Tensor is recovered from its TensorWrapper, perform the backward inplace_version check: verify that snapshot_inplace_version_ matches the Tensor's current_inplace_version_.
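This backward check can be sketched in Python roughly as follows (EagerTensor and the method names here are simplified stand-ins for illustration, not Paddle's real classes): the wrapper snapshots the version at forward time and compares it when the tensor is recovered in backward.

```python
class EagerTensor:
    """Illustrative stand-in; only tracks an inplace version counter."""
    def __init__(self):
        self.current_inplace_version = 0

class TensorWrapper:
    def __init__(self, tensor):
        self.tensor = tensor
        # Snapshot taken when the forward op wraps the tensor.
        self.snapshot_inplace_version = tensor.current_inplace_version

    def recover(self):
        # On backward, a version mismatch means an inplace op modified the
        # tensor after it was captured for gradient computation.
        if self.snapshot_inplace_version != self.tensor.current_inplace_version:
            raise RuntimeError(
                "Tensor used in gradient computation has been modified by an "
                f"inplace operation. Its version is "
                f"{self.tensor.current_inplace_version} but the expected "
                f"version is {self.snapshot_inplace_version}.")
        return self.tensor

t = EagerTensor()
w = TensorWrapper(t)            # forward: snapshot version 0
t.current_inplace_version += 1  # an inplace op bumps the version
try:
    w.recover()                 # backward: mismatch is detected
except RuntimeError as e:
    print(e)
```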

Examples

  • An inplace operation
with _test_eager_guard():
    a = paddle.rand([2,3])
    a.stop_gradient=False
    b = a * 2
    c = b ** 2
    d = c.exp_()
    print(c.inplace_version)
    # 1
    print(id(c) == id(d))
    # True
    d.sum().backward()
  • An inplace operation on a leaf node with stop_gradient=False raises an error
with _test_eager_guard():
    a = paddle.rand([2,3])
    a.stop_gradient=False
    a.reshape_([-1])

# ValueError: (InvalidArgument) Leaf Var () that doesn't stop gradient can't use inplace strategy.
  • Backward detection raises an error
with _test_eager_guard():
    a = paddle.rand([2,3])
    a.stop_gradient=False
    b = a * 2
    c = b ** 2
    d = b.exp_()
    c.sum().backward()

# RuntimeError: (PermissionDenied) Tensor '' used in gradient computation has been modified by an inplace operation. Its version is 1 but the expected version is 0. Please fix your code to void calling an inplace operator after using the Tensor which will used in gradient computation.

@paddle-bot-old

paddle-bot-old bot commented Mar 10, 2022

✅ This PR's description meets the template requirements!
Please wait for other CI results.

@paddle-bot-old

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@@ -94,15 +105,52 @@ class TensorWrapper {
intermidiate_tensor_.set_autograd_meta(
std::static_pointer_cast<paddle::experimental::AbstractAutogradMeta>(
p_ab_autograd_meta));
check_inplace_version();
Contributor:
Looks like we're gonna check inplace version anyway, let's move this function "check_inplace_version" out.

Contributor Author:
done in PR #41118

@@ -716,6 +716,15 @@ static PyObject* set_grad_type(TensorObject* self, PyObject* args,
EAGER_CATCH_AND_THROW_RETURN_NULL
}

static PyObject* tensor__inplace_version(TensorObject* self, PyObject* args,
Contributor:
single underscore "_" in function name?

Contributor:
It's OK if this method corresponds to the _inplace_version API in Python.

std::string ins_initializer_with_null = "";
std::string py_arg = "";
int arg_idx = 0;
int input_args_num = 0;
std::string ins_cast_str = "";
std::string view_strategy_str = "";
if (!inplace_map.empty()) {
// change call_api_str for inplace op
call_api_str = "auto out = " + op_type + "__dygraph_function(";
Contributor:
Better add "" at the very end of the function name, like "scale_dygraph_function" for inplaced scale

jim19930609
jim19930609 previously approved these changes Mar 18, 2022
std::map<std::string, std::string> inplace_map;
// `sum` op has duplicate input. Don't consider adding inplace strategy
// for `sum` in temporary.
if (op_type != "sum" && infer_inplace) {
Contributor:
Better to store hard-coded op names in a static set

Contributor Author:
done in PR #41118

JiabinYang
JiabinYang previously approved these changes Mar 18, 2022
@JiabinYang (Contributor) left a comment:
LGTM

@pangyoki pangyoki dismissed stale reviews from JiabinYang and jim19930609 via 9491a06 March 18, 2022 06:27
@XieYunshen (Contributor) left a comment:
LGTM for set_tests_properties(test_inplace_eager_fluid PROPERTIES TIMEOUT 120)

@pangyoki pangyoki merged commit 8e61290 into PaddlePaddle:develop Mar 18, 2022
5 participants