Add copy from tensor #34406

shangzhizhou · 2021-07-27T02:37:30Z

PR types

Others

PR changes

APIs

Describe

add copy_from_tensor api for inference tensor

一、python增加同步copy_tensor的api，使用示例

from paddle.inference.contrib import utils

utils.copy_tensor(dst_tensor, src_tensor)

二、C++增加paddle_infer::Tensor::CopyToCpuAsync接口(当前只支持GPU到CPU的异步拷贝，此时不能使用Host申请的内存，需要使用cuda的pinned memory，可以使用提供的工具函数申请和释放 CudaMallocPinnedMemory()/CudaFreePinnedMemory())，示例如下

返回stream的调用方式

  //...
  //predictor运行代码

  const auto &output_names = predictor->GetOutputNames();
  auto output_tensor = predictor->GetOutputHandle(output_names[0]);
  std::vector<int> output_shape = output_tensor->shape();
  int out_num = std::accumulate(output_shape.begin(), output_shape.end(), 1,
                                std::multiplies<int>());

  float *out_data = static_cast<float *>(
      contrib::TensorUtils::CudaMallocPinnedMemory(sizeof(float) * out_num));

  cudaStream_t stream;
  output_tensor->CopyToCpuAsync(out_data, static_cast<void *>(&stream));

  // sync
  cudaStreamSynchronize(stream);

  contrib::TensorUtils::CudaFreePinnedMemory(static_cast<void *>(out_data));

使用回调的调用方式

  //...
  //predictor运行代码

  const auto &output_names = predictor->GetOutputNames();
  auto output_tensor = predictor->GetOutputHandle(output_names[0]);
  std::vector<int> output_shape = output_tensor->shape();
  int out_num = std::accumulate(output_shape.begin(), output_shape.end(), 1,
                                std::multiplies<int>());

  float *out_data = static_cast<float *>(
      contrib::TensorUtils::CudaMallocPinnedMemory(sizeof(float) * out_num));

  output_tensor->CopyToCpuAsync(
      out_data,
      [](void *cb_params) {
        float *data = static_cast<float *>(cb_params);
        for (int i = 0; i < 10; i++) {
          std::cout << data[i] << std::endl;
        }
      },
      static_cast<void *>(out_data));

  cudaDeviceSynchronize();
  contrib::TensorUtils::CudaFreePinnedMemory(static_cast<void *>(out_data));

三、增加C++ tensor拷贝函数

  static void CopyTensor(Tensor* p_dst, const Tensor& src);
  static void CopyTensorAsync(Tensor* p_dst, const Tensor& src,
                              void* exec_stream);
  static void CopyTensorAsync(Tensor* p_dst, const Tensor& src, CallbackFunc cb,
                              void* cb_params);

异步使用方式参考Tensor.CopyToCpuAsync()
测试代码参考 paddle/fluid/inference/tests/api/paddle_infer_api_copy_tensor_tester.cc

… add_copy_from_tensor

paddle-bot-old · 2021-07-27T02:37:33Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

… add_copy_from_tensor

Superjomn · 2021-08-25T05:16:27Z

paddle/fluid/inference/api/paddle_infer_contrib.h

+      const std::string& name, PlaceType place, void* p_scope);
+
+ private:
+  static void CopyTensorImp(Tensor& dst, const Tensor& src, void* exec_stream,


Imp -> Impl

Superjomn · 2021-08-25T05:18:58Z

paddle/fluid/inference/tests/api/paddle_infer_api_copytensor.cc

+  std::vector<float> input(in_num, 1.0);
+
+  auto input_names = predictor->GetInputNames();
+  auto input_t = predictor->GetInputHandle(input_names[0]);


input_tensor

_t 后缀一般表示 type，比如 value_t

Superjomn · 2021-08-25T05:19:33Z

paddle/fluid/inference/api/details/zero_copy_tensor.cc

@@ -185,7 +187,8 @@ void Tensor::CopyFromCpu(const T *data) {
 }

 template <typename T>
-void Tensor::CopyToCpu(T *data) {
+void Tensor::CopyToCpuImp(T *data, void *exec_stream, CallbackFunc cb,


全局， Imp -> Impl

Superjomn · 2021-08-25T05:22:09Z

paddle/fluid/inference/api/paddle_infer_contrib.cc

+
+using paddle::PaddleDType;
+
+std::unique_ptr<Tensor> TensorUtils::CreateInferTensorForTest(


用于单测的，最好单独拆出去，用 WITH_TESTING 宏隔开

这个会进到最终生产环境的库里面吧？

done，thanks

Superjomn · 2021-08-25T05:23:48Z

paddle/fluid/inference/tests/api/paddle_infer_api_copytensor.cc

@@ -0,0 +1,325 @@
+/* Copyright (c) 2021 PaddlePaddle Authors. All Rights Reserved.
+


文件名 copy_tensor，两个单词

这是个单测？

文件名也需要加 _tester

Superjomn · 2021-08-25T05:27:54Z

最好把用例，功能在 PR 描述里面也加下

paddle/fluid/inference/api/paddle_infer_contrib.cc

paddle/fluid/inference/api/paddle_infer_contrib.h

paddle/fluid/inference/tests/api/paddle_infer_api_copytensor.cc

paddle/fluid/inference/api/paddle_infer_contrib.cc

shangzhizhou · 2021-08-25T08:26:19Z

最好把用例，功能在 PR 描述里面也加下

done

XieYunshen

LGTM for 'set_tests_properties(paddle_infer_api_copy_tensor_tester PROPERTIES TIMEOUT 30) '

This reverts commit ac33c0c.

…dle#35173)" This reverts commit 32c1ec4.

* Revert "Revert "Add copy from tensor (#34406)" (#35173)" This reverts commit 32c1ec4. * add template instantiation

shangzhizhou added 2 commits July 26, 2021 20:57

add api

0e568d3

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

24f6ae2

… add_copy_from_tensor

shangzhizhou added 24 commits August 12, 2021 19:30

temp save

80ce681

fix conflict

107ce3c

revert

1fe24a9

copytocpu async ok

5dbcc80

fix style

37deaab

copy sync ok

656abda

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

de42de4

… add_copy_from_tensor

fix compile error

1cb2f58

fix compile error

5967b02

api done

718429e

update python async api

1a20251

fix compile

94a7798

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

40334ab

… add_copy_from_tensor

remove async python api; add c++ async unittest

293fa53

remove python async api

5b18916

update unittest

8db5a62

update unittest

01917d8

add C++ unittest for copytensor

c4b0956

add unittest

e566a52

update namespace utils to class TensorUtils

23890de

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

743cb76

… add_copy_from_tensor

add unittest

6334896

update unittest

be93486

update unittest

3f4a40d

Superjomn reviewed Aug 25, 2021

View reviewed changes