Conv cudnn 3d #5783

typhoonzero · 2017-11-20T12:44:59Z

chengduoZH · 2017-11-20T13:59:17Z

paddle/operators/conv_cudnn_op.cu.cc

    int group_offset_out =
-        output_channels / groups * output_height * output_width;
+        output_channels / groups * output_height * output_width * output_depth;
    int group_offset_filter = filter->numel() / groups;


Do you think it would be simpler to write it this way?

According to http://www.cplusplus.com/reference/vector/vector/erase/

Because vectors use an array as their underlying storage, erasing elements in positions other than the vector end causes the container to relocate all the elements after the segment erased to their new positions.

Erasing the first two elements forces all the remaining elements to be moved forward (it does not re-allocate memory), which is not efficient.
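To make the erase discussion concrete, here is a minimal sketch of the two ways to drop leading entries from a shape vector. The helper names DropFrontByErase and DropFrontByCopy are hypothetical and are not taken from this PR; the sketch only assumes a std::vector<int> of dimensions.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: removes the first n entries in place.
// erase() at the front moves every remaining element toward the
// beginning of the buffer; the buffer itself is not re-allocated.
std::vector<int> DropFrontByErase(std::vector<int> dims, std::size_t n) {
  assert(n <= dims.size());
  dims.erase(dims.begin(), dims.begin() + static_cast<std::ptrdiff_t>(n));
  return dims;
}

// Hypothetical helper: builds the result directly from the tail range,
// so only the elements that are kept get copied, exactly once.
std::vector<int> DropFrontByCopy(const std::vector<int>& dims, std::size_t n) {
  assert(n <= dims.size());
  return std::vector<int>(dims.begin() + static_cast<std::ptrdiff_t>(n),
                          dims.end());
}
```

Both helpers return the same values. For a five-element shape the cost difference is negligible, so the choice here is mostly about readability.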

chengduoZH · 2017-11-20T14:05:58Z

paddle/operators/conv_cudnn_op.cu.cc

    int group_offset_out =
-        output_channels / groups * output_height * output_width;
+        output_channels / groups * output_height * output_width * output_depth;
    int group_offset_filter = filter->numel() / groups;


Groups are supported in cuDNN 7.0.
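The diff above extends the per-group output offset with a depth factor. A minimal sketch of that arithmetic follows, assuming a contiguous NCDHW layout and a channel count divisible by the number of groups. GroupOffset3D is a hypothetical name, not a function from the PR.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: number of elements that one group occupies within
// a single sample of a contiguous NCDHW tensor. This is the stride used
// to step from one group to the next when looping over groups manually.
std::int64_t GroupOffset3D(std::int64_t channels, std::int64_t groups,
                           std::int64_t depth, std::int64_t height,
                           std::int64_t width) {
  assert(groups > 0 && channels % groups == 0);
  return channels / groups * depth * height * width;
}
```

Passing depth = 1 reproduces the 2D expression on the removed line, which is why the 3D case only needs the extra factor.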


chengduoZH · 2017-11-24T08:22:51Z

paddle/operators/conv_cudnn_op.cu.cc

+    cudnnConvolutionDescriptor_t cudnn_conv_desc =
+        conv_desc.descriptor<T>(paddings, strides, dilations);
+
+#if CUDNN_VERSION > 6000


Change #if CUDNN_VERSION > 6000 to #if CUDNN_VERSION >= 7000 or #if CUDNN_VERSION_MIN(7,0,0).

This place needs to be changed too.
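A short sketch of why the two conditions differ, assuming the cuDNN 6.x and 7.x encoding of CUDNN_VERSION as major * 1000 + minor * 100 + patch. Under that encoding a 6.0.21 install reports 6021, which passes a > 6000 test even though it is not a 7.x release. The function names below are hypothetical stand-ins that take the version as an argument so the logic can be checked without cuDNN headers.

```cpp
// Assumption: cuDNN 6.x and 7.x encode CUDNN_VERSION as
// major * 1000 + minor * 100 + patch (for example 6.0.21 -> 6021).
constexpr int EncodeCudnnVersion(int major, int minor, int patch) {
  return major * 1000 + minor * 100 + patch;
}

// Hypothetical stand-in for a CUDNN_VERSION_MIN(major, minor, patch)
// style check, with the installed version passed in explicitly.
constexpr bool CudnnVersionAtLeast(int installed, int major, int minor,
                                   int patch) {
  return installed >= EncodeCudnnVersion(major, minor, patch);
}

// A 6.0.21 install satisfies the old condition...
static_assert(EncodeCudnnVersion(6, 0, 21) > 6000,
              "6.0.21 passes the > 6000 test");
// ...but is correctly rejected by the 7.0.0 minimum check.
static_assert(!CudnnVersionAtLeast(EncodeCudnnVersion(6, 0, 21), 7, 0, 0),
              "6.0.21 is not at least 7.0.0");
static_assert(CudnnVersionAtLeast(EncodeCudnnVersion(7, 0, 0), 7, 0, 0),
              "7.0.0 satisfies the minimum");
```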

chengduoZH · 2017-11-24T09:43:10Z

paddle/operators/conv_cudnn_op.cu.cc

@@ -155,19 +200,34 @@ class CudnnConvGradOpKernel : public framework::OpKernel<T> {
    cudnnTensorDescriptor_t cudnn_input_grad_desc = nullptr;


cudnn_input_grad_desc and cudnn_input_desc are the same, so you can replace cudnn_input_grad_desc with cudnn_input_desc. Just like this.

chengduoZH

LGTM++

typhoonzero added 4 commits November 20, 2017 19:36

conv cudnn 3d

d5e327a

update test case

86a0c99

update

754be4d

update

f03915e

typhoonzero requested a review from chengduoZH November 20, 2017 12:44

chengduoZH reviewed Nov 20, 2017


chengduoZH added the OpPorting label Nov 21, 2017

typhoonzero added 5 commits November 22, 2017 16:17

follow comments and remove groups from helper

6cc4cb3

update

1fac167

refine

7f0ba5a

Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into…

946061f

… conv_cudnn_3d

update

7e89c41

chengduoZH reviewed Nov 24, 2017


typhoonzero added 3 commits November 24, 2017 19:11

follow comments2

91a2a81

update

3ae6646

fix compile

419d6c8

chengduoZH approved these changes Nov 27, 2017


typhoonzero merged commit a06bec1 into PaddlePaddle:develop Nov 27, 2017

typhoonzero deleted the conv_cudnn_3d branch December 22, 2017 05:43
