Non-square Filters and Separated Stride and Padding #505

shelhamer · 2014-06-16T08:10:13Z

add non-square kernel size, padding, and stride fields
check these parameters
add rectangular im2col test
fix checks -- see Rectangular pooling #614
fix GPU rectangular im2col (thanks @ejaz-izy!)
add separable filter test case for convolution layer.

Accept pairs of height/Y and width/X values for kernel size, stride, and pad in lieu of a single, shared value.

Of course the square/equal case still works and the old defaults (stride = 1 and padding = 0) are kept.
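
As a rough illustration of what separate height and width values mean for the output shape, here is a minimal sketch of the usual convolution output-size arithmetic applied per axis. The helper names `conv_out_dim`, `conv_out_shape`, and `OutShape` are hypothetical and are not part of this PR or of Caffe's API.

```cpp
#include <cassert>

// Hypothetical helper (not Caffe API): output extent along one axis,
// using floor((in + 2 * pad - kernel) / stride) + 1.
inline int conv_out_dim(int in, int kernel, int pad, int stride) {
  assert(kernel > 0 && stride > 0 && pad >= 0);
  assert(in + 2 * pad >= kernel);  // the kernel must fit in the padded input
  return (in + 2 * pad - kernel) / stride + 1;
}

// Hypothetical container for the two output extents.
struct OutShape {
  int height;
  int width;
};

// Each axis uses its own kernel size, padding, and stride.
inline OutShape conv_out_shape(int height, int width,
                               int kernel_h, int kernel_w,
                               int pad_h, int pad_w,
                               int stride_h, int stride_w) {
  OutShape shape;
  shape.height = conv_out_dim(height, kernel_h, pad_h, stride_h);
  shape.width = conv_out_dim(width, kernel_w, pad_w, stride_w);
  return shape;
}
```

Passing the same value for both axes reproduces the square case, which is why the old single-value settings keep working.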

Open to comments on implementation and style.

jeffdonahue · 2014-06-17T01:31:20Z

src/caffe/layers/im2col_layer.cpp

-  stride_ = this->layer_param_.convolution_param().stride();
-  pad_ = this->layer_param_.convolution_param().pad();
+  ConvolutionParameter conv_param = this->layer_param_.convolution_param();
+  kernel_height_ = conv_param.kernel_size(0);


should probably CHECK_GT(conv_param.kernel_size_size(), 0) first. Also CHECK_GT(kernel_height_ * kernel_width_, 0) after setting those -- we should have had a similar check all along really (since kernel_size defaults to 0).

(Please see my comments regarding protobuf fields)

Maybe we can keep the old format by having:

if (conv_param.has_kernel_size()) {
  kernel_height_ = conv_param.kernel_size();
  kernel_width_ = conv_param.kernel_size();
} else {
  CHECK_EQ(conv_param.rectangular_kernel_size_size(), 2)
      << "Must specify either kernel_size or rectangular_kernel_size (2 numbers).";
  kernel_height_ = conv_param.rectangular_kernel_size(0);
  kernel_width_ = conv_param.rectangular_kernel_size(1);
}

This may provide maximum backward compatibility.
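
The suggestion above can be tried out without protobuf using a stand-in struct. This is only a sketch under stated assumptions: `ConvParamSketch`, `KernelDims`, and `resolve_kernel_dims` are hypothetical names, the struct merely imitates the fields named in the comment, and `assert` stands in for the `CHECK_*` macros.

```cpp
#include <cassert>
#include <vector>

// Stand-in for the protobuf message; field names follow the comment above
// and are illustrative only.
struct ConvParamSketch {
  bool has_kernel_size = false;
  int kernel_size = 0;
  std::vector<int> rectangular_kernel_size;  // expected as {height, width}
};

// Hypothetical result type.
struct KernelDims {
  int height;
  int width;
};

// Prefer the old square field when it is set; otherwise require exactly
// two rectangular values. Either way, both dimensions must be positive.
inline KernelDims resolve_kernel_dims(const ConvParamSketch& p) {
  KernelDims k;
  if (p.has_kernel_size) {
    k.height = p.kernel_size;
    k.width = p.kernel_size;
  } else {
    assert(p.rectangular_kernel_size.size() == 2);
    k.height = p.rectangular_kernel_size[0];
    k.width = p.rectangular_kernel_size[1];
  }
  assert(k.height > 0 && k.width > 0);
  return k;
}
```

Checking positivity after the assignment covers the case raised in the review, where an unset kernel size would otherwise silently default to 0.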

Agreed. I've added these checks.


Yangqing · 2014-06-19T01:16:48Z

Have fun in Beijing Jeff and Evan :)

shelhamer · 2014-06-19T04:43:03Z

Thanks Yangqing, and thanks for the advice about the fields.

I'll introduce kernel_size_y, kernel_size_x, etc. I've added tests too, but
I still have work to do because my GPU im2col seems to be wrong...


shelhamer · 2014-07-02T03:00:13Z

Rebased to improve interface per #505 (comment).

However, this isn't done. See todo list in the PR message. I'll give this another shot after I finish some other work.

@buaaliyi you might try to debug the rectangular im2col test failure if you are still interested in this feature.

rmanor · 2014-07-02T20:49:38Z

Hi, I'd be happy to help if needed.
Thanks.

rmanor · 2014-07-10T20:47:05Z

Hey guys, I don't want to push, but are there any plans for this? Just to know if I should wait or not.
Thanks!

shelhamer · 2014-07-10T21:08:21Z

This has stalled for now. If you debug the GPU mode im2col please
contribute the fix!


rmanor · 2014-07-12T13:24:12Z

@shelhamer I spent a few hours on this and still haven't figured it out. Still trying...
Quick question though, was there a good reason to change the order of iteration between the cpu and gpu implementations of im2col?
Also, how is it possible that the stride value doesn't change the values in the test?
I don't see it taken into consideration when comparing the output to the input and the test succeeds if I change the stride values.
Shouldn't the values be different since the input index progresses in jumps of [stride]?
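
One possible reading of the stride question, offered as a sketch rather than a confirmed answer: im2col reads input row `h_out * stride_h - pad_h + h_offset`, and the test expressions quoted later in this thread only inspect output location (0, 0). Assuming zero padding, the stride term vanishes there. The function name `im2col_input_row` is hypothetical.

```cpp
#include <cassert>

// Input row read by im2col for output row h_out and kernel row h_offset.
// The same formula applies to columns with the width parameters.
inline int im2col_input_row(int h_out, int stride_h, int pad_h, int h_offset) {
  assert(stride_h > 0 && pad_h >= 0);
  return h_out * stride_h - pad_h + h_offset;
}
```

At `h_out = 0` the result is `h_offset - pad_h` for every stride, so a check that only reads output location (0, 0) cannot tell strides apart; any other output location would.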

ejaz-izy · 2014-07-27T22:13:20Z

Hello everyone,

I was looking to implement this functionality myself and came across this thread. Thanks @shelhamer for starting this. I found your implementation very helpful. As you mentioned, it was failing a few tests. I found some minor issues in the implementation that were causing the problem. They are listed below:

1. In function col2im_gpu_kernel (file im2col.cu):
   // int offset = (c * patch_h * patch_w + h * patch_h + w) * height_col * width_col;
   should be changed to
   int offset = (c * patch_h * patch_w + h * patch_w + w) * height_col * width_col;
   Explanation: assume for simplicity that c = 0 and stride_w = stride_h = 1. Then, to find where (h, w) is stored in data_col for location (hcol, wcol), we compute
   data_col + hcol * width_out + wcol + ((h - hcol) * patch_w + (w - wcol)) * height_out * width_out
   Expanding and rearranging shows that the multiplier should be patch_w instead of patch_h.
2. In function im2col_cpu (file im2col.cpp):
   // int w_offset = c % patch_h;
   // int h_offset = (c / patch_h) % patch_h;
   should be changed to
   int w_offset = c % patch_w;
   int h_offset = (c / patch_w) % patch_h;
3. In function col2im_cpu (file im2col.cpp):
   // int w_offset = c % patch_h;
   // int h_offset = (c / patch_h) % patch_h;
   should be changed to
   int w_offset = c % patch_w;
   int h_offset = (c / patch_w) % patch_h;
4. TestRectCPU (file test_im2col_layer.cpp):
   // EXPECT_EQ(this->blob_top_->data_at(0, c, 0, 0),
   //     this->blob_bottom_->data_at(0, (c / 15), (c / 5) % 5, c % 5));
   should be changed to
   EXPECT_EQ(this->blob_top_->data_at(0, c, 0, 0),
       this->blob_bottom_->data_at(0, (c / 15), (c / 3) % 5, c % 3));
5. TestRectGPU (file test_im2col_layer.cpp):
   // EXPECT_EQ(this->blob_top_->data_at(0, c, 0, 0),
   //     this->blob_bottom_->data_at(0, (c / 15), (c / 5) % 5, c % 5));
   should be changed to
   EXPECT_EQ(this->blob_top_->data_at(0, c, 0, 0),
       this->blob_bottom_->data_at(0, (c / 15), (c / 3) % 5, c % 3));

The explanation for items 2 through 5 is the same: w_offset must be taken modulo patch_w, and the rest follows accordingly.
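
To see the corrected index decomposition in one place, here is a minimal CPU sketch of a rectangular im2col for a single image. It is not the code merged in this PR: the function name `im2col_rect` and the use of `std::vector` are assumptions made for illustration, and the layout is assumed to be (channels, height, width) for the image and (channels * patch_h * patch_w, height_col, width_col) for the column buffer.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Sketch of rectangular im2col for one image (hypothetical helper).
std::vector<float> im2col_rect(const std::vector<float>& im,
                               int channels, int height, int width,
                               int patch_h, int patch_w,
                               int pad_h, int pad_w,
                               int stride_h, int stride_w) {
  assert(static_cast<int>(im.size()) == channels * height * width);
  const int height_col = (height + 2 * pad_h - patch_h) / stride_h + 1;
  const int width_col = (width + 2 * pad_w - patch_w) / stride_w + 1;
  const int channels_col = channels * patch_h * patch_w;
  std::vector<float> col(
      static_cast<std::size_t>(channels_col) * height_col * width_col, 0.0f);
  for (int c = 0; c < channels_col; ++c) {
    // A column channel enumerates (image channel, kernel row, kernel column)
    // with the kernel column varying fastest, so the width offset is taken
    // modulo patch_w and the height offset is the quotient modulo patch_h.
    const int w_offset = c % patch_w;
    const int h_offset = (c / patch_w) % patch_h;
    const int c_im = c / (patch_h * patch_w);
    for (int h = 0; h < height_col; ++h) {
      for (int w = 0; w < width_col; ++w) {
        const int h_im = h * stride_h - pad_h + h_offset;
        const int w_im = w * stride_w - pad_w + w_offset;
        // Positions that fall in the padding keep the zero fill.
        if (h_im >= 0 && h_im < height && w_im >= 0 && w_im < width) {
          col[(c * height_col + h) * width_col + w] =
              im[(c_im * height + h_im) * width + w_im];
        }
      }
    }
  }
  return col;
}
```

With a 5 x 3 patch (patch_h = 5, patch_w = 3) and no padding, the value at column channel `c` and output location (0, 0) comes from image position `(c / 15, (c / 3) % 5, c % 3)`, which matches the corrected test expressions in items 4 and 5.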

With the above changes, I ran the tests and all of them passed!

Sorry for writing a long response and thanks again for opening this issue.

shelhamer · 2014-07-29T05:45:24Z

@ejaz-izy thank you for taking a look and figuring out the issues! I will incorporate your fixes and finish up this PR with a separable convolution test that convolves with rectangular filters. I'll be sure to credit you in the commit message once this is done and rebased for merge.

I can confirm your fixes are correct -- I'm happy many-eyes came to the rescue here since I couldn't spot my mistake.


rmanor · 2014-07-29T19:42:26Z

Thanks @ejaz-izy!

Ran



shelhamer added enhancement labels Jun 16, 2014

shelhamer mentioned this pull request Jun 16, 2014

How to support rectangular filter? By introduce kernelSizeX and kernelSizeY? #490

Closed

jeffdonahue reviewed Jun 17, 2014

kloudkl mentioned this pull request Jun 30, 2014

Allow images of different sizes as inputs #557

Closed

ronghanghu mentioned this pull request Jul 4, 2014

Rectangular pooling #614

Merged

ronghanghu referenced this pull request in ronghanghu/caffe Jul 5, 2014

add tests for rectangular pooling regions

f74979e

shelhamer mentioned this pull request Jul 11, 2014

Non-square convolutional kernels #672

Closed

add h/w kernel size, stride, and pad for non-square filtering

edf438a

while keeping everything working as-is.

shelhamer added 2 commits July 29, 2014 01:20

im2col + convolve non-square filters, padding, and stride

4d44fe7

test rectangular im2col

909e4d8

shelhamer added 2 commits July 29, 2014 01:20

fix GPU indexing

ca4482c

Thanks to @ejaz-izy's debugging in BVLC#505 (comment)

test non-square filters by separable convolution of Sobel operator

dadeb99

Compute the G_x kernel of the Sobel operator as a full filter and as separable filters to check the rectangular filter output.
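
The idea behind this test can be sketched independently of Caffe. The Sobel G_x kernel factors into a 3 x 1 smoothing column [1, 2, 1] and a 1 x 3 difference row [-1, 0, 1], so applying the two rectangular filters in sequence should match the full 3 x 3 filter. The names `Image`, `correlate_valid`, `sobel_gx_full`, and `sobel_gx_separable` are hypothetical, and the sketch uses plain cross-correlation with no padding and stride 1 rather than the convolution layer itself.

```cpp
#include <cassert>
#include <vector>

typedef std::vector<std::vector<float> > Image;

// Cross-correlation with no padding and stride 1 (hypothetical helper).
Image correlate_valid(const Image& in, const Image& k) {
  assert(!in.empty() && !k.empty());
  const int kh = static_cast<int>(k.size());
  const int kw = static_cast<int>(k[0].size());
  const int oh = static_cast<int>(in.size()) - kh + 1;
  const int ow = static_cast<int>(in[0].size()) - kw + 1;
  assert(oh > 0 && ow > 0);
  Image out(oh, std::vector<float>(ow, 0.0f));
  for (int y = 0; y < oh; ++y)
    for (int x = 0; x < ow; ++x)
      for (int i = 0; i < kh; ++i)
        for (int j = 0; j < kw; ++j)
          out[y][x] += in[y + i][x + j] * k[i][j];
  return out;
}

// Full 3 x 3 Sobel G_x filter.
Image sobel_gx_full(const Image& in) {
  const Image gx = {{-1.0f, 0.0f, 1.0f},
                    {-2.0f, 0.0f, 2.0f},
                    {-1.0f, 0.0f, 1.0f}};
  return correlate_valid(in, gx);
}

// The same filter as a 3 x 1 column pass followed by a 1 x 3 row pass.
Image sobel_gx_separable(const Image& in) {
  const Image smooth_col = {{1.0f}, {2.0f}, {1.0f}};
  const Image diff_row = {{-1.0f, 0.0f, 1.0f}};
  return correlate_valid(correlate_valid(in, smooth_col), diff_row);
}
```

Because both intermediate filters are non-square, agreement between the two results exercises the separate height and width handling on each axis.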

shelhamer removed the work in progress label Jul 29, 2014

shelhamer added a commit that referenced this pull request Jul 29, 2014

Merge pull request #505 from shelhamer/non-square-filters

5542cf7

Non-square Filters and Separated Stride and Padding

shelhamer merged commit 5542cf7 into BVLC:dev Jul 29, 2014

shelhamer deleted the non-square-filters branch July 29, 2014 08:37

shelhamer mentioned this pull request Aug 7, 2014

Next: 0.9999 #880

Merged

shelhamer mentioned this pull request Aug 20, 2014

boost::python vs. cython and Python interface preprocessing profiling and improvement #941

Closed

mitmul pushed a commit to mitmul/caffe that referenced this pull request Sep 30, 2014

fix GPU indexing

e2ab94b

Thanks to @ejaz-izy's debugging in BVLC#505 (comment)

mitmul pushed a commit to mitmul/caffe that referenced this pull request Sep 30, 2014

Merge pull request BVLC#505 from shelhamer/non-square-filters

365d775

Non-square Filters and Separated Stride and Padding

RazvanRanca pushed a commit to RazvanRanca/caffe that referenced this pull request Nov 4, 2014

fix GPU indexing

782b605

Thanks to @ejaz-izy's debugging in BVLC#505 (comment)

RazvanRanca pushed a commit to RazvanRanca/caffe that referenced this pull request Nov 4, 2014

Merge pull request BVLC#505 from shelhamer/non-square-filters

ca6ec14

Non-square Filters and Separated Stride and Padding

aiworld pushed a commit to aiworld/aiworld.github.com that referenced this pull request Feb 28, 2015

fix GPU indexing

ca2d336

Thanks to @ejaz-izy's debugging in BVLC/caffe#505 (comment)
