Within-channel LRN layer #273

Merged: jeffdonahue merged 18 commits from the lrn-map-layer branch into BVLC:dev on Apr 8, 2014

Conversation

jeffdonahue
Contributor

(Not sure how I accidentally submitted this without any description.)

This PR implements within-channel local contrast response normalization across a square neighborhood of each input channel, à la cuda-convnet's rnorm layer [1]. This layer is used in many of the cuda-convnet CIFAR example architectures, including our current cifar_full example, which was based on the layers-18pct example in cuda-convnet, so I've updated that example to use the new layer type here. It doesn't make much of a difference -- the full training run reaches (exactly) 82% accuracy, versus the 81.65% the old cross-channel normalization was getting. It is also unfortunately slightly slower, taking 6 minutes and 57 seconds for 5000 iterations (compared to 6 minutes and 43 seconds for the cross-channel normalization), but I think this makes sense since we're summing over N^2 input pixels for each output instead of N. I still think it's nice to be able to reproduce these network architectures exactly, even if it doesn't make much of a difference in practice which type you use.

Because I'm not smart enough to write something along the lines of the code for the current cross-channel LRN layer [2], I basically implemented this under the hood as a sequence of 5 other layer types, including 2 new ones: EltwiseProductLayer, which computes z = x .* y (excuse the MATLAB notation) over >= 2 input blobs, and PowerLayer (open to suggestions for a better name), a neuron which computes z = (alpha + beta * x) ^ gamma for fixed values of those parameters. This implementation has a small memory penalty, as it uses a few internal blobs to store the intermediate results of each layer's computation. If somebody wanted to rewrite this later without the "helper layers" to make it more memory efficient, they could do that.

[1] https://code.google.com/p/cuda-convnet/wiki/LayerParams#Local_response_normalization_layer_(same_map)

[2] https://github.com/BVLC/caffe/blob/master/src/caffe/layers/lrn_layer.cpp
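
For readers who want to see the composition concretely, here is a rough standalone sketch (not the code in this PR; the zero-padding at borders and the exact placement of alpha, beta, and the 1/N^2 factor are my assumptions, chosen to mirror cuda-convnet's rnorm): square the input, average-pool the squares over the size x size neighborhood, raise (1 + alpha * pooled) to the -beta power, and multiply elementwise with the original input.

```cpp
// Rough sketch of the within-channel LRN composition described above:
// square -> average pool -> power -> elementwise product.
// Not the PR's code; padding and scaling conventions are assumptions.
#include <cmath>
#include <vector>

// Normalizes one H x W channel stored row-major; `size` is the (odd) side
// length of the square neighborhood.
std::vector<float> within_channel_lrn(const std::vector<float>& in,
                                      int height, int width, int size,
                                      float alpha, float beta) {
  // PowerLayer-style step: elementwise square of the input.
  std::vector<float> squared(in.size());
  for (size_t i = 0; i < in.size(); ++i) {
    squared[i] = in[i] * in[i];
  }

  const int half = size / 2;
  std::vector<float> out(in.size());
  for (int h = 0; h < height; ++h) {
    for (int w = 0; w < width; ++w) {
      // Average-pooling step: mean of the squares over the zero-padded
      // size x size window centered at (h, w).
      float sum = 0.0f;
      for (int dh = -half; dh <= half; ++dh) {
        for (int dw = -half; dw <= half; ++dw) {
          const int hh = h + dh;
          const int ww = w + dw;
          if (hh >= 0 && hh < height && ww >= 0 && ww < width) {
            sum += squared[hh * width + ww];
          }
        }
      }
      const float avg = sum / static_cast<float>(size * size);
      // PowerLayer step, (1 + alpha * avg)^(-beta), followed by the
      // EltwiseProductLayer step: multiply by the original input.
      out[h * width + w] =
          in[h * width + w] * std::pow(1.0f + alpha * avg, -beta);
    }
  }
  return out;
}
```

Run per channel (and per image in the batch), this should correspond to the chain of helper layers described above, up to how the window average is normalized at image borders.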

@jeffdonahue changed the title from "Lrn map layer" to "Within-channel LRN layer" on Mar 29, 2014
@shelhamer
Member

Agreed that exact replication's in the spirit of having reference architectures and examples. I don't quite understand why you introduced the layers instead of doing blob operations, but perhaps the helper layers could be otherwise useful, and like you said they do no harm.

The direct, efficient way can be Caffe dev practice for the future.

(Note I haven't fully reviewed this -- someone else should take a look and merge.)

@jeffdonahue
Contributor Author

My main motivation was to avoid rewriting code to sum over regions (for which the implementation, to me at least, looks pretty hairy). This is handled by an (average) PoolingLayer instead.
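
To make that concrete: an AVE pooling pass over an N x N window already yields the window mean, so the windowed sum of squares that the normalization needs is just that mean scaled by the window area (or equivalently, the N^2 factor can be folded into alpha). A toy illustration, not Caffe code:

```cpp
// Toy illustration (not Caffe code): the output of AVE pooling is the window
// mean, so the windowed sum is recovered by scaling with the window area.
inline float window_sum_from_ave_pool(float pooled_mean, int window_side) {
  return pooled_mean * static_cast<float>(window_side * window_side);
}
```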

@kloudkl
Contributor

kloudkl commented Mar 29, 2014

The acronym for local contrast normalization is perhaps LCN.

vector<Blob<Dtype>*> power_top_vec_;
shared_ptr<EltwiseProductLayer<Dtype> > product_layer_;
Blob<Dtype> product_data_input_;
vector<Blob<Dtype>*> product_bottom_vec_;
Contributor

It seems that both PowerLayer and EltwiseProductLayer would be good candidates for the refactoring in #244.

@shelhamer
Member

This is distinct from LCN. This is a within-channel scoped response normalization. LCN normalizes with variance instead, if I remember right.
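
For anyone comparing the two, a hedged summary from memory (not something specified in this PR): the rnorm-style layer here is purely divisive, scaling each activation by a function of the local sum of squares, whereas local contrast normalization (in the Jarrett et al. sense) first subtracts a local weighted mean and then divides by a local standard deviation, roughly:

```latex
% Hedged comparison, not taken from this PR; constants and weighting are
% assumptions for illustration.
\[
\text{within-channel LRN:}\quad
b_{x,y} = a_{x,y} \Big/ \Big(1 + \tfrac{\alpha}{N^2}
  \sum_{(x',y') \in \mathcal{N}(x,y)} a_{x',y'}^{2}\Big)^{\beta}
\]
\[
\text{LCN (roughly):}\quad
v_{x,y} = a_{x,y} - \sum_{(x',y') \in \mathcal{N}(x,y)} w_{x'-x,\,y'-y}\, a_{x',y'},
\qquad
b_{x,y} = \frac{v_{x,y}}{\max\!\big(c,\ \sigma_{\mathcal{N}(x,y)}(v)\big)}
\]
```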


@jeffdonahue
Contributor Author

Yup, my bad -- it is indeed not contrast normalization (see https://code.google.com/p/cuda-convnet/wiki/LayerParams#Local_contrast_normalization_layer).

@jeffdonahue
Contributor Author

Let me know if someone wants to merge this*. If/when that happens, I'll first change the added IDs in the LayerType enum (and the PowerParameter ID) to the actual next available values (they currently use placeholder values of 1000+ to ease rebasing).

*if not, feel free to close the PR -- not going to be offended if people don't care about having a within-channel LRN in Caffe.

@shelhamer
Member

I'm all for including this to polish the replication, but I don't see myself reviewing this soon.

How about you set the field IDs, take a last glance at the diff with dev, and merge it yourself?

jeffdonahue added a commit that referenced this pull request on Apr 8, 2014
@jeffdonahue merged commit 58998df into BVLC:dev on Apr 8, 2014
@jeffdonahue
Contributor Author

Thanks for giving the go-ahead, Evan -- done.

@jeffdonahue deleted the lrn-map-layer branch on April 8, 2014 at 18:54
@shelhamer mentioned this pull request on May 20, 2014
mitmul pushed a commit to mitmul/caffe that referenced this pull request on Sep 30, 2014