Scale Invariant CNN (SICNN) #576
One can run a single net on a multi-scale pyramid by weight sharing, or run on whatever scale is desired by on-the-fly net reshaping in #594. A single extraction of a deep feature invariant to all scaling is not possible due to filter discretization, nonlinearities, and so on (although one can downsample and upsample features as they please).
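For concreteness, here is a minimal pycaffe sketch of that pyramid idea, assuming a fully-convolutional deploy net; `deploy.prototxt`, `weights.caffemodel`, and the blob names `data` and `conv5` are placeholders, not anything specific from #594.

```python
# Minimal sketch: extract features at several scales by reshaping the same
# weight-shared net for each pyramid level. File names and blob names below
# are placeholders; assumes a fully-convolutional deploy net.
import caffe
import numpy as np
from skimage.transform import resize

net = caffe.Net('deploy.prototxt', 'weights.caffemodel', caffe.TEST)

def pyramid_features(image, scales=(0.5, 1.0, 2.0), feat_blob='conv5'):
    """Run one net over an image pyramid, returning a feature map per scale.
    `image` is H x W x C, float."""
    feats = []
    for s in scales:
        h, w = int(image.shape[0] * s), int(image.shape[1] * s)
        scaled = resize(image, (h, w), preserve_range=True)
        # channels-first, add batch dimension
        blob = scaled.transpose(2, 0, 1)[np.newaxis, ...].astype(np.float32)
        # on-the-fly reshaping: conv layers accept any spatial size
        net.blobs['data'].reshape(*blob.shape)
        net.reshape()
        net.blobs['data'].data[...] = blob
        net.forward()
        feats.append(net.blobs[feat_blob].data.copy())
    return feats
```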
Angjoo Kanazawa, Abhishek Sharma, David Jacobs. Locally Scale-Invariant Convolutional Neural Network. Deep Learning and Representation Learning Workshop, NIPS 2014.
Hi,
Sounds great, but wouldn't that take much more time? I mean, you need to transform every blob several times for the max-pooling, right?
Yeah, it does take more memory and time. Now I recommend checking out this recent arXiv paper: http://arxiv.org/abs/1506.02025
The Spatial Pyramid Pooling net of #548 improves on the speed of Regions with Convolutional Neural Network Features (R-CNN) by extracting features for each image only once, while R-CNN does so for each region of interest in an image. The most important insight of SPP-net is that only the classifiers, i.e. the fully-connected layers, require a fixed-length vector; the convolution layers do not constrain the size of the input image. The experiments show that full images are better than cropped ones and that larger scales lead to higher accuracy.
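A rough NumPy sketch of the pooling step that gives SPP-net this property (an illustration of the idea, not the reference implementation): each pyramid level max-pools the conv feature map into a fixed grid of bins, so the concatenated vector length depends only on the bin configuration, never on the image size.

```python
import numpy as np

def spp_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool a C x H x W conv feature map into fixed grids of bins.
    Output length is C * sum(l*l for l in levels), regardless of H and W."""
    c, h, w = feature_map.shape
    pooled = []
    for l in levels:
        # bin boundaries that cover the whole map for any H, W
        hs = np.linspace(0, h, l + 1).astype(int)
        ws = np.linspace(0, w, l + 1).astype(int)
        for i in range(l):
            for j in range(l):
                region = feature_map[:, hs[i]:max(hs[i + 1], hs[i] + 1),
                                        ws[j]:max(ws[j + 1], ws[j] + 1)]
                pooled.append(region.max(axis=(1, 2)))
    return np.concatenate(pooled)

# Two different input sizes yield the same fixed-length vector:
a = spp_pool(np.random.rand(256, 13, 13))
b = spp_pool(np.random.rand(256, 31, 17))
assert a.shape == b.shape  # (256 * (1 + 4 + 16),)
```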
SPP-net simulates multiple scales with fixed-size networks, so the "scale-mismatch" problem is not solved. In #308, multi-scale feature extraction is achieved by packing the multiple scales of an image into a single large image. Both can only handle pre-defined discrete scales.
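As a toy illustration of that packing idea (the actual tiling in #308 may differ), one could resize the image to each predefined scale and paste the copies onto one large canvas, so that a single forward pass covers all scales:

```python
import numpy as np
from skimage.transform import resize

def pack_scales(image, scales=(0.6, 0.8, 1.0)):
    """Place rescaled copies of an H x W x C image side by side in one
    canvas so one forward pass sees all predefined scales.
    Illustrative layout only; #308 may tile the scales differently."""
    copies = [resize(image,
                     (int(image.shape[0] * s), int(image.shape[1] * s)),
                     preserve_range=True)
              for s in scales]
    canvas_h = max(c.shape[0] for c in copies)
    canvas_w = sum(c.shape[1] for c in copies)
    canvas = np.zeros((canvas_h, canvas_w, image.shape[2]), dtype=np.float32)
    x = 0
    for c in copies:
        canvas[:c.shape[0], x:x + c.shape[1]] = c
        x += c.shape[1]
    return canvas
```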
An authentically scale-invariant CNN would mean that the extracted features can be scaled up or down to obtain the features of the image undergoing the same scaling, so the features of an image only have to be extracted once by the network.
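To make that property concrete, here is a small NumPy check one could run against any extractor: compare the features of a rescaled image with a rescaled version of the original features. `extract_features` is a placeholder for whatever net is being evaluated, not an existing function.

```python
import numpy as np
from skimage.transform import resize

def scale_equivariance_error(extract_features, image, s=0.5):
    """Measure how far an extractor is from the property described above:
    features of a rescaled image should match rescaled features.
    `extract_features` maps an H x W x C image to an h x w x c map and is
    a placeholder for the net under test."""
    small = resize(image, (int(image.shape[0] * s), int(image.shape[1] * s)),
                   preserve_range=True)
    f_scaled = extract_features(small)          # features of the rescaled image
    f = extract_features(image)                 # features of the original image
    f_resized = resize(f, f_scaled.shape[:2], preserve_range=True)
    return np.abs(f_scaled - f_resized).mean()  # 0 for a perfectly equivariant net
```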
Any ideas about existing work in this direction?