
Refactor LayerParameter into per-layer Parameter messages #219

Merged
shelhamer merged 60 commits into BVLC:dev on Mar 28, 2014

Conversation

jeffdonahue
Contributor

Addresses #208. This changes the way net protos are specified. Before:

layers {
  layer {
    name: "conv1"
    type: "conv"
    num_output: 96
    kernelsize: 11
    stride: 4
    weight_filler {
      type: "gaussian"
      std: 0.01
    }
    bias_filler {
      type: "constant"
      value: 0.
    }
    blobs_lr: 1.
    blobs_lr: 2.
    weight_decay: 1.
    weight_decay: 0.
  }
  bottom: "data"
  top: "conv1"
}

After:

layers {
  name: "conv1"
  type: CONVOLUTION
  bottom: "data"
  top: "conv1"
  blobs_lr: 1
  blobs_lr: 2
  weight_decay: 1
  weight_decay: 0
  convolution_param {
    num_output: 96
    kernel_size: 11
    stride: 4
    weight_filler {
      type: "gaussian"
      std: 0.01
    }
    bias_filler {
      type: "constant"
      value: 0
    }
  }
}

The most notable change is that there is now a message, e.g. convolution_param, for each layer type that has its own parameters. I also turned the layer types from a string into an enum to catch spelling errors etc. at compile time.
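For illustration, here is a rough sketch (not code from this PR; the helper function is made up) of how a layer might read its typed parameters through the generated protobuf accessors, using the field names from the example above:

#include <glog/logging.h>
#include "caffe/proto/caffe.pb.h"

// Hypothetical helper: reads conv settings from the new per-layer message.
void SetUpConvolution(const caffe::LayerParameter& param) {
  CHECK_EQ(param.type(), caffe::LayerParameter_LayerType_CONVOLUTION);
  const caffe::ConvolutionParameter& conv = param.convolution_param();
  LOG(INFO) << "conv layer " << param.name() << ": "
            << conv.num_output() << " outputs, kernel "
            << conv.kernel_size() << ", stride " << conv.stride();
  // ... allocate the weight/bias blobs and apply the fillers ...
}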

This should be fully backward compatible with old model protos - the Net::Net(const string& param_file) constructor will attempt to load the param_file using the new proto specification, but if that fails will instead try to load it with the old ("v0") proto specification and then convert it to the new format at runtime, printing a warning that you should convert your nets to the new format.
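To make the fallback concrete, a rough sketch under assumed names (ParseV0AndUpgrade is a placeholder for the v0 parse-and-convert path, not necessarily the real identifier):

#include <string>

#include <glog/logging.h>
#include <google/protobuf/text_format.h>

#include "caffe/proto/caffe.pb.h"

// Placeholder for the v0 path described above (parse with the old schema,
// then convert to the new format); the real function names may differ.
bool ParseV0AndUpgrade(const std::string& text, caffe::NetParameter* param);

// Sketch of the fallback: try the new schema first, then the v0 schema.
bool LoadNetParam(const std::string& text, caffe::NetParameter* param) {
  if (google::protobuf::TextFormat::ParseFromString(text, param)) {
    return true;  // already in the new format
  }
  LOG(WARNING) << "Could not parse as a new-format NetParameter; trying the "
               << "deprecated v0 format. Please upgrade your net with "
               << "tools/upgrade_net_proto.";
  param->Clear();
  return ParseV0AndUpgrade(text, param);
}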

There is also a script, tools/upgrade_net_proto.cpp, which converts a v0-formatted proto to the new format and saves the result to a file.

Both the converter script and the Net constructor first pass the v0-formatted proto through a function UpgradeV0Padding that turns padding layers into pad-aware conv layers (assuming their output feeds into conv layers; it errors if not), and then through another function that upgrades to the new proto format.
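The padding fold-in itself can be illustrated with a simplified standalone version that uses plain structs instead of the real v0 proto messages (whose exact fields aren't shown here); the real code errors out via LOG(FATAL) rather than throwing:

#include <map>
#include <stdexcept>
#include <string>
#include <vector>

struct SimpleLayer {
  std::string name, type, bottom, top;
  int pad = 0;  // set on "padding" (source) and "conv" (destination) layers
};

// Copy every non-padding layer through; when a padding layer is found,
// remember its pad value, drop it, and attach the pad to the consuming conv.
std::vector<SimpleLayer> FoldPaddingIntoConv(const std::vector<SimpleLayer>& in) {
  std::map<std::string, int> pad_of_blob;     // padded blob name -> pad amount
  std::map<std::string, std::string> source;  // padded blob -> original blob
  std::vector<SimpleLayer> out;
  for (const SimpleLayer& layer : in) {
    if (layer.type == "padding") {
      pad_of_blob[layer.top] = layer.pad;
      source[layer.top] = layer.bottom;
      continue;  // the padding layer itself is removed
    }
    SimpleLayer copy = layer;
    if (pad_of_blob.count(layer.bottom)) {
      if (layer.type != "conv") {
        throw std::runtime_error("padding layer not followed by a conv layer");
      }
      copy.pad = pad_of_blob[layer.bottom];
      copy.bottom = source[layer.bottom];  // bypass the removed padding layer
    }
    out.push_back(copy);
  }
  return out;
}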

@shelhamer
Member

This is awesome! I like that 0def06d kicks it off with style.

This also addresses #170.

@sguada could you try the upgrade tool on some of your models, just to double-check?

@sguada
Contributor

sguada commented Mar 17, 2014

@jeffdonahue great job!

Although so far, in a clean directory, it doesn't compile:
src/caffe/layers/hdf5_data_layer.cpp:80:23: error: ‘batchsize’ was not declared in this scope

I guess that's due to the changes: other layers that assumed some params were available in LayerParameter no longer have them.

I will look into the code with more care, once I can compile it.

@jeffdonahue
Contributor Author

Sorry about that, forgot to commit the last set of files I had to change after rebasing to get this to compile. It should compile with the commit I just pushed.

Other than @sguada and anyone else making sure this works with their existing models, I still want to try a couple things out before we consider merging this - e.g. make sure finetuning an old model still works.

@sguada
Contributor

sguada commented Mar 17, 2014

Great, @jeffdonahue, now the code compiles and passes all tests except the hdf5_data_layer one:

[  FAILED  ] 2 tests, listed below:
[  FAILED  ] HDF5DataLayerTest/0.TestRead, where TypeParam = float
[  FAILED  ] HDF5DataLayerTest/1.TestRead, where TypeParam = double

@shelhamer I think with this one and #209 we should merge them first into a different branch, and make sure everything works well before merging into dev.

@jeffdonahue
Contributor Author

Whoops, the HDF5 test failure was a merge conflict problem, I think. It passes with the latest commit.

@shelhamer
Member

@sguada I agree it is important to test how major PRs combine. You can check out a PR as a branch for testing with hub. You can then do merges and such with other branches to see how they integrate.

Once we have tried the joint merges ourselves I think we can merge to dev and continue to test before the release to master.

@sguada
Contributor

sguada commented Mar 17, 2014

Totally agree, @shelhamer. I'm using hub, as you showed me before, to do that.

What I meant is that we usually test PRs in isolation, and that is fine, but these two big ones will affect any other PR on the way. So we can either merge the other PRs first and then adjust #219 and #209, or we will have to adjust all other pending PRs to the new formats.

@shelhamer
Member

I vote for these core changes #219 and #209 going in first and other PRs rebasing on them. Individually they should have only a little work to do.

@jeffdonahue, that should be less work for you so we can do our testing then merge this.


@jeffdonahue
Contributor Author

Works for me (obviously) but am also happy to rebase and tweak on any other PRs you want to merge before.

@shelhamer
Member

The back-merge of historical PRs to master into dev caused some conflicts (most likely in caffe.proto I'd imagine).

@jeffdonahue
Contributor Author

Understood if we're still waiting to merge, but FYI this and #209 (forward pass loss) are now rebased on dev.

} else {
LOG(FATAL) << "Unknown layer name: " << type;
case LayerParameter_LayerType_NONE:
LOG(FATAL) << "Layer " << name << " has unspecified type.";
Contributor

break;

Contributor Author

Should be fine, LOG(FATAL) causes a crash.
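For context, the pattern in question looks roughly like this sketch (abridged, not the actual factory code); LOG(FATAL) aborts the process, so the case cannot fall through and a trailing break would be dead code:

#include <glog/logging.h>
#include "caffe/proto/caffe.pb.h"

// Sketch of the layer-factory dispatch discussed above (names abridged).
void CheckLayerType(const caffe::LayerParameter& param) {
  switch (param.type()) {
    case caffe::LayerParameter_LayerType_NONE:
      LOG(FATAL) << "Layer " << param.name() << " has unspecified type.";
      // no break: LOG(FATAL) aborts before control could fall through
    case caffe::LayerParameter_LayerType_CONVOLUTION:
      // ... construct the corresponding layer ...
      break;
    default:
      LOG(FATAL) << "Unknown layer type: " << param.type();
  }
}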

@sergeyk
Contributor

sergeyk commented Mar 20, 2014

Looks great, I think we should go ahead and merge.
One thing: we should have a documentation page explaining the layer parameters.
Jeff, could you contribute the first such document, for convolution layers?
I'd put it in docs/layers.md; all layers will be briefly documented there.

@shelhamer
Member

#245's window data layer params will need to be added and the padding layer should be dropped, as discussed offline (Jeff's auto-conversion makes the deprecation unnecessary).

@sguada
Contributor

sguada commented Mar 21, 2014

@jeffdonahue please add name to blobs params

@jeffdonahue
Contributor Author

@sguada huh? I will rebase this, add window data layer, do a bit of testing (beyond the existing unit tests), and merge - hopefully all today.

@sguada
Contributor

sguada commented Mar 22, 2014

Sorry Jeff, I meant to say that Blobs could have a name as a parameter, which is defined by the layer. It would make them a bit easier to manage.

message BlobProto {
  optional int32 num = 1 [default = 0];
  optional int32 channels = 2 [default = 0];
  optional int32 height = 3 [default = 0];
  optional int32 width = 4 [default = 0];
  repeated float data = 5 [packed = true];
  repeated float diff = 6 [packed = true];
  optional string name = 7 [default = ""];
}
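For example, a layer could then look up a saved blob by name when copying weights (hypothetical sketch; FindBlobByName is made up and the name field only exists in this proposal):

#include <string>

#include "caffe/proto/caffe.pb.h"

// Hypothetical use of the proposed name field: find a saved blob by name
// instead of relying on its position within the layer.
const caffe::BlobProto* FindBlobByName(const caffe::LayerParameter& layer,
                                       const std::string& name) {
  for (int i = 0; i < layer.blobs_size(); ++i) {
    if (layer.blobs(i).name() == name) {
      return &layer.blobs(i);
    }
  }
  return NULL;  // no blob with that name
}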

@sguada
Contributor

sguada commented Mar 22, 2014

Does the upgrade_net_proto also upgrade the prototxt files? Or only the network params in the proto file?

cropsize: 227
}
name: "data"
type: IMAGE_DATA
Contributor

The IMAGE_DATA params are missing here.

@jeffdonahue
Contributor Author

@sguada great catch on the missing IMAGE_DATA params - I just added support for that layer type and I guess I missed its parameters. I'll fix that and step through every other layer type (again) to make sure I didn't miss anything else.

Aside from that, this actually needs a bit more work: I need to go through all the tools/*.cpp to make them all backwards compatible with V0 net param files by calling the Net constructor which takes a filename string. (So far I only changed train_net's entry point; from grepping ReadProtoFromTextFile and ReadProtoFromBinaryFile there are several more.)
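For reference, a sketch of what that change to a tool looks like (illustrative names only; the filename constructor is the one described at the top of this PR):

#include <string>

#include "caffe/net.hpp"
#include "caffe/proto/caffe.pb.h"
#include "caffe/util/io.hpp"

// Before (sketch): the tool parses the file itself, so only new-format
// protos load.
void RunToolOld(const std::string& net_proto_file) {
  caffe::NetParameter net_param;
  caffe::ReadProtoFromTextFile(net_proto_file.c_str(), &net_param);
  caffe::Net<float> net(net_param);
  // ... use net ...
}

// After (sketch): the filename constructor falls back to the v0 schema and
// upgrades it at load time, so old net protos keep working.
void RunToolNew(const std::string& net_proto_file) {
  caffe::Net<float> net(net_proto_file);
  // ... use net ...
}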

Does the upgrade_net_proto also upgrade the prototxt files? Or only the network params in the proto file.

Not sure what you mean by this - the changes to examples/feature_extraction/imagenet_val.prototxt and the other prototxts show what the upgrade_net_proto script does (they were all done automatically just by running upgrade_net_proto, except that the script doesn't retain comments, so I manually re-added a couple of those).

@sguada
Contributor

sguada commented Mar 22, 2014

@jeffdonahue, I think we should provide the script to transform the prototxts and snapshots, but I don't think we should change all the tools/*.cpp to be aware of the changes. A net should be able to be constructed given network params, although you can add the option to read it from a file.

Yes, I meant whether the changes to imagenet_val.prototxt and the others were automatic or manual.

@shelhamer
Member

@sguada I'm confused why you don't want to upgrade the tools. @jeffdonahue's new ReadProto* methods transparently load the old or new schema as needed and convert. One can still construct a net from a NetParameter otherwise loaded however you'd like.

I agree with Jeff's plan in #219 (comment) for finishing this.

@sguada
Contributor

sguada commented Mar 22, 2014

Hi Jeff,
I was thinking that conversion is done outside of the network so it doesn't need to know about it. So there are two options. Choose the one you think will be easier to maintain in the future.


@shelhamer
Member

Almost perfect–but there's some chaff to clean up:

1322fa3 adds symlinks to caffe model params and solver state and `examples/imagenet/finetune*.sh` scripts that I think are accidental. Please rewrite that commit so that these aren't versioned.

@shelhamer
Member

I've read over everything and tests pass so I'll merge as soon as those files are dropped.

Sweet work Jeff!

@jeffdonahue
Contributor Author

Whoops, great catch - got lazy and git added that whole directory and (apparently) didn't inspect the results closely. History rewritten to exclude those files. Thanks for reviewing and all your help along the way, Evan!

shelhamer added a commit that referenced this pull request Mar 28, 2014
Refactor LayerParameter into per-layer Parameter messages and add tools for seamless proto upgrade.
@shelhamer merged commit 06bef17 into BVLC:dev Mar 28, 2014
@shelhamer
Member

[image: layer cake]

Lovely layers.

@jeffdonahue deleted the refactor-layerparam-proto branch March 28, 2014 08:01