Converting to and from cuda-convnet #212

jamt9000 · 2014-03-14T20:04:45Z

Unfinished, but creating PR to start discussion.

I plan on creating a tool to convert between cuda-convnet (pickled) files and caffe protobuffers.

At the moment it can (try to) convert the cuda-convnet "schema" to prototxt format (i.e., ignoring the blobs). Next I will copy the blobs too and work on going the other way. It also doesn't do anything sensible about the input and loss layers yet.

If you have the supplied decaf net (trained with cuda-convnet) you can try:

print cudaconv_to_prototxt('./imagenet.decafnet.epoch90')
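
To make the schema-only step concrete, here is a minimal sketch of turning a list of layer descriptions into prototxt-style text. It is not the code in this PR: the layer dicts, their keys (`name`, `type`, `inputs`) and the helper `layers_to_prototxt` are made up for illustration, and the emitted field layout has not been checked against any particular version of caffe.proto.

```python
def layers_to_prototxt(layers):
    """Render hypothetical layer dicts as prototxt-style text (no blobs)."""
    lines = []
    for layer in layers:
        lines.append('layers {')
        lines.append('  name: "%s"' % layer['name'])
        lines.append('  type: "%s"' % layer['type'])
        # Each input becomes a "bottom" link; the layer's own name is its "top".
        for bottom in layer.get('inputs', []):
            lines.append('  bottom: "%s"' % bottom)
        lines.append('  top: "%s"' % layer['name'])
        lines.append('}')
    return '\n'.join(lines) + '\n'


# Assumed, simplified layer descriptions (not the real cuda-convnet layout).
example_layers = [
    {'name': 'conv1', 'type': 'conv', 'inputs': ['data']},
    {'name': 'pool1', 'type': 'pool', 'inputs': ['conv1']},
]
schema_text = layers_to_prototxt(example_layers)
```

The sketch gives every layer its own top blob, so it ignores details such as neuron layers that operate in place.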

Yangqing · 2014-03-14T23:24:07Z

Thanks @jamt9000 !

If it helps, the old decaf repo has some code that tries to read and convert cuda-convnet formats. Of course decaf uses yet another network definition, but it may be helpful:

https://github.com/UCB-ICSI-Vision-Group/decaf-release/tree/master/decaf/util/translator

shelhamer · 2014-03-17T20:05:41Z

Note #219. If you convert to the current caffe.proto the tool in #219 will update it to the new schema we're adopting, but you may want to wait and convert directly to the new one.

jamt9000 · 2014-03-18T10:56:46Z

Yeah, I'll switch to the new schema when that's all settled.

Also, I spent a day debugging before realising...the decaf net is upside down :| But now the conversion seems to work. Also, another nice surprise is that caffe imagenet wrapper net is BGR instead of RGB :)
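
Both surprises come down to array layout. A minimal NumPy sketch, assuming an image stored as height x width x channels in RGB order; the helper names are made up for illustration and are not part of the conversion tool:

```python
import numpy as np


def flip_vertical(im):
    """Reverse the row order (turn the image upside down)."""
    return im[::-1]


def rgb_to_bgr(im):
    """Reverse the channel order on the last axis."""
    return im[:, :, ::-1]


def to_batch_chw(im):
    """H x W x C -> 1 x C x H x W, as contiguous float32."""
    chw = im.transpose(2, 0, 1)[None, ...]
    return np.ascontiguousarray(chw, dtype=np.float32)
```

Which of these steps a given net needs depends on how it was trained, so they are kept as separate functions rather than one fixed pipeline.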

jamt9000 · 2014-03-18T13:37:14Z

Here's an example to show it working

Convert the net:

from convert_net import *

# Write the network definition as prototxt text
netpt = cudaconv_to_prototxt('/home/james/Code/decaf-release/imagenet/imagenet.decafnet.epoch90')
with open('/tmp/decafnet.caffe.prototxt', 'w') as fh:
    fh.write(netpt)

# Write the serialized binary protobuf
netpb = cudaconv_to_proto('/home/james/Code/decaf-release/imagenet/imagenet.decafnet.epoch90')
with open('/tmp/decafnet.caffe', 'wb') as fh:
    fh.write(netpb.SerializeToString())

Compare predictions on lena with DeCAF:

import numpy as np
import caffe
from decaf.scripts.imagenet import DecafNet
from pylab import imread

# prepare image
imname = '/home/james/Desktop/lena.jpg' # 256x256
im = imread(imname)[16:-16,16:-16,:]    # 224x224
im = im - 128. # subtract "mean"
flippedim = np.require(im[::-1][None,...], np.float32, 'C')
flippedim2 = np.rollaxis(flippedim,3,1).copy() # channels first for caffe

# Predict with DeCAF
dnet = DecafNet('/home/james/Code/decaf-release/imagenet/imagenet.decafnet.epoch90', '/home/james/Code/decaf-release/imagenet/imagenet.decafnet.meta')
decaf_prediction = dnet.classify_direct(flippedim).mean(0)

# Predict with CAFFE
caffenet = caffe.CaffeNet('/tmp/decafnet.caffe.prototxt', '/tmp/decafnet.caffe')
caffenet.set_phase_test()
caffenet.set_mode_cpu()
output_blobs = [np.empty((1, 1000, 1, 1), dtype=np.float32)]
caffenet.Forward([flippedim2], output_blobs)
caffe_prediction = output_blobs[0].mean(0).flatten()

# Use top_k function from DeCAF
decaf_labels, decaf_names = dnet.top_k_prediction(decaf_prediction, 5)
caffe_labels, caffe_names = dnet.top_k_prediction(caffe_prediction, 5)

print 'DeCAF', decaf_names
print 'probs', decaf_prediction[decaf_labels]
print 'CAFFE', caffe_names
print 'probs', caffe_prediction[caffe_labels]


# DeCAF ['sombrero', 'cowboy hat', 'hand blower', 'bonnet', 'shower cap']
# probs [ 0.33793405  0.26088405  0.06967235  0.05376853  0.02327657]
# CAFFE ['sombrero', 'cowboy hat', 'hand blower', 'bonnet', 'shower cap']
# probs [ 0.33793321  0.26088339  0.06967238  0.05376886  0.02327651]
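
The two sets of top-5 probabilities printed above differ by less than 1e-6. One way to check that kind of agreement, using only the printed values (the tolerance and the `top_k_indices` helper are arbitrary choices for this sketch, not DeCAF's `top_k_prediction`):

```python
import numpy as np

# Top-5 probabilities copied from the output above.
decaf_top5 = np.array([0.33793405, 0.26088405, 0.06967235, 0.05376853, 0.02327657])
caffe_top5 = np.array([0.33793321, 0.26088339, 0.06967238, 0.05376886, 0.02327651])

# Largest absolute difference between the two result vectors.
max_abs_diff = np.abs(decaf_top5 - caffe_top5).max()

# Absolute-tolerance comparison only (rtol disabled).
same_within_tol = np.allclose(decaf_top5, caffe_top5, rtol=0, atol=1e-5)


def top_k_indices(probs, k):
    """Indices of the k largest entries, highest first."""
    return np.argsort(probs)[::-1][:k]
```

On full 1000-way outputs, comparing `top_k_indices` of both predictions as well as the probabilities would catch ordering differences that a loose tolerance might hide.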

Yangqing · 2014-03-19T00:57:19Z

Thanks @jamt9000 for working this out, and I apologize for the decaf upside-down thing - yeah, it is a legacy issue and we were too lazy to address it...

jamt9000 · 2014-03-20T11:27:39Z

I'll wait for the refactoring before working on writing to cuda-convnet format. But I'm wondering what to do about dropout, since it is left as an "exercise for the reader" in the official cuda-convnet and there are various different implementations around (e.g. dnouri's). Would BVLC be able to share the patch they use, to allow for better comparison?

shelhamer · 2014-03-22T08:50:10Z

@jamt9000 the proto refactoring should be merged soon–thanks for your patience.

As for a dropout patch for cuda-convnet, there is no such patch from Caffe since it was developed as a separate project. @jeffdonahue is your JeffNet fork of cuda-convnet public?

Re: BGR, we have OpenCV's default to thank for that. Who would ever choose that?

bhack · 2014-11-27T00:34:14Z

This is the oldest PR in the queue. It is 7 months old. Why do some PRs like this seem abandoned?

Is the proposer still available?
Are we waiting for other contributors to come and finish this work?
Are we waiting for a response from the core team? Is this PR no longer interesting?

Yangqing · 2014-11-27T00:53:38Z

Please understand that we are a small research team doing this, and will prioritize stuff based on actual needs - at the current moment, my impression is that supporting cuda-convnet is not a top priority. Caffe has all the training capabilities so there is little reason for us to invest time on this.

If anyone would like to finish writing a translator I am happy to include it in an unsupported/ folder. But keep in mind that our further development may well break it.

Yangqing

bhack · 2014-11-27T00:57:56Z

@Yangqing I know. I asked about this older PR because I want to understand whether the process of handling the PR queue could be improved, so as to encourage "third party" contributors during the period when they actually have time to invest.

bhack · 2014-11-27T01:04:25Z

We could probably group PRs using common tag semantics so it's always clear what the "stage/status" of an open PR is.

ducha-aiki · 2014-11-27T10:00:51Z

@Yangqing, maybe the caffe-core-dev group could consider adding one active community member as a moderator with merge rights? This happens in almost all communities once a library (or whatever) becomes popular.

bhack · 2014-11-27T10:15:45Z

I don't know what the best solution is; every community has found its own. The current approach surely doesn't scale.

jamt9000 · 2014-11-27T12:37:38Z

I am very busy at the moment (just started a PhD) but should have time closer to Christmas and will surely be contributing to Caffe more in the future.

bhack · 2014-11-27T12:44:20Z

@jamt9000 never mind, life changes. This is why I suggest keeping the time window fairly short on PRs for smaller contributions, so we don't end up sitting on unfinished code.

shelhamer · 2014-12-30T03:41:21Z

Closing as stale -- however model converters are welcome.

plutolove · 2015-08-31T10:30:12Z

Can you send ‘imagenet.decafnet.epoch90’ and ‘imagenet.decafnet.meta’ to me?
The download link is no longer available.

dupf · 2016-04-07T02:49:27Z

Can you send ‘imagenet.decafnet.epoch90’ and ‘imagenet.decafnet.meta’ to me?
The download link is no longer available.

qiongxiao · 2017-06-01T09:13:37Z

@jamt9000 do you have the two files "imagenet.decafnet.epoch90" and "imagenet.decafnet.meta"? They can no longer be downloaded from Yangqing Jia's homepage: http://daggerfs.com/index.html#publications.

shelhamer and others added 4 commits March 14, 2014 10:51

note hdf5 prerequisite in installation guide

d8e15ef

Started work on reading cuda-convnet files

3a540b1

Can read the pickled cuda-convnet format and convert to prototxt format. Currently missing the top/bottom links and blob data.

Include top/bottom fields reading cuda-convnet

a3cec5c

Scaffolding for CudaConvNetWriter

3745c05

jamt9000 added 5 commits March 15, 2014 19:59

Support reading from cuda-convnet snapshots

d2e3f9d

ie, the pickled dict of the model rather than just the pickled list of layers

Convert the cuda-convnet weights to blobs

6fdefea

Fix bias shape

4376b58

Do not create data and loss layers by default.

3e08c67

To create a proper prototxt model file that can be used with the python API. Add the data name and dimensions to the NetParameter fields.

Correctly handle "in place" top/bottom links for neurons

1ae1b4d

jamt9000 added 2 commits March 18, 2014 14:13

Move definition of get_non_neuron_inputs outside of loop and rename

c0c47a5

No num_output needed for pool layer

c0c9e44

shelhamer added the work in progress label Mar 22, 2014

Yangqing mentioned this pull request Jul 25, 2014

Script to convert cude-convnet check-point to caffe protobuffer #780

Closed

shelhamer force-pushed the dev branch 3 times, most recently from 4278286 to c01f07a Compare August 28, 2014 07:00

shelhamer added in progress and removed work in progress labels Aug 29, 2014

shelhamer force-pushed the dev branch from 64258b6 to 403b56b Compare September 19, 2014 04:38

shelhamer removed the in progress label Sep 21, 2014

shelhamer force-pushed the dev branch from d8eb4df to 914da95 Compare October 8, 2014 16:36

sergeyk force-pushed the dev branch from 2fb4c97 to 1718903 Compare October 17, 2014 18:44

shelhamer added the compatibility label Dec 30, 2014

shelhamer closed this Dec 30, 2014

shelhamer added the sandbox label Dec 30, 2014
