
Add WARPLossLayer and gradient check test cases #126

Closed
wants to merge 1 commit into from

Conversation

@kloudkl (Contributor) commented Feb 18, 2014

This pull request implements the Weighted Approximate Rank Pairwise (WARP) loss layer in response to issue #88, "Implement ranking loss layer". WARP [1] was applied in a similar manner to image annotation [2] and is also useful for image retrieval with text queries.

The only open source implementation of WARP loss that I found is in the mrec recommender systems Python library from Mendeley. WARP was used there to optimize a matrix factorization model and a hybrid ranking model.

Following the contributing protocol designed by @shelhamer in #101, I opened this work-in-progress PR to welcome comments that catch defects as early as possible and to guide further development.

References:
[1] Jason Weston, Samy Bengio, and Nicolas Usunier. Wsabie: Scaling up to large vocabulary image annotation. In IJCAI, 2011.
[2] Yunchao Gong, Yangqing Jia, Sergey Ioffe, Alexander Toshev, Thomas Leung. Deep Convolutional Ranking for Multilabel Image Annotation. arXiv:1312.4894 [cs.CV]
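For intuition, the sampling procedure behind WARP [1] can be sketched in NumPy roughly as follows. This is a hedged illustration only, not the proposed layer's code; the function name, its arguments, and the default RNG are mine, while the margin test, the rank estimate from the number of trials, and the L(k) = sum_{j=1..k} 1/j weighting follow the paper:

```python
import numpy as np

def warp_loss(scores, positives, rng=None):
    """Monte-Carlo WARP loss for one sample, after Weston et al. [1].

    scores    -- 1-D array with one model score per label
    positives -- indices of the sample's true labels
    """
    if rng is None:
        rng = np.random.default_rng(0)
    num_labels = len(scores)
    negatives = [y for y in range(num_labels) if y not in set(positives)]
    # L(k) = sum_{j=1..k} 1/j converts an estimated rank into a weight
    # that concentrates the penalty on the top of the ranking.
    rank_weight = np.cumsum(1.0 / np.arange(1, num_labels + 1))
    loss = 0.0
    for y in positives:
        # Sample negatives until one violates the margin f(ybar) > f(y) - 1.
        for trials in range(1, len(negatives) + 1):
            ybar = rng.choice(negatives)
            margin = 1.0 - scores[y] + scores[ybar]
            if margin > 0:
                # The number of trials needed yields the rank estimate.
                rank = (num_labels - 1) // trials
                loss += rank_weight[rank - 1] * margin
                break
    return loss
```

If no sampled negative violates the margin, the positive label contributes zero loss, which is what makes the estimator cheap for well-ranked labels.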

@kloudkl (Contributor, Author) commented Feb 18, 2014

The base branch of this PR was not set to the newly created dev branch. A project maintainer can change it with the Pull Request API. Thanks!

curl --user ":owner" \
     --request POST \
     --data '{"issue": "126", "head": "kloudkl:warp_loss_layer", "base": "dev"}' \
     https://api.github.com/repos/BVLC/caffe/pulls/

@shelhamer (Member)
As far as I know, the API actually creates a new PR rather than updating the existing one. Since dev is a recent invention for Caffe, this PR and other open PRs have amnesty, and we'll merge them to dev behind the scenes.

@shelhamer (Member)
As to the actual content of this PR: ranking loss has been on my todo list as well, and WARP is a welcome choice of loss.

@sergeyk (Contributor) commented Feb 24, 2014

Thanks @kloudkl!
@shelhamer will review this after March 7.

@kloudkl (Contributor, Author) commented Feb 25, 2014

I have not made progress since opening this PR because an assumption was not satisfied: computing the loss depends on how the multiple labels of a data point are stored in the bottom label blob. Currently the DataLayer only supports single-label input data, so the loss layer is blocked by the data layer. That's why I opened #144 asking for a potential external solution.

Now the related works are concentrated in #149, thanks to @sergeyk. I will join the discussion there to get the data layer implemented first and refine the WARP loss layer at the same time. Changes coordinated with #149 are to be expected.

@sergeyk (Contributor) commented Mar 13, 2014

@kloudkl, if you need multi-label input to make progress on this, the HDF5DataLayer provides both data and label as vectors. See the test example.
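For illustration, a multi-label HDF5 file using the `data`/`label` dataset names that HDF5DataLayer reads could be written with h5py roughly as below; the sample count, shapes, and file name are hypothetical:

```python
import os
import tempfile

import h5py
import numpy as np

# Hypothetical multi-label dataset: 10 samples, 3x8x8 inputs, 4 classes.
# The "data" and "label" dataset names follow HDF5DataLayer's convention.
path = os.path.join(tempfile.mkdtemp(), "multilabel.h5")
with h5py.File(path, "w") as f:
    f.create_dataset("data",
                     data=np.random.rand(10, 3, 8, 8).astype(np.float32))
    f.create_dataset("label",
                     data=np.random.randint(0, 2, (10, 4)).astype(np.float32))

# Read the shapes back to confirm the layout.
with h5py.File(path, "r") as f:
    shapes = (f["data"].shape, f["label"].shape)
```

Each row of "label" is then a fixed-length vector, one entry per class.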

@kloudkl (Contributor, Author) commented Mar 15, 2014

@sergeyk, thanks for the HDF5DataLayer! I will give it a try. We also need tools to convert existing datasets into the HDF5 format.

@jay2002 commented Mar 23, 2014

@kloudkl, have you tried converting your dataset into HDF5 format?
If my dataset has 4 labels (0, 1, 2, 3) and one sample carries 3 of them (0, 1, 2), should the label vector be [1 1 1 0] or [0 1 2]?

@kloudkl (Contributor, Author) commented Mar 23, 2014

@jay2002, I am busy with some other related issues, #244, #250 and #251, at the moment, so I haven't converted a dataset yet.

#220 used the C++ interface of HDF5, which must be manually compiled on Ubuntu 12.04. Have a look at it to see if it helps. I will implement #213 using the C interface very soon.

@sergeyk, as the creator of the HDF5DataLayer, you must have a lot of first-hand experience doing this. How did you handle such situations?

@sergeyk (Contributor) commented Mar 23, 2014

Although the HDF5DataLayer can load matrices for label data, I haven't ever used that functionality. The answer to @jay2002's question depends entirely on the loss layer, and I'm not up to date on our multi-label loss options, sorry.

One thing I can say is that each row of the label matrix must be the same size, so your option 2 doesn't make sense.
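In other words, each sample gets a fixed-size indicator (multi-hot) row. A minimal sketch of that encoding (the helper name is mine):

```python
import numpy as np

def multi_hot(labels, num_classes):
    """Encode a variable-length label set as a fixed-length indicator row."""
    row = np.zeros(num_classes, dtype=np.float32)
    row[list(labels)] = 1.0
    return row

# 4 possible labels, sample carries labels {0, 1, 2} -> [1, 1, 1, 0]
encoded = multi_hot([0, 1, 2], 4)
```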


@kloudkl (Contributor, Author) commented Mar 23, 2014

HDF5OutputLayer has been implemented in #252. You can convert a dataset using a network with only a DataLayer and an HDF5OutputLayer.
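Such a conversion net might look roughly like the sketch below. The layer names, the input database path, and the output file name are illustrative, and the exact prototxt spelling varies across Caffe versions:

```
layers {
  name: "data"
  type: DATA
  data_param {
    source: "examples/my_leveldb"  # hypothetical input database
    batch_size: 64
  }
  top: "data"
  top: "label"
}
layers {
  name: "hdf5_output"
  type: HDF5_OUTPUT
  hdf5_output_param {
    file_name: "converted.h5"      # hypothetical output file
  }
  bottom: "data"
  bottom: "label"
}
```

Running a forward pass over the whole dataset then streams every batch into the HDF5 file.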

@jay2002 commented Mar 24, 2014

I got it. Thanks @kloudkl @sergeyk

@kloudkl kloudkl closed this Mar 25, 2014
@kloudkl kloudkl mentioned this pull request Mar 25, 2014
@kloudkl (Contributor, Author) commented Mar 25, 2014

Continued in #257.
