
bugfix regarding #100 #103

Merged: 1 commit, Feb 13, 2014

Conversation

Yangqing (Member)

The bugfix for #100: when checking blobs_lr, also check the size of the layer's parameter blobs (blobs().size()): if it is nonzero, we need to do backpropagation.

TODO: maybe add a regression test to rule out future bugs. Also, the Init() function is growing quite big now.
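
A rough, standalone sketch of the check described above (the stub types stand in for Caffe's LayerParameter and Layer; this illustrates the logic, not the merged diff):

#include <cstddef>
#include <vector>

// Illustrative stand-ins for Caffe's LayerParameter and Layer.
struct StubLayerParam { std::vector<float> blobs_lr; };
struct StubLayer { std::vector<int> param_blob_sizes; };

// Decide whether a layer needs backpropagation: if blobs_lr is given,
// backprop when any entry is nonzero; if it is absent but the layer has
// parameter blobs (blobs().size() != 0), default to doing backprop.
bool NeedsBackward(const StubLayerParam& param, const StubLayer& layer) {
  if (!param.blobs_lr.empty()) {
    for (std::size_t j = 0; j < param.blobs_lr.size(); ++j) {
      if (param.blobs_lr[j] != 0.f) return true;
    }
    return false;
  }
  return !layer.param_blob_sizes.empty();
}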

Yangqing merged commit 0b3f9c8 into BVLC:master on Feb 13, 2014
sguada (Contributor) commented Feb 20, 2014

@Yangqing Due to the change in #100 and #103 to the default value of blobs_lr, during test and deploy the network now assumes it needs to do backward propagation (reserving more memory) even when it is not going to do it, unless one sets blobs_lr=0 for all the layers with parameters.

kloudkl (Contributor) commented Feb 21, 2014

Because both the Forward and Backward methods use bottom_vecs_ and top_vecs_, I'm afraid there is no way to save memory.

template <typename Dtype>
const vector<Blob<Dtype>*>& Net<Dtype>::ForwardPrefilled() {
  for (int i = 0; i < layers_.size(); ++i) {
    // LOG(ERROR) << "Forwarding " << layer_names_[i];
    layers_[i]->Forward(bottom_vecs_[i], &top_vecs_[i]);
  }
  return net_output_blobs_;
}

template <typename Dtype>
const vector<Blob<Dtype>*>& Net<Dtype>::Forward(
    const vector<Blob<Dtype>*> & bottom) {
  // Copy bottom to internal bottom
  for (int i = 0; i < bottom.size(); ++i) {
    net_input_blobs_[i]->CopyFrom(*bottom[i]);
  }
  return ForwardPrefilled();
}

template <typename Dtype>
Dtype Net<Dtype>::Backward() {
  Dtype loss = 0;
  for (int i = layers_.size() - 1; i >= 0; --i) {
    if (layer_need_backward_[i]) {
      Dtype layer_loss = layers_[i]->Backward(
          top_vecs_[i], true, &bottom_vecs_[i]);
      loss += layer_loss;
    }
  }
  return loss;
}

Yangqing (Member, Author)

There is no concern about memory consumption as long as you do not invoke backward(). Note that we probably do want backward function calls during deploy time (e.g. gradient as saliency).

All the memory chunks are lazily allocated, which is one of the beauties of caffe: if you don't use the cpu, no cpu memory is allocated; if you don't use the gpu, no gpu memory is allocated; if you don't run backward, no diff is allocated.
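
To make the lazy-allocation point concrete, here is a minimal sketch of the idea; LazyBuffer and SketchBlob are made-up names rather than Caffe's actual classes. A buffer is only materialized the first time it is touched, so a forward-only run never pays for diffs.

#include <cstddef>
#include <iostream>
#include <memory>

// A buffer that costs nothing until the first time it is touched.
class LazyBuffer {
 public:
  explicit LazyBuffer(std::size_t count) : count_(count) {}
  float* mutable_data() {
    if (!data_) {
      data_.reset(new float[count_]());  // first touch: allocate and zero
    }
    return data_.get();
  }
  bool allocated() const { return static_cast<bool>(data_); }
 private:
  std::size_t count_;
  std::unique_ptr<float[]> data_;
};

// A blob keeps separate buffers for data and diff; a forward-only pass
// touches data but never diff, so the diff buffer is never allocated.
struct SketchBlob {
  LazyBuffer data;
  LazyBuffer diff;
  explicit SketchBlob(std::size_t count) : data(count), diff(count) {}
};

int main() {
  SketchBlob blob(1 << 20);
  blob.data.mutable_data();  // "forward": touches data only
  std::cout << "data allocated: " << blob.data.allocated()
            << ", diff allocated: " << blob.diff.allocated() << std::endl;
  return 0;
}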


kloudkl (Contributor) commented Feb 21, 2014

The lazy beauties lie in SyncedMemory::to_cpu and SyncedMemory::to_gpu.
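
For reference, a rough structural sketch of the head-state pattern that to_cpu()/to_gpu() implement; this is simplified, CPU-only, and not the actual Caffe source.

#include <cstddef>
#include <cstdlib>
#include <cstring>

// Memory whose backing storage appears only on first access, tracked by a
// "head" state that records where the freshest copy lives.
class SyncedMemorySketch {
 public:
  enum Head { UNINITIALIZED, HEAD_AT_CPU, HEAD_AT_GPU, SYNCED };

  explicit SyncedMemorySketch(std::size_t size)
      : cpu_ptr_(nullptr), size_(size), head_(UNINITIALIZED) {}
  ~SyncedMemorySketch() { std::free(cpu_ptr_); }

  // Accessors route through to_cpu(), so memory appears only when used.
  const void* cpu_data() { to_cpu(); return cpu_ptr_; }
  Head head() const { return head_; }

 private:
  void to_cpu() {
    if (head_ == UNINITIALIZED) {
      cpu_ptr_ = std::malloc(size_);      // allocated only on first access
      std::memset(cpu_ptr_, 0, size_);
      head_ = HEAD_AT_CPU;
    } else if (head_ == HEAD_AT_GPU) {
      // A real to_cpu() would copy device memory back here and mark SYNCED.
      head_ = SYNCED;
    }
  }
  // A to_gpu() counterpart would mirror this logic for the device pointer.

  void* cpu_ptr_;
  std::size_t size_;
  Head head_;
};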

sguada (Contributor) commented Feb 21, 2014

@Yangqing thanks for the clarification; it seemed to me that it was using more memory, but you are right that it is not.
Do you know how to deallocate all the memory when running the matcaffe wrapper? I always get a core dump when I exit matlab, and I think it is due to that.

rbgirshick (Contributor)

I haven't experienced a core dump when exiting matlab after using the matcaffe wrapper. It might be a good idea to check if the segfault is related to one of the modifications you added and then debug that.


sguada (Contributor) commented Feb 25, 2014

@rbgirshick I have double-checked with the new #132 and don't get the core dump any more. I guess it was probably because my branch was in a mixed state. But if you get any, just let me know.
