
Mini-batch Size vs. Memory Limit #1929

Closed
jimmie33 opened this issue Feb 21, 2015 · 3 comments

@jimmie33

Currently the mini-batch size N is subject to the memory limit. For example, when training a large model I cannot use a large mini-batch size, because my GPU cannot fit N training samples at once.

Is it possible for Caffe to support a mini-batch size that is a multiple of the input data batch size? My understanding is that it only needs to accumulate the gradients over several batches of input data before doing a model update step. Right? Roughly something like the sketch below.
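A framework-agnostic Python sketch of the accumulation idea (not actual Caffe code; the `model` API here is hypothetical):

```python
# Hypothetical sketch of gradient accumulation: several small forward/backward
# passes are summed before a single weight update, so the effective mini-batch
# size is len(batches) * data_batch_size even though only one data batch has
# to fit in GPU memory at a time.
def accumulated_update(model, batches, lr):
    total_grads = None
    for batch in batches:
        grads = model.forward_backward(batch)          # gradients for one small batch
        if total_grads is None:
            total_grads = grads
        else:
            total_grads = [t + g for t, g in zip(total_grads, grads)]
    for param, grad in zip(model.params, total_grads):
        param -= lr * grad / len(batches)              # average, then one update step
```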

Will Caffe support this functionality, or does it already (I am new to Caffe, so I may have missed something)? Or is there some difficulty I overlooked in implementing it?

@shelhamer
Member

Already done in #1663! Now that the latest release is out it'll be merged once we double-check the details.

@jimmie33
Author

@shelhamer Thanks for the information. This is great!

Is there now a way to control the data batch size and the mini-batch size separately, based on the new gradient accumulation implementation? I think this needs an extra parameter in the proto files, and it also requires some changes in the solver, right? Have these been done already, or will they be done soon?

@rohrbach rohrbach added the ES label Feb 22, 2015
@shelhamer
Member

It's already there in #1663 -- now #1977. It's the iter_size setting in the solver config.
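A minimal solver.prototxt sketch (the net path and hyperparameter values are placeholders, not taken from this thread):

```
net: "models/train_val.prototxt"   # placeholder path to the net definition
base_lr: 0.01
momentum: 0.9
max_iter: 100000
solver_mode: GPU
# Accumulate gradients over 4 forward/backward passes before each weight
# update; with batch_size: 64 in the data layer this gives an effective
# mini-batch of 256 while only 64 samples are resident on the GPU at once.
iter_size: 4
```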
