Possible performance drop in -dev branch #1006

Closed
ozancaglayan opened this issue Aug 29, 2014 · 4 comments
Comments

@ozancaglayan (Contributor)

Hi,

I was testing several BLAS implementations to compare their performance. I'm using the MNIST dataset as described in its tutorial, but with max_iter set to 1000.

I just discovered that training (using train_lenet.sh) is significantly slower than on the master branch. I tested on two different machines; the results below are from an Intel Xeon W3530 (Nehalem) CPU, training in CPU mode. Is this an expected slow-down caused by some implementation change?
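
For reference, the steps were roughly the standard MNIST tutorial ones (a sketch; paths assume a stock Caffe checkout):

    # prepare the MNIST data as in the tutorial
    ./data/mnist/get_mnist.sh
    ./examples/mnist/create_mnist.sh
    # edit examples/mnist/lenet_solver.prototxt: max_iter: 1000, solver_mode: CPU
    # then time the training run on each branch
    time ./examples/mnist/train_lenet.sh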

atlas-sse3 - fedora 19 x86_64 (dev branch)
-------------------------------------------------------
I0828 17:59:20.025907 20321 solver.cpp:302]     Test net output #0: accuracy = 0.9788
I0828 17:59:20.025959 20321 solver.cpp:302]     Test net output #1: loss = 0.0642497 (* 1 = 0.0642497 loss)
I0828 17:59:20.025979 20321 solver.cpp:237] Optimization Done.
I0828 17:59:20.025987 20321 caffe.cpp:113] Optimization Done.

real    6m11.887s
user    6m31.207s
sys     0m1.324s

atlas-sse3 - fedora 19 x86_64 (master branch)
-----------------------------------------------------------
I0828 18:06:28.892992 11738 solver.cpp:270] Test score #0: 0.9776
I0828 18:06:28.893049 11738 solver.cpp:270] Test score #1: 0.0670089
I0828 18:06:28.893060 11738 solver.cpp:218] Optimization Done.
I0828 18:06:28.893131 11738 caffe.cpp:113] Optimization Done.

real    4m6.125s
user    4m5.772s
sys     0m0.140s
@kloudkl (Contributor) commented Aug 31, 2014

Try the fixes in #1008, time the other examples, or benchmark the network with the "caffe time" tool to find the layer-wise speed gaps.
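
For example, something along these lines should print per-layer forward/backward timings on the LeNet training net (a sketch; the model path assumes the stock MNIST example):

    # run from the Caffe root; compare the per-layer lines on master vs. dev
    ./build/tools/caffe time \
        --model=examples/mnist/lenet_train_test.prototxt \
        --iterations=50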

@shelhamer (Member)

I can't reproduce this; in fact, my dev build is faster than master according to caffe time. Are you sure there weren't interfering tasks increasing CPU or disk load during the dev test?

@shelhamer (Member)

It could be the switch from inline data transformation to the DataTransformer in dev, since you are running purely on the CPU. You could try your evaluation at commit aee9cd3, before it was merged in #954, to test whether it makes a difference.
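
For example, roughly (a sketch; assumes the Makefile build and that the MNIST data is already prepared):

    # check out dev as it was just before the DataTransformer merge (#954)
    git checkout aee9cd3
    make clean && make -j8
    # re-run the same CPU benchmark as before
    time ./examples/mnist/train_lenet.sh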

@longjon (Contributor) commented May 8, 2015

Closing as this seems to have expired; we don't know of any current relevant performance issue.

longjon closed this as completed May 8, 2015