update MKLDNN cmakes #3168

tensor-tang · 2017-08-02T08:12:46Z

move MKLDNN and MKLML install path to build third party path
disable both when build doc
disable both on MacOS and Win32, not supported yet
give up hard code building MLKDNN

and disable both when build doc and MacOS

luotao1 · 2017-08-02T09:41:48Z

I test the changes on my machine, but there are errors as follows:

/home/luotao02/Paddle/third_party/mkldnn/src/extern_mkldnn/src/cpu/gemm_convolution.cpp:26:23: fatal error: mkl_cblas.h: No such file or directory
 #include "mkl_cblas.h"
                       ^
compilation terminated.

The full log is log.txt

And you forget to change the CMakeLists: use ${AVX_FOUND} to switch the ON/OFF of mkldnn and mklml.

option(WITH_MKLDNN      "Compile PaddlePaddle with mkl-dnn support."    ${AVX_FOUND})

tensor-tang · 2017-08-02T11:34:32Z

And you forget to change the CMakeLists: use ${AVX_FOUND} to switch the ON/OFF of mkldnn and mklml.

Actually, I did not forget about that. I thought you would prefer default OFF just like last time.
I can use ${AVX_FOUND} instead, will update later. Anyway, thanks for your reminder.

About the error, I check that ASAP, never shown on my machine before.

luotao1 · 2017-08-02T14:01:22Z

The CMake command result is CMakeCache.txt

tensor-tang · 2017-08-02T16:34:22Z

The CMakeCache.txt make sense to me.
Maybe we need to check your local env carefully, some environment variable must be loaded before use MKLDNN, and that impact on the missing files.

tensor-tang · 2017-08-03T01:35:21Z

The last commit failed with Teamcity shown with Cuda Error: out of memory.

The following tests FAILED:
[20:57:53] : [Step 1/1] 34 - test_matrixCompare (OTHER_FAULT)
[20:57:53]W: [Step 1/1] Errors while running CTest

Not actually this commit caused this failing.

luotao1 · 2017-08-03T03:34:00Z

@helinwang test_matrixCompare, test_NetworkCompare and test_CompareSparse fail occasionally, maybe we should refine the unittest to decrease the memory.

tensor-tang · 2017-08-03T04:17:03Z

Still failed with GPU, this time is about the accuracy.
Please help to reset the unit test, thanks~

The last test passed.

test_matrixCompare .................. Passed 259.42 sec

While failed with test_LayerGrad with the accuracy.

The following tests FAILED:
[03:48:30] : [Step 1/1] 46 - test_LayerGrad (Failed)

layer_type=recurrent useGpu=1
[03:44:52] : [Step 1/1] I8030 30:41:55.718948 27745 LayerGradUtil.cpp:703] cost 65.9891
[03:44:52] : [Step 1/1] I8030 30:41:55.758448 27745 LayerGradUtil.cpp:43] recurrent layer_0 step=1e-06 cost1=66.3733 cost2=65.6477 true_delta=0.725571 analytic_delta=0.756912 diff=-0.0414063 ***
[03:44:52] : [Step 1/1] /paddle/paddle/gserver/tests/LayerGradUtil.cpp:752: Failure
[03:44:52] : [Step 1/1] Expected: (fabs(maxDiff)) <= (epsilon), actual: 0.0414063 vs 0.02

helinwang · 2017-08-03T04:44:21Z

@luotao1 @tensor-tang Sorry about the out of memory problem, I will take a look.

luotao1

LGTM.
Since I create a clean /build directory, I can compile and test successfully.

move MKLDNN and MKLML install path to build third party path

1bd64f1

and disable both when build doc and MacOS

change default option for MKLDNN and MKLML

4dd89e8

add meesage and cmake cache arg

e6f62f7

luotao1 approved these changes Aug 3, 2017

View reviewed changes

luotao1 merged commit ca39600 into PaddlePaddle:develop Aug 3, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

update MKLDNN cmakes #3168

update MKLDNN cmakes #3168

tensor-tang commented Aug 2, 2017 •

edited by luotao1

Loading

luotao1 commented Aug 2, 2017 •

edited

Loading

tensor-tang commented Aug 2, 2017 •

edited

Loading

luotao1 commented Aug 2, 2017

tensor-tang commented Aug 2, 2017

tensor-tang commented Aug 3, 2017

luotao1 commented Aug 3, 2017

tensor-tang commented Aug 3, 2017

helinwang commented Aug 3, 2017

luotao1 left a comment

update MKLDNN cmakes #3168

update MKLDNN cmakes #3168

Conversation

tensor-tang commented Aug 2, 2017 • edited by luotao1 Loading

luotao1 commented Aug 2, 2017 • edited Loading

tensor-tang commented Aug 2, 2017 • edited Loading

luotao1 commented Aug 2, 2017

tensor-tang commented Aug 2, 2017

tensor-tang commented Aug 3, 2017

luotao1 commented Aug 3, 2017

tensor-tang commented Aug 3, 2017

helinwang commented Aug 3, 2017

luotao1 left a comment

Choose a reason for hiding this comment

tensor-tang commented Aug 2, 2017 •

edited by luotao1

Loading

luotao1 commented Aug 2, 2017 •

edited

Loading

tensor-tang commented Aug 2, 2017 •

edited

Loading