
create new api to indicate detect thread usage #18081

Closed
wants to merge 9 commits

Conversation

LeoZhao-Intel
Contributor

LeoZhao-Intel commented Jun 13, 2019

This PR solves two issues:

  1. When Predictor.run is called from a changing thread, it causes a memory leak because the thread id is inserted into the cache key, so the key is different on every call.
  2. For detection models, input dims are dynamic rather than fixed, which causes conv/pool/concat MKL-DNN ops to leak memory because the key is different each time.

The fix is to disable the cache in these cases: we extend EnableMKLDNN in AnalysisConfig with a parameter that controls whether the cache is used.

Related: #17611

Contributor

luotao1 left a comment


Please add the description of this PR.

File-level review threads (all outdated and resolved):
- paddle/fluid/platform/device_context.cc
- paddle/fluid/inference/api/paddle_analysis_config.h (two threads)
- paddle/fluid/inference/api/analysis_config.cc
@LeoZhao-Intel
Contributor Author

@jczaja @jianhang-liu


@luotao1
Contributor

luotao1 commented Jun 15, 2019

[19:20:47]	[100%] Linking CXX executable test_analyzer_small_dam
[19:20:48]	../../../../../third_party/install/ngraph/lib/libngraph.so: file not recognized: File truncated
[19:20:48]	collect2: error: ld returned 1 exit status
[19:20:48]	make[2]: *** [paddle/fluid/inference/tests/api/test_analyzer_mobilenet_depthwise_conv] Error 1
[19:20:48]	paddle/fluid/inference/tests/api/CMakeFiles/test_analyzer_mobilenet_depthwise_conv.dir/build.make:641: recipe for target 'paddle/fluid/inference/tests/api/test_analyzer_mobilenet_depthwise_conv' failed
[19:20:48]	make[1]: *** [paddle/fluid/inference/tests/api/CMakeFiles/test_analyzer_mobilenet_depthwise_conv.dir/all] Error 2
[19:20:48]	CMakeFiles/Makefile2:108240: recipe for target 'paddle/fluid/inference/tests/api/CMakeFiles/test_analyzer_mobilenet_depthwise_conv.dir/all' failed
[19:20:48]	make[1]: *** Waiting for unfinished jobs....

http://ci.paddlepaddle.org/viewLog.html?buildId=115069&buildTypeId=Paddle_PrCi&tab=buildLog&_focus=7893

Contributor

jczaja left a comment


These are very core changes. Have you run extensive performance tests to check that inference of our models (the C-API tests of our validation) is unharmed?

@LeoZhao-Intel
Contributor Author

LeoZhao-Intel commented Jun 18, 2019

This is just a workaround (WA) to avoid the memory leak caused by dynamic shapes and by run being called from different threads, and it does harm performance; MKL-DNN cache reuse improves performance considerably.

Perf data with reuse enabled:
(image attached in original comment)

Perf data after disabling the cache:
(image attached in original comment)

@jczaja
Contributor

jczaja commented Jun 18, 2019

@LeoZhao-Intel I just want to make sure that reuse is disabled only in situations where we expect it to be disabled. I will copy this PR branch to the internal repo and run the automatic tests; results should be available tonight.

@LeoZhao-Intel
Contributor Author

LeoZhao-Intel commented Jun 18, 2019

It should be able to pass the CI tests, but it will fail if EnableMKLDNN(1) is called in the multiple-instances case and SetMkldnnthreadid() is not set properly, so as I said before, it is just a workaround. The whole stack, including the API, needs to be refactored.

@jczaja
Contributor

jczaja commented Jun 18, 2019

@LeoZhao-Intel OK, the performance (C-API) tests passed, i.e. no visible impact. My understanding is that you are still working on this PR?

@LeoZhao-Intel
Contributor Author

LeoZhao-Intel commented Jun 18, 2019

This PR is going to be dropped since it impacts performance too much and cannot reach the target, but some of its requirements may be carried into the formal solution, e.g. supporting the cache-disabled case and using an API to indicate the scenario.

@jianhang-liu
Contributor

Thanks Leo for the big effort on this PR. The investigation in this PR has given us much insight into this issue.

@jianhang-liu
Contributor

Closed, but some code will be cloned into a new PR from Leo, similar to PR #18217.
