
Fix perf gap in thread safe prediction #6696

Merged: 2 commits merged into dmlc:master from ShvetsKS:fix_perf_in_thread_safe_predict on Feb 16, 2021

Conversation

ShvetsKS
Contributor

@ShvetsKS commented Feb 9, 2021

Since local RegTree::FVec storage was introduced in #6648 to preserve thread safety, each training iteration now requires buffer initialization in the subsampling case.

| stage (santander dataset) | before #6648 | thread-safe #6648 and master | current PR |
| --- | --- | --- | --- |
| full training | 136s | 165s | 140s |
| PredictRaw | 55s | 78s | 59s |

This PR introduces threading for the local buffer initialization (a sketch of the idea follows below).
But since some gaps still remain, is it possible to have a non-thread-safe PredictDMatrix call during training?
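
For reference, a minimal sketch of the threaded buffer initialization idea. It is assumption-heavy: `FVec` below is a stand-in for `xgboost::RegTree::FVec`, and `InitThreadTemp` is a hypothetical helper, not the actual XGBoost API.

```cpp
#include <cstddef>
#include <limits>
#include <vector>

struct FVec {  // dense feature buffer, one per prediction thread
  std::vector<float> data;
  void Init(std::size_t n_features) {
    // Mark every feature as "missing" until a row is loaded into the buffer.
    data.assign(n_features, std::numeric_limits<float>::quiet_NaN());
  }
};

// Initialize one buffer per thread in parallel, so the per-iteration setup cost
// required by subsampling no longer runs on a single thread.
void InitThreadTemp(int n_threads, std::size_t n_features, std::vector<FVec>* out) {
  out->resize(n_threads);
#pragma omp parallel for schedule(static)
  for (int i = 0; i < n_threads; ++i) {
    (*out)[i].Init(n_features);
  }
}
```

With the per-thread buffers filled in parallel, the setup cost scales with the number of threads instead of running serially before every prediction pass.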

@ShvetsKS changed the title Fix perf issue in thread safe prediction Fix perf gap in thread safe prediction Feb 9, 2021
@trivialfis
Member

I thought we had the prediction cache enabled?

@ShvetsKS
Contributor Author

ShvetsKS commented Feb 9, 2021

> I thought we had the prediction cache enabled?

For the subsampling case we still don't have prediction caching (it seems to still be in progress: #6683; the major complexity is related to the multiclass classification case, since each group generates its own subset of indices).
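
Purely as an illustration of that complexity (this is not XGBoost source; `SampleRowsPerGroup` and the Bernoulli row sampling are assumptions), each class group drawing its own row subset means a single prediction cache cannot be invalidated uniformly across groups:

```cpp
#include <cstddef>
#include <cstdint>
#include <random>
#include <vector>

std::vector<std::vector<std::size_t>> SampleRowsPerGroup(
    std::size_t n_rows, int n_groups, double subsample, std::uint64_t seed) {
  std::vector<std::vector<std::size_t>> groups(n_groups);
  std::mt19937_64 rng(seed);
  std::bernoulli_distribution keep(subsample);
  for (int g = 0; g < n_groups; ++g) {
    for (std::size_t r = 0; r < n_rows; ++r) {
      // Each group keeps its own independent subset of rows, so a cached
      // prediction for row r can be current for one group and stale for another.
      if (keep(rng)) groups[g].push_back(r);
    }
  }
  return groups;
}
```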

@trivialfis
Member

trivialfis commented Feb 10, 2021

@ShvetsKS Would you like to take a look at the GPU implementation of subsampling, or at my WIP rewrite of CPU hist? Subsampling doesn't have to conflict with the cache.

@trivialfis
Member

Last time you found it to be slower than the master branch, which is expected as I don't have time to incorporate many recent optimizations into it. But I believe my rewrite can at least offer some insight on how to implement some of the features in a different way.

@ShvetsKS
Contributor Author

> Last time you found it to be slower than the master branch, which is expected as I don't have time to incorporate many recent optimizations into it. But I believe my rewrite can at least offer some insight on how to implement some of the features in a different way.

The current changes are applicable even to the inference stage, as it's better to initialize RegTree::FVec thread-locally anyway.

@ShvetsKS force-pushed the fix_perf_in_thread_safe_predict branch 2 times, most recently from 4e8ed8c to 198747f on February 10, 2021 09:40
@codecov-io

codecov-io commented Feb 10, 2021

Codecov Report

Merging #6696 (198747f) into master (9b267a4) will not change coverage.
The diff coverage is n/a.


@@           Coverage Diff           @@
##           master    #6696   +/-   ##
=======================================
  Coverage   81.56%   81.56%           
=======================================
  Files          13       13           
  Lines        3759     3759           
=======================================
  Hits         3066     3066           
  Misses        693      693           

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 9b267a4...198747f.

Shvets Kirill added 2 commits February 10, 2021 02:59
@ShvetsKS force-pushed the fix_perf_in_thread_safe_predict branch from 198747f to 1d4e4d1 on February 10, 2021 12:02
@ShvetsKS
Contributor Author

The problem with continuous-integration/travis-ci/pr is related to the server node (`await ServerNode.close(self)`); it seems restarting should help.

@trivialfis
Member

Sorry for the long wait. Still on vacation, but I will try to look into it as soon as possible.

@trivialfis merged commit 9f15b9e into dmlc:master Feb 16, 2021