
Benchmarks to track performance changes in 'hist' method #5126

Status: Closed · wants to merge 1 commit

Conversation

@SmirnovEgorRu (Contributor)

This is PR #2 from issue #5104.
These benchmarks are needed to measure the impact of the optimizations. I'm planning to use them for all further PRs.
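For context, a rough sketch of what such a benchmark script can look like; the synthetic dataset and parameters below are illustrative assumptions, not the actual loaders or settings from this PR:

```python
import time

import numpy as np
import xgboost as xgb

# Synthetic stand-in for the real datasets (higgs1m, airline-ohe, msrank-10k);
# the actual benchmarks load the real datasets.
rng = np.random.default_rng(seed=0)
X = rng.standard_normal((100_000, 28))
y = (X[:, 0] + rng.standard_normal(100_000) > 0).astype(np.int8)

dtrain = xgb.DMatrix(X, label=y)
params = {
    "tree_method": "hist",  # the method whose kernels are being tracked
    "max_depth": 8,
    "objective": "binary:logistic",
}

start = time.perf_counter()
xgb.train(params, dtrain, num_boost_round=100)
print(f"Total training time: {time.perf_counter() - start:.2f} s")
```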

@trivialfis (Member) commented Dec 16, 2019

@RAMitchell We have staged many benchmarking scripts in external projects, and I also have a collection of them for dask. I'm open to having some of these XGBoost-specific scripts maintained in one place. WDYT?

@SmirnovEgorRu (Contributor, Author)

Per-kernel times after reverting the optimizations, as collected by these benchmarks (all times in seconds):

| Data set    | ApplySplit | EvaluateSplit | BuildHist | SyncHistogram | Prediction | Total |
|-------------|-----------:|--------------:|----------:|--------------:|-----------:|------:|
| higgs1m     | 36         | 62            | 156       | 186           | 3          | 446   |
| airline-ohe | 30         | 46            | 77        | 126           | 2          | 303   |
| msrank-10k  | 162        | 244           | 1366      | 836           | 50         | 2680  |

Per-kernel times before reverting the optimizations (all times in seconds):

| Data set    | ApplySplit | EvaluateSplit | BuildHist | SyncHistogram | Prediction | Total |
|-------------|-----------:|--------------:|----------:|--------------:|-----------:|------:|
| higgs1m     | 3.7        | 3.5           | 6.2       | 0             | 1.6        | 17.7  |
| airline-ohe | 9.0        | 6.1           | 28.8      | 0             | 0.7        | 64    |
| msrank-10k  | 15.5       | 52.6          | 66.6      | 0             | 47.6       | 197   |

Hardware: AWS c5.metal instance.
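For anyone trying to reproduce a per-kernel breakdown like the tables above: a minimal sketch, assuming a recent XGBoost build where the `hist` updater's internal timing monitor logs kernel timings (e.g. `BuildHist`, `EvaluateSplit`) at debug verbosity:

```python
import numpy as np
import xgboost as xgb

# Small synthetic dataset, just to exercise the 'hist' updater.
rng = np.random.default_rng(seed=0)
X = rng.standard_normal((10_000, 20))
y = (X[:, 0] > 0).astype(np.int8)
dtrain = xgb.DMatrix(X, label=y)

params = {
    "tree_method": "hist",
    "objective": "binary:logistic",
    "verbosity": 3,  # debug level; the updater's monitor prints per-kernel timings
}
# The timing lines printed to the console can then be parsed into a
# breakdown like the tables above.
xgb.train(params, dtrain, num_boost_round=50)
```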

@RAMitchell (Member)

As this becomes more sophisticated, it raises the question: should this code live inside the main XGBoost repo? It has no dependency on the XGBoost source code, only on some installed version of XGBoost. We could just as easily run it via our CI from a separate repo.

Also, how is this different from https://github.com/NVIDIA/gbm-bench? Would you get the information you need by running that? Maybe we need a more neutrally hosted version of gbm-bench.

@RAMitchell (Member)

Also, one of the problems with previous optimisations was that they caused a performance regression in the distributed algorithm by increasing the number of rabit calls. To catch this, we could run experiments with dask.
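For illustration, a distributed experiment along these lines could be sketched with the `xgboost.dask` interface that landed shortly after this discussion; the cluster size and parameters here are assumptions, and a real test would use a multi-node cluster:

```python
from dask import array as da
from dask.distributed import Client, LocalCluster
from xgboost import dask as dxgb

# Local stand-in for a real multi-node cluster; regressions caused by extra
# rabit (allreduce) calls should become more visible as workers are added.
client = Client(LocalCluster(n_workers=4, threads_per_worker=2))

# Synthetic data, partitioned across workers.
X = da.random.random((1_000_000, 28), chunks=(100_000, 28))
y = (X[:, 0] > 0.5).astype("int8")

dtrain = dxgb.DaskDMatrix(client, X, y)
output = dxgb.train(
    client,
    {"tree_method": "hist", "objective": "binary:logistic"},
    dtrain,
    num_boost_round=100,
)
booster = output["booster"]
```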

@trivialfis (Member)

@hcho3

@hcho3 (Collaborator) commented Dec 17, 2019

@RAMitchell We can probably combine NVIDIA/gbm-bench and this pull request. For now, let's just benchmark XGBoost and not worry about other libraries (LightGBM, CatBoost, etc.). And as you mentioned, we should definitely test distributed training.

@tqchen Can I have admin rights over https://github.com/dmlc/xgboost-bench? It seems perfect for hosting the benchmark scripts.

@hcho3 (Collaborator) commented Dec 19, 2019

@dmlc/xgboost-committer https://github.com/dmlc/xgboost-bench is now public. All committers of XGBoost should have push access to it.

@hcho3 (Collaborator) commented Dec 19, 2019

Closing this PR now. I will move this PR's code to https://github.com/dmlc/xgboost-bench.

@hcho3 closed this Dec 19, 2019
@hcho3 (Collaborator) commented Dec 19, 2019

@SmirnovEgorRu I moved your benchmark code to xgboost-bench repo: dmlc/xgboost-bench@c787a59

The lock bot locked this as resolved and limited conversation to collaborators on Mar 21, 2020.