Colsample performance when using tree_method=hist #7002

karunrao97 · 2021-05-26T14:25:32Z

In general, I would expect usage of colsample_by* parameters to improve training speed (per tree), since we do not need to consider all features when evaluating splits. For my use case, however, I do not observe this.

Using tree_method=hist and grow_policy=depthwise, I see that most of the time is spent building histograms: I profiled QuantileHistMaker::Builder::ExpandWithDepthWise and saw that almost all of the time goes to BuildLocalHistograms. That step actually runs before the feature sets are sampled for each node, which happens at line 996 of xgboost/src/tree/updater_quantile_hist.cc (commit 522b897):

features_sets[nid_in_set] = column_sampler_.GetFeatureSet(tree.GetDepth(nid));

Could performance be improved by sampling the features prior to building the histograms instead, since we then do not need to compute histograms for the unused features? If so, can we please include this as a feature request?
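To make the request concrete, here is a minimal sketch of the idea, not XGBoost's actual code. Every name in it (BinMatrix, Histogram, BuildHistogram) is hypothetical, and it assumes a simplified dense layout in which each row stores one pre-quantized bin index per feature. It ignores everything else the real updater does and only shows that, if the feature set is known before the histogram pass, the number of bin updates scales with the number of sampled features rather than with all features.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical, simplified stand-ins; none of these names come from XGBoost.
// bins[row][feature] holds the pre-quantized bin index of one value.
using BinMatrix = std::vector<std::vector<std::uint32_t>>;
// hist[feature][bin] holds the gradient sum accumulated for that bin.
using Histogram = std::vector<std::vector<double>>;

// Accumulate gradients into per-feature histograms, visiting only the
// features listed in `features`. Returns the number of bin updates performed,
// used here as a rough proxy for histogram-building cost.
std::size_t BuildHistogram(const BinMatrix& bins,
                           const std::vector<double>& grad,
                           const std::vector<std::size_t>& features,
                           std::size_t n_bins, Histogram* hist) {
  assert(bins.size() == grad.size());
  const std::size_t n_features = bins.empty() ? 0 : bins[0].size();
  // Unsampled features keep all-zero histograms and are never touched.
  hist->assign(n_features, std::vector<double>(n_bins, 0.0));
  std::size_t updates = 0;
  for (std::size_t r = 0; r < bins.size(); ++r) {
    for (std::size_t f : features) {
      assert(f < n_features && bins[r][f] < n_bins);
      (*hist)[f][bins[r][f]] += grad[r];
      ++updates;
    }
  }
  return updates;
}
```

Calling BuildHistogram with the full feature list reproduces the current order of operations (build everything, sample afterwards), while calling it with only the sampled features does proportionally less work and yields the same histograms for the features that split evaluation will actually look at. Whether this carries over to the real memory layout and to per-node sampling is exactly the question raised above.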

Denisevi4 · 2021-07-22T16:31:37Z

I'm surprised this hasn't gotten more attention. It seems like an easy win, since people use colsample a lot.

exalate-issue-sync bot mentioned this issue May 11, 2023

Check sampling aliases are set correctly in H2O XGBoost h2oai/h2o-3#8458

Closed
