Revert ntree limit fix #6616
Conversation
```diff
-best_ntree_limit=str(
-    (bst.best_iteration + 1) * num_parallel_tree * num_groups
-)
+best_ntree_limit=str((bst.best_iteration + 1) * num_parallel_tree)
```
It's not an exact revert, since we also keep the fix for an old `gblinear` bug with `ntree_limit`, which is valid.
So the `best_ntree_limit` attribute is really a misnomer; it behaves more like `best_iteration` in the C++ layer. Is my understanding correct?
The C++ layer considers `num_group`, as it's a model parameter, and the predictor's `PredictBatch` takes `GBTreeModel` as an argument. But it doesn't consider `num_parallel_tree`, which is a training parameter rather than a model parameter. So the C++ handling of `ntree_limit` is only half an implementation of `best_iteration`.
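A toy sketch of the mismatch described above (plain Python, hypothetical helper names; it assumes, per this comment, that the C++ layer scales the limit by `num_group` but not by `num_parallel_tree`):

```python
def cpp_effective_trees(ntree_limit, num_group):
    # Sketch of the described C++ behavior: the limit is scaled by
    # num_group (a model parameter the predictor can see) only.
    return ntree_limit * num_group


def trees_trained(num_rounds, num_parallel_tree, num_group):
    # Each boosting round actually produces
    # num_parallel_tree * num_group trees.
    return num_rounds * num_parallel_tree * num_group


# With num_parallel_tree > 1, passing a round count as ntree_limit
# undercounts the trees those rounds really produced.
```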
So it's `best_iteration * num_parallel_tree` then? We should clearly document the meaning of `best_ntree_limit`, e.g.:

> Despite its name, the `best_ntree_limit` attribute is actually the product `best_iteration * num_parallel_tree`. If you set `num_parallel_tree > 1`, then `best_ntree_limit` won't be equal to the best boosting round. Going forward, please use model slicing in all new code.
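The identity being proposed for the docs can be sketched as follows (hypothetical helper, plain Python; the `+ 1` reflects that `best_iteration` is zero-based, matching the reverted line in the diff above):

```python
def best_ntree_limit(best_iteration: int, num_parallel_tree: int) -> int:
    # Despite its name, best_ntree_limit is not a boosting-round count:
    # it is the number of trees per class up to and including the best
    # iteration, i.e. (best_iteration + 1) * num_parallel_tree.
    return (best_iteration + 1) * num_parallel_tree
```

With `num_parallel_tree == 1` the value happens to line up with the number of boosting rounds, which is why the misnomer goes unnoticed in the common case.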
Just to clarify and reinforce: `best_ntree_limit` equals `num_parallel_tree * best_iteration`; `num_class` is multiplied in inside the predictor. When using a classifier, `best_ntree_limit` equals the number of trees produced up to the best iteration divided by `num_class`.
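The classifier arithmetic in the comment above can be written out as a small sketch (plain Python, hypothetical helper names):

```python
def trees_up_to_best(best_iteration, num_parallel_tree, num_class):
    # Every boosting round emits num_parallel_tree trees per class.
    return (best_iteration + 1) * num_parallel_tree * num_class


def best_ntree_limit(best_iteration, num_parallel_tree, num_class):
    # As described above: the total tree count divided by num_class,
    # since the predictor multiplies num_class back in internally.
    return trees_up_to_best(best_iteration, num_parallel_tree,
                            num_class) // num_class
```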
The inplace prediction doesn't have this problem, as it can use `best_iteration` directly.
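For contrast, a minimal sketch of why iteration-based prediction composes cleanly: an iteration range maps to a contiguous block of trees regardless of how many trees each round emits (helper name is hypothetical):

```python
def iteration_to_tree_range(begin_it, end_it, num_parallel_tree, num_class):
    # Trees are laid out round by round; each round contributes
    # num_parallel_tree * num_class trees, so an iteration range is
    # simply scaled by that per-round factor.
    per_round = num_parallel_tree * num_class
    return begin_it * per_round, end_it * per_round
```

Because the scaling happens in one place, callers never need to know `num_parallel_tree` or `num_class` themselves.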
@hcho3 I will try to deprecate the attribute after turning it into a Python `@property`, where we can raise proper warnings.
Also, the sklearn model uses this attribute by default when running prediction.
Added a short note to the `train` function.
The old (pre-fix) `best_ntree_limit` ignored the `num_class` parameter, which is incorrect. Previously we worked around this in the C++ layer to avoid possible breaking changes in other language bindings, but the Python interpretation stayed incorrect. The PR fixed Python to account for `num_class` but didn't remove the old workaround, so the tree calculation in the predictor is incorrect; see `PredictBatch` in `CPUPredictor`.
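The double counting described above can be shown with a toy sketch (plain Python, hypothetical helper names; it assumes, per this comment, that the old C++ workaround also scales the limit by `num_class`):

```python
def python_side_limit(best_iteration, num_parallel_tree, num_class):
    # After the fix, the Python side folded num_class into the attribute.
    return (best_iteration + 1) * num_parallel_tree * num_class


def predictor_trees_used(ntree_limit, num_class):
    # Toy model of the old C++ workaround, which scales by num_class too.
    return ntree_limit * num_class


# num_class is applied twice, so the predictor walks too many trees:
# intended 30 trees for 10 rounds and 3 classes, but 90 are used.
```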
Closes #6615.