Added SKL-like random forest Python API. #4148

canonizer · 2019-02-15T20:13:03Z

added XGBRFClassifier and XGBRFRegressor classes to Scikit-Learn-like xgboost API
added documentation describing how to use xgboost for random forests,
as well as existing caveats

- added XGBRFClassifier and XGBRFRegressor classes to SKL-like xgboost API - also added n_gpus and gpu_id parameters to SKL classes - added documentation describing how to use xgboost for random forests, as well as existing caveats

- introduced new parameters to XGBRanker - also passing **kwargs further in XGBRanker.__init__

RAMitchell

Because many of our internal parameters can be subject to change I do not like to bake them all into the Python API. Parameters such as 'gpu_id' can be passed as kwargs. I don't think this PR should require any changes to the actual function arguments.

As mentioned before, I do not like the use of self.mode to change behaviour, I think this should be handled via polymorphism.

- removed the n_gpus and gpu_id parameters - removed the mode parameter, using subclass methods instead - small fixes

canonizer · 2019-02-25T15:10:42Z

I've removed n_gpus and gpu_id. However, I've left tree_method and colsample_bynode. Please indicate if you want me to remove those two (note that other colsample_by* parameters are already part of SKL-like classes).

I've also removed self.mode, and now handle it using subclass methods.

RAMitchell · 2019-02-25T18:23:28Z

@trivialfis can I get your review.

@canonizer xgboost's default tree_method is currently 'exact'. This PR would change the sklearn interface to use 'hist' as the default. This would be a huge change for our user base. Arguably we should in fact make this change, but that is a big decision and that we need some internal discussion on.

trivialfis · 2019-02-25T22:56:11Z

@RAMitchell Will review shortly.

doc/rf.rst

python-package/xgboost/sklearn.py

trivialfis · 2019-03-08T16:03:01Z

@RAMitchell @hcho3 WDYT?

RAMitchell

I think we can't remove the 'silent' parameter yet. Given that this is a public API we should continue to support the parameter alongside 'verbosity' and issue a deprecation warning for at least a few subsequent releases.

Have you manually checked the python documentation @canonizer? It would be good to check if this formats correctly on your local machine.

I am also wondering if we can somehow mark the RF extension to the API as experimental to give ourselves some room to modify it in the near future. @hcho3 WDYT?

hcho3

LGTM. Yes, let’s mark them experimental so that we can fix API design as necessary.

trivialfis · 2019-03-12T14:26:28Z

I will make a separated PR for marking this experimental later. Let's merge it now. @canonizer @RAMitchell @hcho3 .

canonizer · 2019-03-12T19:56:22Z

#4255 brings back the silent parameter and marks it deprecated.

canonizer and others added 6 commits February 15, 2019 21:05

Added SKL-like random forest Python API.

49aac63

- added XGBRFClassifier and XGBRFRegressor classes to SKL-like xgboost API - also added n_gpus and gpu_id parameters to SKL classes - added documentation describing how to use xgboost for random forests, as well as existing caveats

Merge branch 'upstream-master' into fea-ext-rf-api

1012579

Merge branch 'upstream-master' into fea-ext-rf-api

f2f3fe2

Fixed tests for SKL-like API.

9a14c58

- introduced new parameters to XGBRanker - also passing **kwargs further in XGBRanker.__init__

Merge branch 'upstream-master' into fea-ext-rf-api

c9e6b76

Add XGBRFClassifier, XGBRFRegressor to dir(xgboost)

bb46eb6

RAMitchell reviewed Feb 24, 2019

View reviewed changes

canonizer added 2 commits February 25, 2019 16:04

Addressed review comments, small fixes.

e3769d5

- removed the n_gpus and gpu_id parameters - removed the mode parameter, using subclass methods instead - small fixes

Merge branch 'upstream-master' into fea-ext-rf-api

cb94423

trivialfis requested changes Feb 26, 2019

View reviewed changes

canonizer added 3 commits March 6, 2019 17:51

Merge branch 'upstream-master' into fea-ext-rf-api

b4a0fe0

Addressed review comments.

5e45f31

Merge branch 'upstream-master' into fea-ext-rf-api

9457fd6

trivialfis approved these changes Mar 8, 2019

View reviewed changes

RAMitchell reviewed Mar 8, 2019

View reviewed changes

hcho3 approved these changes Mar 9, 2019

View reviewed changes

trivialfis merged commit a36c3ed into dmlc:master Mar 12, 2019

trivialfis mentioned this pull request Mar 12, 2019

Mark Random forest APIs as beta. #4253

Closed

hcho3 mentioned this pull request Apr 21, 2019

XGBoost 0.90 Roadmap #4389

Closed

18 tasks

hcho3 mentioned this pull request May 17, 2019

[RFC] Version 0.90 release candidate #4475

Merged

lock bot locked as resolved and limited conversation to collaborators Jun 10, 2019

mtjrider deleted the fea-ext-rf-api branch August 19, 2019 17:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added SKL-like random forest Python API. #4148

Added SKL-like random forest Python API. #4148

canonizer commented Feb 15, 2019 •

edited

Loading

RAMitchell left a comment

canonizer commented Feb 25, 2019

RAMitchell commented Feb 25, 2019

trivialfis commented Feb 25, 2019

trivialfis commented Mar 8, 2019

RAMitchell left a comment

hcho3 left a comment •

edited

Loading

trivialfis commented Mar 12, 2019

canonizer commented Mar 12, 2019

Added SKL-like random forest Python API. #4148

Added SKL-like random forest Python API. #4148

Conversation

canonizer commented Feb 15, 2019 • edited Loading

RAMitchell left a comment

Choose a reason for hiding this comment

canonizer commented Feb 25, 2019

RAMitchell commented Feb 25, 2019

trivialfis commented Feb 25, 2019

trivialfis commented Mar 8, 2019

RAMitchell left a comment

Choose a reason for hiding this comment

hcho3 left a comment • edited Loading

Choose a reason for hiding this comment

trivialfis commented Mar 12, 2019

canonizer commented Mar 12, 2019

canonizer commented Feb 15, 2019 •

edited

Loading

hcho3 left a comment •

edited

Loading