[python][sklearn] respect objective aliases #4758

StrikerRUS · 2021-10-31T01:17:03Z

I believe this is quite odd to add objective alias used in sklearn (loss #4637) and simply ignore any objective aliases at the same time.

StrikerRUS · 2021-11-01T21:37:38Z

tests/python_package_test/test_sklearn.py

-    # invalid objective is replaced with default multiclass one
-    # and invalid binary metric is replaced with multiclass alternative
-    gbm = lgb.LGBMClassifier(objective='invalid_obj',
-                             **params).fit(eval_metric='binary_error', **params_fit)


I believe this behavior is controversial and I'd better remove such silent replacement of objective function and instead raise error.

I agree. We should explicitly tell the user what they intended to use is not valid. Or at least we should give a warning.

jameslamb

Totally support these changes and agree that this makes the scikit-learn API behave in a way that's closer to what we've told users to expect (where parameter aliases can always be used to override keyword arguments).

I just left one minor suggestion about some of the unrelated whitespace changes.

python-package/lightgbm/sklearn.py

shiyu1994 · 2021-11-03T06:24:16Z

python-package/lightgbm/sklearn.py

+        for alias in _ConfigAliases.get('objective'):
+            if alias in params:
+                self._objective = params.pop(alias)
+                _log_warning(f"Found `{alias}` in params. Will use it instead of argument")


Is this sentence complete? Should it be Will use it instead of argument `objective` ?

In this particular case I agree with you. Warning message was improved in 2d0392e.

However, please note that in this similar warning

LightGBM/python-package/lightgbm/engine.py

Lines 571 to 574 in da98f24

for alias in _ConfigAliases.get("num_iterations"):

if alias in params:

_log_warning(f"Found `{alias}` in params. Will use it instead of argument")

num_boost_round = params.pop(alias)

we cannot add specific argument name which is being overwritten because it depends on whether user uses train() function directly or indirectly from the sklearn-wrapper. Please consider the following example:

import lightgbm as lgb from sklearn.datasets import load_boston X, y = load_boston(return_X_y=True) lgb_train = lgb.Dataset(X, y) lgb.train({'n_iter': 5}, lgb_train) # here n_iter is used instead of num_boost_round argument lgb.LGBMRegressor(n_iter=5).fit(X, y) # here n_iter is used instead of n_estimators argument

shiyu1994 · 2021-11-03T06:38:00Z

tests/python_package_test/test_sklearn.py

-    # invalid objective is replaced with default multiclass one
-    # and invalid binary metric is replaced with multiclass alternative
-    gbm = lgb.LGBMClassifier(objective='invalid_obj',
-                             **params).fit(eval_metric='binary_error', **params_fit)


I agree. We should explicitly tell the user what they intended to use is not valid. Or at least we should give a warning.

StrikerRUS · 2021-11-03T15:51:36Z

Should we block merging of this PR for one week similarly to #4580 (comment)?

jameslamb · 2021-11-03T16:47:11Z

Should we block merging of this PR for one week similarly to #4580 (comment)?

I support not merging this for a few more days, yes.

github-actions · 2023-08-23T14:40:54Z

This pull request has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

respect objective aliases

95113b3

StrikerRUS added the feature label Oct 31, 2021

Update test_sklearn.py

252f36c

StrikerRUS commented Nov 1, 2021

View reviewed changes

StrikerRUS marked this pull request as ready for review November 1, 2021 21:38

StrikerRUS requested review from chivee, henry0312, hzy46, jameslamb, shiyu1994 and tongwu-sh as code owners November 1, 2021 21:38

jameslamb requested changes Nov 3, 2021

View reviewed changes

python-package/lightgbm/sklearn.py Outdated Show resolved Hide resolved

shiyu1994 reviewed Nov 3, 2021

View reviewed changes

StrikerRUS added 2 commits November 3, 2021 16:51

revert removal of blank lines

771cb58

add argument name which is being overwritten in warning message

2d0392e

jameslamb self-requested a review November 3, 2021 14:14

jameslamb approved these changes Nov 3, 2021

View reviewed changes

StrikerRUS mentioned this pull request Nov 3, 2021

[python] improve warning message about aliases in cv() function #4766

Merged

StrikerRUS requested a review from shiyu1994 November 3, 2021 15:51

shiyu1994 approved these changes Nov 5, 2021

View reviewed changes

StrikerRUS merged commit 0a4d190 into master Nov 10, 2021

StrikerRUS deleted the sklearn_objective branch November 10, 2021 13:15

StrikerRUS mentioned this pull request Jan 6, 2022

[DO NOT MERGE] Release 3.3.2 #4930

Closed

13 tasks

jameslamb mentioned this pull request Oct 7, 2022

[DO NOT MERGE] Release v3.3.3 #5525

Closed

40 tasks

github-actions bot locked as resolved and limited conversation to collaborators Aug 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[python][sklearn] respect objective aliases #4758

[python][sklearn] respect objective aliases #4758

StrikerRUS commented Oct 31, 2021

StrikerRUS Nov 1, 2021 •

edited

Loading

shiyu1994 Nov 3, 2021

jameslamb left a comment

shiyu1994 Nov 3, 2021 •

edited by StrikerRUS

Loading

StrikerRUS Nov 3, 2021

shiyu1994 Nov 3, 2021

StrikerRUS commented Nov 3, 2021

jameslamb commented Nov 3, 2021

github-actions bot commented Aug 23, 2023

	for alias in _ConfigAliases.get("num_iterations"):
	if alias in params:
	_log_warning(f"Found `{alias}` in params. Will use it instead of argument")
	num_boost_round = params.pop(alias)

[python][sklearn] respect objective aliases #4758

[python][sklearn] respect objective aliases #4758

Conversation

StrikerRUS commented Oct 31, 2021

StrikerRUS Nov 1, 2021 • edited Loading

Choose a reason for hiding this comment

shiyu1994 Nov 3, 2021

Choose a reason for hiding this comment

jameslamb left a comment

Choose a reason for hiding this comment

shiyu1994 Nov 3, 2021 • edited by StrikerRUS Loading

Choose a reason for hiding this comment

StrikerRUS Nov 3, 2021

Choose a reason for hiding this comment

shiyu1994 Nov 3, 2021

Choose a reason for hiding this comment

StrikerRUS commented Nov 3, 2021

jameslamb commented Nov 3, 2021

github-actions bot commented Aug 23, 2023

StrikerRUS Nov 1, 2021 •

edited

Loading

shiyu1994 Nov 3, 2021 •

edited by StrikerRUS

Loading