-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Different feature importances on different feature order even with deterministic params #6069
Comments
Thanks for using LightGBM. Before we investigate this... please see very-similar discussions in this project's issue tracker:
Once you've read those, if you're certain none of the advice there applies to your situation, let us know and someone will take a closer look. |
I've seen these issues before writing – sadly, none of them relate to our case. We care here about feature importance stability and not scores, because we're using feature importance in our feature selection algorithm. The example data is not duplicated, it has 11k rows, and we don't use parameters that bring randomness. |
@jameslamb Is there any news on this? Seems like this is a bug, at least from what I expect from the documentation. |
Please don't leave "any updates on this?" types of comments in this project. If you're interested in investigating this and trying to find and fix the root cause, or if you have new information to add, we'd be grateful for the help. Otherwise, being subscribed to the issue is sufficient guarantee that you'll be notified if something around it changes. |
Description
We've run into an issue where identical input data produces different feature importance if the column order is different. This happens even with
feature_fraction: 1.0, 'deterministic': True, 'force_row_wise': True
so it doesn't seem like an issue of subsampling.Reproducible example
Here is example data on which we were able to reproduce it: https://github.com/upgini/upgini/blob/add-lgbm-example/notebooks/lgbm_example_data.csv.zip
This param set produces less variations, but the results are still different:
Environment info
LightGBM version or commit hash:
Tested both on 3.3.5 and 4.0.0
Command(s) you used to install LightGBM
x86 build
Additional Comments
The text was updated successfully, but these errors were encountered: