-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add CTR conversion for categorical features (with bundling support) #2463
Conversation
@shiyu1994 could you resolve the conflict, and provide some benchmark results? |
include/LightGBM/bin.h
Outdated
* \param value | ||
* \return whether has value in categorical to bin | ||
*/ | ||
inline bool HasValueInCat(double value) const; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
it seems this is not used anymore.
include/LightGBM/bin.h
Outdated
@@ -148,7 +155,7 @@ class BinMapper { | |||
* \param zero_as_missing True to use zero as missing value | |||
*/ | |||
void FindBin(double* values, int num_values, size_t total_sample_cnt, int max_bin, int min_data_in_bin, int min_split_data, BinType bin_type, | |||
bool use_missing, bool zero_as_missing); | |||
bool use_missing, bool zero_as_missing, bool use_ctr); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
remove use_ctr
@shiyu1994 could you rewrite this when have time? |
Ok. I will rewrite it for the latest version of master branch. |
cb6e9f4
to
f5f27ca
Compare
Replaced by Pull Request #3234 |
This pull request has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this. |
This version keeps the original categorical split decisions in trees. We will change to pure numerical decisions in trees soon by recording the mappings from categorical values to CTR values in model. The modification so far only handles the ordinary train and prediction process of LightGBM, other functions like ForceSplits requires further check.