[Feature] Support for monotonic constraints? #14
Comments
I'm pasting the snippets for the monotonic constraints here
@alexvorobiev, do you have any reference papers for this feature?
@chivee I only have the reference to the R GBM package https://cran.r-project.org/package=gbm
@alexvorobiev, thanks for sharing. I'm trying to understand the idea behind this method.
Note that the given pseudo code only ensures that each individual split is in the correct order, not that the whole model is monotonic: a later split deeper in the tree could still make the model non-monotonic.
Any thoughts on this?
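To make the concern concrete, here is a hypothetical toy tree (illustrative Python, not code from either library): the root split on the constrained feature x is correctly ordered on average, yet a deeper split on a second feature z still breaks monotonicity in x.

```python
# Toy depth-2 regression tree (hypothetical, for illustration only).
# Intended constraint: predictions should be non-decreasing in x.

def predict(x, z):
    if x < 10.0:
        return 1.0          # left leaf of the root split on x
    # Right child of the root split on x; its *average* output,
    # (0.5 + 3.0) / 2 = 1.75, exceeds 1.0, so the root split itself
    # looks correctly ordered...
    if z < 0.0:
        return 0.5          # ...but this leaf dips below the left leaf's 1.0
    return 3.0

# For z < 0, increasing x from 5 to 15 decreases the prediction:
print(predict(5.0, -1.0), predict(15.0, -1.0))  # 1.0 0.5
```

So a per-split ordering check alone is not enough; the children's allowed output ranges have to be propagated down the tree as well.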
From a practical perspective (outside the Kaggle world!), this feature would be extremely helpful in many applications where reasonable model behavior is relevant.
@guolinke Would you be able to advise how to approach this, and whether it's feasible? I.e., where should it belong, and would it be sufficient to implement it in just one place? Here's the meat of the implementation in XGBoost, for reference: https://github.com/dmlc/xgboost/blob/master/src/tree/param.h#L422 -- pretty much all of it is contained there.
@aldanor the following may be useful. The split gain calculation: https://github.com/Microsoft/LightGBM/blob/master/src/treelearner/feature_histogram.hpp#L291-L297 ; the leaf-output calculation is in the same file.
@guolinke I may add some links here about the implementation in XGBoost.
@guolinke Monotonic constraints may be a very important requirement for the resulting models, for many reasons: e.g., as noted above, there could be domain knowledge that must be respected, as in insurance and risk-management problems. How about we all cooperate and make this work?
@aldanor very cool, I would like to work together on it.
It seems that MC (monotonic constraints) could be cumulative, that is, if both model A and model B are MC, then A+B is MC. Combining @chivee's pseudo code with @AbdealiJK's suggestion, I think that gives the algorithm.
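A minimal sketch of such a combined algorithm (hypothetical names, not LightGBM code; it mirrors xgboost's exact-mode approach of skipping splits whose child outputs would come out in the wrong order, using the usual second-order leaf output w = -G / (H + lambda)):

```python
# Sketch: reject candidate splits that would violate a monotone constraint.
# constraint = +1 for increasing, -1 for decreasing.

def leaf_output(grad_sum, hess_sum, lam=1.0):
    return -grad_sum / (hess_sum + lam)

def best_monotone_split(grads, hesses, constraint=+1, lam=1.0):
    """grads/hesses are pre-sorted by the feature's value/bin order.
    Returns (best_gain, split_index) among splits that keep
    left_output <= right_output (for constraint == +1)."""
    G, H = sum(grads), sum(hesses)
    best_gain, best_idx = 0.0, None
    gl = hl = 0.0
    for i in range(len(grads) - 1):
        gl += grads[i]; hl += hesses[i]
        gr, hr = G - gl, H - hl
        wl, wr = leaf_output(gl, hl, lam), leaf_output(gr, hr, lam)
        if constraint * (wr - wl) < 0:
            continue  # split would break the monotone ordering -> skip
        # standard structure-score gain (up to the usual 1/2 factor)
        gain = gl*gl/(hl+lam) + gr*gr/(hr+lam) - G*G/(H+lam)
        if gain > best_gain:
            best_gain, best_idx = gain, i
    return best_gain, best_idx

print(best_monotone_split([2.0, 1.0, -1.0, -2.0], [1.0]*4))  # (6.0, 1)
```

Note this only enforces the per-split ordering; as pointed out above, the children's outputs also need to be bounded (xgboost does this by recursing with limits derived from the parent split's leaf values) so that the whole model stays monotone.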
@aldanor would you like to create a PR first? I can provide my help in the PR.
@guolinke I will give it a try, yep. Your suggested algorithm in the snippet above looks fine; that's roughly what xgboost does (in exact mode though, not histogram; do you think there would be any complications here because of binning?). Where would this code belong, then? Edit: what do you mean by that?
@aldanor We need to update the calculation of gain: https://github.com/Microsoft/LightGBM/blob/master/src/treelearner/feature_histogram.hpp#L354-L357 and https://github.com/Microsoft/LightGBM/blob/master/src/treelearner/feature_histogram.hpp#L415-L418 . We may need to wrap these into a new function, and implement both the non-constrained and MC versions of it.
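One possible shape for such a wrapper (a hedged sketch with hypothetical names, not the actual LightGBM refactor): clamp the closed-form leaf output into an allowed [lo, hi] range when a constraint is active, and evaluate the gain from the possibly-clamped output instead of assuming the unconstrained optimum.

```python
def leaf_gain_given_output(G, H, w, lam=1.0):
    """Score contribution of a leaf forced to output w; equals the
    familiar G*G/(H+lam) when w is the unconstrained optimum -G/(H+lam)."""
    return -(2.0 * G * w + (H + lam) * w * w)

def constrained_leaf_output(G, H, lam=1.0, lo=float("-inf"), hi=float("inf")):
    # closed-form optimum, clamped into the allowed range
    return min(max(-G / (H + lam), lo), hi)

def split_gain(GL, HL, GR, HR, lam=1.0, lo=float("-inf"), hi=float("inf")):
    wl = constrained_leaf_output(GL, HL, lam, lo, hi)
    wr = constrained_leaf_output(GR, HR, lam, lo, hi)
    wp = constrained_leaf_output(GL + GR, HL + HR, lam, lo, hi)
    return (leaf_gain_given_output(GL, HL, wl, lam)
            + leaf_gain_given_output(GR, HR, wr, lam)
            - leaf_gain_given_output(GL + GR, HL + HR, wp, lam))

# With no bounds this reduces to the familiar formula:
print(split_gain(2.0, 1.0, -2.0, 1.0))                   # 4.0
# Clamping the child outputs can only lower the gain:
print(split_gain(2.0, 1.0, -2.0, 1.0, lo=-0.5, hi=0.5))  # 3.0
```

After a monotone split is taken, the two children could then be searched recursively with tightened [lo, hi] bounds (xgboost, as far as I can tell, bounds both children by the mid-point of the two leaf values).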
@aldanor any updates?
I would also be very interested in seeing this feature implemented in LightGBM. As aldanor stated above, the pseudo-code suggested earlier is correct and is how XGBoost implements monotonic constraints. As such, this feature should be fairly trivial to implement for someone with an intimate knowledge of the codebase.
got it, I'll wait for someone with a better understanding of the codebase to implement this then. |
you can try #1314 |
Are you planning support for monotonic constraints? See e.g. dmlc/xgboost#1514