Added support for passthrough connection to stacking estimator converter #930

OKUA1 · 2022-10-30T08:59:12Z

As described in sklearn documentation, both StackingClassifier and StackingRegressor accept a passthrough argument, which (when set to True) allows to pass the original input data to the final estimator creating something similar to a skip connection.

This functionality is not realized in current converter which makes a converted model invalid in cases when passthrough was set to True during training.

Signed-off-by: Oleg Kostromin <kostromin97@gmail.com>

xadupre · 2022-11-02T10:56:33Z

Is it possible to add a unit test for it?

OKUA1 · 2022-11-02T16:40:04Z

@xadupre Yes, working on it right now. Quick question: when testing the regressor, some of the predictions are above the threshold causing the test to fail (the unit test for classification passes without issues). Could it be caused simply by float64->float32 conversion or should I look for another cause? And if so, do you have some advice how to properly debug it?

The test output:

--expected--output--

Arrays are not almost equal to 5 decimals

Mismatched elements: 11 / 125 (8.8%)
Max absolute difference: 2.32062132e-05
Max relative difference: 4.41075741e-06
onnx==1.12.0 onnxruntime==1.12.1 TARGET_OPSET=17
onnx==1.12.0 onnxruntime==1.12.1 TARGET_OPSET=17
onnx==1.12.0 onnxruntime==1.12.1 TARGET_OPSET=17

----------------------------------------------------------------------

Signed-off-by: Oleg Kostromin <kostromin97@gmail.com>

OKUA1 · 2022-11-03T22:26:20Z

I added several unit tests which match current ones, but with passthrough = True.

As mentioned earlier, the regression was/is a bit problematic one with deviation in the predictions of onnx/sklearn being slightly above the desired threshold for a small subset of tested samples. After further investigation I did not get any ideas what the cause could be except the originally assumed discrepancy due to the usage of float32 in onnx. For now, I used factor=0.1 in the respective unit test which prevents it from failing but probably not the best thing to do.

xadupre · 2022-11-10T09:36:16Z

tests/test_sklearn_stacking.py

+    @ignore_warnings(category=FutureWarning)
+    def test_concat_stacking_passthrough(self):
+
+        class CustomTransformer:


Just curious, why not inheriting from Estimator, TransformerMixin instead of implementing a new parser?

OKUA1 force-pushed the stacking_est_passthrough_conn branch from 17036f6 to cc95d9c Compare October 30, 2022 09:05

OKUA1 added 2 commits October 30, 2022 10:34

added support for passthrough conn to stacking est

09c731c

Signed-off-by: Oleg Kostromin <kostromin97@gmail.com>

fixed flake8 related issues

a2b25c1

Signed-off-by: Oleg Kostromin <kostromin97@gmail.com>

OKUA1 force-pushed the stacking_est_passthrough_conn branch from 3cc27d4 to a2b25c1 Compare October 30, 2022 09:34

Added unit test for stacking with passthrough

0484ba6

Signed-off-by: Oleg Kostromin <kostromin97@gmail.com>

xadupre reviewed Nov 10, 2022

View reviewed changes

xadupre merged commit 568375d into onnx:main Nov 22, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added support for passthrough connection to stacking estimator converter #930

Added support for passthrough connection to stacking estimator converter #930

OKUA1 commented Oct 30, 2022

xadupre commented Nov 2, 2022

OKUA1 commented Nov 2, 2022

OKUA1 commented Nov 3, 2022

xadupre Nov 10, 2022

Added support for passthrough connection to stacking estimator converter #930

Added support for passthrough connection to stacking estimator converter #930

Conversation

OKUA1 commented Oct 30, 2022

xadupre commented Nov 2, 2022

OKUA1 commented Nov 2, 2022

OKUA1 commented Nov 3, 2022

xadupre Nov 10, 2022

Choose a reason for hiding this comment