[Feature]: Meta `OrdinalClassifier` estimator #611

FBruzzesi · 2024-01-22T16:59:27Z

Description

Fix #607 by introducing OrdinalClassifier (meta) estimator.

Checklist

Add docstrings
Add documentation section
Add unit tests

sklego/meta/ordinal_classification.py

* unit tests * docs * change calibration api

tmp/ordinal_classification_demo.ipynb

tests/test_meta/test_ordinal_classification.py

docs/user-guide/meta-models.md

koaning · 2024-01-26T14:58:28Z

docs/user-guide/meta-models.md

+Examples of this kind of problem are: predicting customer satisfaction on a scale from 1 to 5, predicting the severity of a disease, predicting the quality of a product, etc.
+
+The [`OrdinalClassifier`][ordinal-classifier-api] is a meta-model that can be used to transform any classifier into an ordinal classifier by fitting N-1 binary classifiers, each handling a specific class boundary, namely: $P(y <= 1), P(y <= 2), ..., P(y <= N-1)$.
+


Bit of a nit, but might it make sense to add this diagram ...

... somewhere over here? ...

Makes 110% sense! Collapsed admonition should do the trick

koaning · 2024-01-26T15:04:14Z

sklego/meta/ordinal_classification.py

+    def score(self, X, y):
+        """Returns the accuracy score on the given test data and labels.
+
+        Parameters
+        ----------
+        X : array-like of shape (n_samples, n_features )
+            The training data.
+        y : array-like of shape (n_samples,)
+            The target values.
+
+        Returns
+        -------
+        score : float
+            Accuracy score of self.predict(X) wrt. y.
+        """
+        return accuracy_score(y, self.predict(X))


Doesn't the ClassifierMixin provide this already? If we want to do something custom here, perhaps it's better to re-use the score() method of the underlying classifier?

Oh you are right! I am removing it for now 😊
Not sure what something custom and default would be.

I am personally not paying much attention to accuracy in the ordinal problem I am dealing with, but don't want to put anything very opinionated, so let's go with the hierarchical implementation from ClassifierMixin

koaning

Nice work! It feels like this is about 90% done, but I did find some things to address. Relatively minor things though!

koaning · 2024-01-26T15:06:47Z

sklego/meta/ordinal_classification.py

+        self.classes_ = np.sort(np.unique(y))
+        self.n_features_in_ = X.shape[1]
+
+        if self.n_classes_ < 2:


I wonder ... don't we need at least three classes for ordinal regression to make sense? Any binary classification with two classes is automatically ordinal, no?

Also very true! I cheated for the scikit-learn tests 😂
Let me figure out how to deal with those

FBruzzesi · 2024-01-26T15:45:03Z

tests/test_meta/test_ordinal_classification.py

+            "check_dict_unchanged",
+            "check_fit2d_1feature",
+            "check_classifier_data_not_an_array",
+            "check_classifiers_classes",
+            "check_classifiers_train",


This is a fair amount of skipping, yet:

coverage is still same

the internal estimator will most likely deal directly with these cases (e.g. data not array)

Yeah, seems fair.

FBruzzesi added 3 commits January 20, 2024 12:10

implementation of main methods

8a4f011

docstrings

4175741

demo notebook

63652cd

FBruzzesi commented Jan 22, 2024

View reviewed changes

sklego/meta/ordinal_classification.py Show resolved Hide resolved

FBruzzesi commented Jan 22, 2024

View reviewed changes

sklego/meta/ordinal_classification.py Outdated Show resolved Hide resolved

FBruzzesi mentioned this pull request Jan 23, 2024

[FEATURE] Meta Ordinal Classification #607

Closed

tests,docs,api change

95cfd85

* unit tests * docs * change calibration api