[ENH] Enable multitarget problem types for OWTestAndScore and OWPredictions #5848

JakaKokosar · 2022-02-17T14:54:15Z

Issue

Survival analysis expects two target variables: the first is the duration of time until the event of interest, and the second is the censoring indicator.

Preferably, the Survival Analysis add-on would use the same infrastructure for testing and scoring survival models as is currently in place for classification and regression problems. To achieve this, we need to loosen the constraint of a single target variable for the input data.

Description of changes

The goal of this pull request is not to change the interface to accommodate every future task one might like to support, but to:

- allow the use of multitarget data without making breaking changes to the existing codebase (the default behaviour does not change),
- as a start, make survival models/scorers available in Orange.

Since the registration of Scorers is already implemented, the next step was to determine which scorers are usable for the given input data. The Scorer base class now holds additional [information](https://github.com/biolab/orange3/compare/master...JakaKokosar:multi_target?expand=1#diff-ebac791194327a764153704a5e2567261585ad3971623c2138be81e8c02b8da5R69) to distinguish built-in Scorers from those implemented through add-ons.
If a Scorer is defined as built-in, nothing changes. For non-built-in Scorers, we look into the Table attributes to determine the 'problem type' of the input data. For example, for survival analysis, the As Survival Data widget sets the class variables on the output table and sets the table attributes as follows:

{ ..., 'problem_type': 'time_to_event' }

Usable scorers are those whose problem_type matches that of the input data. This is not necessarily the best solution and could use further debate. At this stage, no significant changes to the codebase were needed. In theory, everything else should be handled by the Learners, Models and Scorers defined for the related tasks (biolab/orange3-survival-analysis#27).
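The matching rule described above can be sketched roughly as follows. This is a minimal, self-contained illustration and not the code from this pull request: `Scorer`, `Table`, `is_built_in`, the class-level `problem_type`, `ConcordanceIndex` and this `usable_scorers` function are hypothetical stand-ins. The fallback to built-in scorers when no `problem_type` attribute is set is also an assumption made for the sketch.

```python
from dataclasses import dataclass, field


class Scorer:
    """Hypothetical stand-in for a scorer base class (not Orange's API)."""
    is_built_in = True    # assumed flag: scorer ships with the core package
    problem_type = None   # assumed: only set by add-on scorers


class Accuracy(Scorer):
    """Stands in for a built-in scorer."""


class ConcordanceIndex(Scorer):
    """Stands in for a scorer provided by a survival add-on."""
    is_built_in = False
    problem_type = "time_to_event"


@dataclass
class Table:
    """Hypothetical stand-in for a data table with free-form attributes."""
    attributes: dict = field(default_factory=dict)


def usable_scorers(data, scorers):
    """Return the scorers that apply to `data`.

    Assumption: without a 'problem_type' attribute the built-in scorers
    are used as before; otherwise only add-on scorers that declare the
    same problem type are kept.
    """
    problem_type = data.attributes.get("problem_type")
    if problem_type is None:
        return [s for s in scorers if s.is_built_in]
    return [
        s for s in scorers
        if not s.is_built_in and s.problem_type == problem_type
    ]


if __name__ == "__main__":
    survival = Table({"problem_type": "time_to_event"})
    names = [s.__name__ for s in usable_scorers(survival, [Accuracy, ConcordanceIndex])]
    print(names)
```

Keeping the problem type in the table attributes means the data itself says which family of scorers applies, so widgets that pass tables along do not need to know about survival analysis.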

Some examples:

Includes

Code changes

Tests

Documentation

codecov · 2022-02-17T15:11:15Z

Codecov Report

Merging #5848 (f820dea) into master (9af442d) will decrease coverage by 0.01%.
The diff coverage is 92.07%.

@@            Coverage Diff             @@
##           master    #5848      +/-   ##
==========================================
- Coverage   86.29%   86.28%   -0.02%     
==========================================
  Files         315      315              
  Lines       66830    66884      +54     
==========================================
+ Hits        57674    57712      +38     
- Misses       9156     9172      +16

janezd

I read the code to see the idea. My comments refer to what I spotted, and do not mean I like the idea. (I don't. :)

The problem is not the idea itself. What I dislike is that it is rather a patch over a bad overall design. We should consider some deeper changes, though I fear they lead towards discussing finally moving to pandas.

I think that in light of survival analysis and similar problems, we should consider multiple roles of variables. Currently, a variable can be an independent variable, a dependent variable or a meta, and they are stored in different matrices. We had another type (a weight), which was supposed to be unique (e.g. you cannot choose between multiple weight variables on the same data). You propose to add another role...

Going pandas would mean abandoning X, Y and metas as permanently materialized matrices, and instead having a column-based representation. At the same time, every column could be assigned a role. class_var would then be a property that returns the variable that is assigned a "target role".
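To make the role idea concrete, here is a rough sketch of a column-based domain in which `class_var` is derived from roles. All names (`Role`, `Column`, `RoleDomain`) are hypothetical and only illustrate the proposal. Returning `None` from `class_var` when there is not exactly one target is an assumption of this sketch.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class Role(Enum):
    """Hypothetical set of column roles."""
    FEATURE = "feature"
    TARGET = "target"
    META = "meta"


@dataclass
class Column:
    """A named column that carries its role instead of living in X, Y or metas."""
    name: str
    role: Role


class RoleDomain:
    """Hypothetical domain that derives target variables from column roles."""

    def __init__(self, columns: List[Column]):
        self.columns = columns

    @property
    def class_vars(self) -> List[Column]:
        # Every column with the target role, in declaration order.
        return [c for c in self.columns if c.role is Role.TARGET]

    @property
    def class_var(self) -> Optional[Column]:
        # Assumption: a single target is returned as-is; zero or several
        # targets (e.g. survival time plus event indicator) yield None.
        targets = self.class_vars
        return targets[0] if len(targets) == 1 else None


if __name__ == "__main__":
    survival = RoleDomain([
        Column("age", Role.FEATURE),
        Column("time", Role.TARGET),
        Column("event", Role.TARGET),
    ])
    print([c.name for c in survival.class_vars], survival.class_var)
```

With roles attached to columns, adding a new role would be a matter of extending the enumeration rather than adding another matrix.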

We need to decide whether to continue patching or bite into pandas.

Orange/classification/base_classification.py

Orange/evaluation/scoring.py

Orange/widgets/evaluate/owpredictions.py

Orange/widgets/evaluate/owtestandscore.py

Orange/widgets/evaluate/utils.py

Orange/regression/base_regression.py

Orange/base.py

Orange/evaluation/clustering.py

Orange/widgets/utils/owlearnerwidget.py

janezd assigned janezd and markotoplak Feb 18, 2022

janezd reviewed Feb 25, 2022

janezd removed their assignment Feb 25, 2022

JakaKokosar force-pushed the multi_target branch from fdcee4b to f76ef62 March 8, 2022 11:47

JakaKokosar commented Mar 8, 2022

Orange/regression/base_regression.py (outdated)

JakaKokosar force-pushed the multi_target branch 3 times, most recently from d9b8d98 to 63b7129 March 9, 2022 07:36

JakaKokosar changed the title ~~[RFC] Remove single target variable constraint~~ [ENH] Remove single target variable constraint Mar 10, 2022

JakaKokosar force-pushed the multi_target branch from 50eb9a0 to 5d4e3aa March 11, 2022 14:38

JakaKokosar changed the title ~~[ENH] Remove single target variable constraint~~ [ENH] Enable multitarget problem types for OWTestAndScore and OWPredictions Mar 11, 2022

JakaKokosar force-pushed the multi_target branch 2 times, most recently from ba37414 to adc46c7 March 14, 2022 08:44

markotoplak reviewed Mar 14, 2022

Orange/base.py

markotoplak reviewed Mar 14, 2022

Orange/base.py

markotoplak reviewed Mar 14, 2022

Orange/evaluation/clustering.py (outdated)

JakaKokosar force-pushed the multi_target branch 4 times, most recently from 83163ea to 04cf2e3 March 15, 2022 12:20

markotoplak reviewed Mar 15, 2022

Orange/widgets/utils/owlearnerwidget.py (outdated)

JakaKokosar force-pushed the multi_target branch 7 times, most recently from 8b327bd to cc5b1be March 16, 2022 16:21

JakaKokosar force-pushed the multi_target branch from cc5b1be to e80e45f March 16, 2022 16:27

JakaKokosar added 2 commits March 16, 2022 17:29

Enable multi target data for testing and predictions

004324e

scoring: find usable scorers for non-built-in problem types

19c1be7

JakaKokosar force-pushed the multi_target branch 7 times, most recently from b29e861 to b9ee08b March 18, 2022 14:25

markotoplak reviewed Mar 18, 2022

Orange/base.py (outdated)

markotoplak reviewed Mar 18, 2022

Orange/widgets/utils/owlearnerwidget.py (outdated)

learner adequacy check refactor

e1e7419

JakaKokosar force-pushed the multi_target branch from b9ee08b to 04937ae March 18, 2022 14:53

add multi_target_input tests

f820dea

JakaKokosar force-pushed the multi_target branch from 04937ae to f820dea March 21, 2022 09:01

markotoplak merged commit 28326e0 into biolab:master Mar 21, 2022

JakaKokosar added a commit to biolab/orange3-survival-analysis that referenced this pull request Mar 21, 2022

use new interface for testing and scoring learners

332dc79

biolab/orange3#5848

JakaKokosar mentioned this pull request Mar 21, 2022

[ENH] use new interface for testing and scoring learners biolab/orange3-survival-analysis#42

Merged

3 tasks

markotoplak mentioned this pull request Mar 22, 2022

[FIX] Orange.data.Table: deprecated is_view and is_copy #5913

Merged

2 tasks

VesnaT mentioned this pull request Apr 7, 2022

Permutation feature importance: Orange core compatibility biolab/orange3-explain#42

Merged

JakaKokosar deleted the multi_target branch September 15, 2022 09:41
