Add wrapper to transform targets of (semi)supervised models #678

ablaom · 2021-11-17T05:06:59Z

This PR adds a model wrapper TransformedTargetModel implementing this suggestion of @CameronBieganek. Together with the Pipeline model already implemented (on the target branch), this wrapper renders the @pipeline macro redundant and hence resolves this Pluto notebook issue.

The doc-string for the new model appears below.

This PR is not breaking, but I am basing it on the 0.19 release branch because it was convenient to use some utilities introduced there for the new pipelines.

To do:

- add deprecation warnings for @pipeline
- review test coverage

TransformedTargetModel(model; target=nothing, inverse=nothing, cache=true)

Wrap the supervised or semi-supervised model in a transformation of
the target variable.

Here target is either:

- The Unsupervised model that is to transform the training target.
  By default (inverse=nothing) the parameters learned by this
  transformer are also used to inverse-transform the predictions of
  model, which means target must implement the inverse_transform
  method. If this is not the case, specify inverse=identity to
  suppress inversion.

or

- A callable object for transforming the target, such as y -> log.(y). In this case
  a callable inverse, such as z -> exp.(z), should be specified.

Specify cache=false to prioritize memory over speed, or to guarantee data
anonymity.

Specify inverse=identity if model is a probabilistic predictor, as
inverse-transforming sample spaces is not supported. Alternatively,
replace model with a deterministic model, such as Pipeline(model, y -> mode.(y)).

Examples

A model that normalizes the target before applying ridge regression,
with predictions returned on the original scale:

@load RidgeRegressor pkg=MLJLinearModels
model = RidgeRegressor()
tmodel = TransformedTargetModel(model, target=Standardizer())

A model that instead applies a static log transformation to the target, again
returning predictions on the original scale:

tmodel2 = TransformedTargetModel(model, target=y->log.(y), inverse=z->exp.(z))
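
To make the mechanics described in the doc-string concrete, here is a minimal sketch in plain Julia of what "learn a target transformation, train on the transformed target, then inverse-transform the predictions" amounts to. It is not the MLJBase implementation: the helper names (fit_standardizer, transform, inverse_transform, fit_ols, predict_ols) are hypothetical, ordinary least squares stands in for model, and a hand-rolled standardizer stands in for Standardizer().

```julia
using Statistics, LinearAlgebra

# Hypothetical helpers for illustration only; none of these are MLJBase functions.

# "Fit" a standardizer on the training target: learn its mean and standard deviation.
fit_standardizer(y) = (mu = mean(y), sigma = std(y))
transform(p, y) = (y .- p.mu) ./ p.sigma
inverse_transform(p, z) = z .* p.sigma .+ p.mu

# Ordinary least squares with an intercept, standing in for the wrapped `model`.
fit_ols(X, y) = hcat(ones(size(X, 1)), X) \ y
predict_ols(coefs, X) = hcat(ones(size(X, 1)), X) * coefs

# Toy data: the target is an exact linear function of the single feature.
X = reshape(collect(1.0:10.0), :, 1)
y = 3.0 .* vec(X) .+ 100.0

# Training: learn the target transformation, then fit on the transformed target.
params = fit_standardizer(y)
coefs = fit_ols(X, transform(params, y))

# Prediction: predict on the transformed scale, then invert using the
# parameters that were learned at training time.
Xnew = reshape([11.0, 12.0], :, 1)
yhat = inverse_transform(params, predict_ols(coefs, Xnew))
```

Because the toy target is exactly linear, yhat should agree with 3x + 100 at the new points (133 and 136) up to floating-point error. Replacing transform and inverse_transform with y -> log.(y) and z -> exp.(z) gives the static, parameter-free variant corresponding to the second example.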

ablaom · 2021-11-17T05:09:26Z

@CameronBieganek Would be great if you could comment on the doc-string. If you also have time for a more detailed review over the next two weeks, let me know.

codecov-commenter · 2021-11-17T05:19:43Z

Codecov Report

Merging #678 (e49a339) into for-0-point-19-release (0f79fe9) will decrease coverage by 2.35%.
The diff coverage is 84.90%.

@@                    Coverage Diff                     @@
##           for-0-point-19-release     #678      +/-   ##
==========================================================
- Coverage                   85.84%   83.48%   -2.36%     
==========================================================
  Files                          40       41       +1     
  Lines                        3610     3173     -437     
==========================================================
- Hits                         3099     2649     -450     
- Misses                        511      524      +13

Impacted Files	Coverage Δ
src/MLJBase.jl	`100.00% <ø> (ø)`
src/composition/models/transformed_target_model.jl	`84.90% <84.90%> (ø)`
src/sources.jl	`70.00% <0.00%> (-18.00%)`	⬇️
src/data/datasets.jl	`86.84% <0.00%> (-13.16%)`	⬇️
src/measures/continuous.jl	`87.80% <0.00%> (-8.03%)`	⬇️
src/show.jl	`29.92% <0.00%> (-6.72%)`	⬇️
src/measures/measures.jl	`68.29% <0.00%> (-5.40%)`	⬇️
src/measures/probabilistic.jl	`58.46% <0.00%> (-4.70%)`	⬇️
src/composition/models/inspection.jl	`95.83% <0.00%> (-4.17%)`	⬇️
src/measures/finite.jl	`93.99% <0.00%> (-4.14%)`	⬇️
... and 27 more

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

ablaom added 2 commits November 17, 2021 17:13

add target transformer

aee6133

rename TargetTransformedModel -> TransformedTargetModel

7dcd9b8

ablaom changed the title ~~Add wrapper to transform targets of (semi)supervised model~~ Add wrapper to transform targets of (semi)supervised models Nov 17, 2021

ablaom mentioned this pull request May 16, 2022

Wrapper to convert arbitrary clusterer into a classifying one #768

Open

ablaom added 3 commits November 24, 2021 09:07

boost test coverage

2825c1d

more test coverage

forgotten end in test

e49a339

doc-string tweaks

c1f5a6e

ablaom merged commit e11052a into for-0-point-19-release Nov 23, 2021

ablaom mentioned this pull request Nov 23, 2021

For a 0.19 release #665

Closed

18 tasks

ablaom mentioned this pull request Dec 14, 2021

Add a supervised model wrapper to implement target transformations #642

Closed

This was referenced Dec 23, 2021

Issue to trigger releases #345

Closed

For 0.17 release JuliaAI/MLJ.jl#864

Closed

Issue to trigger new releases JuliaAI/MLJ.jl#571

Closed

JuliaRegistrator mentioned this pull request Dec 29, 2021

New version: MLJ v0.17.0 JuliaRegistries/General#51365

Merged

DilumAluthge deleted the target-transformer branch January 23, 2022 20:34
