Adding a Link object #14

theogf · 2020-09-06T12:09:37Z

We need to support different types of mapping when a constraint is applied to the GP. For example, for Poisson the standard choice so far was exp(f), but other options exist: f^2, or s * logistic(f).

The convention is to call the connection between the parameter(s) m needed for the likelihood and the input(s) f the inverse link, because traditionally the link is the function that goes from m to f. See for example: http://web.pdx.edu/~newsomj/mvclass/ho_link.pdf

This PR aims to introduce this with an AbstractLink type.
Some likelihoods need an inverse link, for example BernoulliLikelihood or PoissonLikelihood, so it makes sense for these likelihoods to contain the inverse link invlink and be parametrized by it. This also makes it possible to dispatch on the link type later and lets people use their own transformations. For example, my augmentation stuff only works for the LogitLink but not the ProbitLink.
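
To make the parametrization concrete, here is a minimal sketch of a likelihood that stores its inverse link as a type parameter so that methods can dispatch on it. The names LogisticLink, success_probability and supports_augmentation are hypothetical and chosen for illustration only; they are not taken from the PR's code.

```julia
# Hypothetical link type: maps a latent value f to a probability in (0, 1).
struct LogisticLink end
(::LogisticLink)(f::Real) = 1 / (1 + exp(-f))

# The likelihood is parametrized by the type of its inverse link.
struct BernoulliLikelihood{L}
    invlink::L
end

# Sensible default when the user does not pass a link.
BernoulliLikelihood() = BernoulliLikelihood(LogisticLink())

# Parameter of the Bernoulli distribution for a latent value f.
success_probability(lik::BernoulliLikelihood, f::Real) = lik.invlink(f)

# Hypothetical trait: only some links support a given inference scheme.
supports_augmentation(::BernoulliLikelihood) = false
supports_augmentation(::BernoulliLikelihood{LogisticLink}) = true

# Any callable can be used as a custom inverse link.
custom = BernoulliLikelihood(f -> 0.5 * (1 + tanh(f)))
```

With this layout, supports_augmentation(BernoulliLikelihood()) takes the specialised method, while the custom likelihood falls back to the generic one.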

For a small collection of AbstractLinks I defined just apply(link, x) (which is called by (link::LinkType)(x)) and also Base.inv, which returns the inverse link if it exists (similarly to Bijectors).
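
A rough sketch of what such a collection could look like is given below. It follows the description above (an apply function, a callable link, and Base.inv), but the concrete type names and the generic Link wrapper are assumptions for illustration and may differ from what the PR actually defines.

```julia
abstract type AbstractLink end

# Generic function that each concrete link implements.
function apply end

# Calling a link forwards to `apply` (methods on abstract types need Julia >= 1.3).
(link::AbstractLink)(x) = apply(link, x)

# A few hypothetical concrete links.
struct ExpLink <: AbstractLink end
struct LogLink <: AbstractLink end
struct SquareLink <: AbstractLink end
struct SqrtLink <: AbstractLink end

apply(::ExpLink, x::Real) = exp(x)
apply(::LogLink, x::Real) = log(x)
apply(::SquareLink, x::Real) = x^2
apply(::SqrtLink, x::Real) = sqrt(x)

# Base.inv returns the inverse link when one is known.
Base.inv(::ExpLink) = LogLink()
Base.inv(::LogLink) = ExpLink()
# Square and sqrt are only inverses of each other on the non-negative reals.
Base.inv(::SquareLink) = SqrtLink()
Base.inv(::SqrtLink) = SquareLink()

# Wrapper for an arbitrary user-supplied function; no `inv` method is defined for it.
struct Link{F} <: AbstractLink
    f::F
end
apply(l::Link, x) = l.f(x)

# Example of the s * logistic(f) mapping with s = 3.
scaled_logistic = Link(f -> 3.0 / (1 + exp(-f)))
```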

I already did a basic implementation, but I realised a lot of it overlaps with Bijectors.jl and we could consider using it. However, wouldn't that be a heavier dependency? I think NNlib is pretty heavy, right?

devmotion · 2020-09-06T12:48:04Z

An alternative would be to not use different link functions but define additional likelihood functions (i.e., one for Poisson with the exp link, one with x^2 etc.).

theogf · 2020-09-06T12:56:32Z

I thought that it would be cooler to be able to input really any kind of link! One could even think about passing a constrained NN (paper idea here 😆 )

sharanry

Is it possible to keep links separated from the likelihoods? That is, not have it part of every likelihood? We could think of this as an optional step between the GP outputs and likelihood function.

We could also define LinkedLikelihood similar to transforms in KernelFunctions.jl such that it is just another likelihood.

struct LinkedLikelihood{T}
    link::Link
    lik::T
end

(ll::LinkedLikelihood)(f::Real) = ll.lik(ll.link(f))
(ll::LinkedLikelihood)(fs::AbstractVector{<:Real}) = ll.lik(ll.link.(fs))

or something like

struct LinkedLikelihood{T1,T2}
    link::T1
    lik::T2
end

(ll::LinkedLikelihood)(f::Real) = ll.lik(ll.link(f))
(ll::LinkedLikelihood)(fs::AbstractVector{<:Real}) = ll.lik(ll.link.(fs))

I am supportive of the second approach as both links and likelihoods are essentially functions. And the second approach provides the maximum amount of flexibility.
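
As a small usage sketch of the second approach: because both fields are type parameters, any callable can be plugged in on either side. The poisson_params function below is a toy stand-in for a real likelihood (it only returns a named tuple) and is purely hypothetical.

```julia
# Fully parametric wrapper, as in the second proposal above.
struct LinkedLikelihood{T1,T2}
    link::T1
    lik::T2
end

(ll::LinkedLikelihood)(f::Real) = ll.lik(ll.link(f))
(ll::LinkedLikelihood)(fs::AbstractVector{<:Real}) = ll.lik(ll.link.(fs))

# Toy stand-in for a likelihood: returns the distribution parameters only.
poisson_params(rate::Real) = (rate = rate,)
poisson_params(rates::AbstractVector{<:Real}) = [(rate = r,) for r in rates]

# A plain function (`exp`) serves as the link here.
ll = LinkedLikelihood(exp, poisson_params)

single = ll(0.0)          # link applied to one latent value
batch = ll([0.0, 0.0])    # link broadcast over a vector of latent values
```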

src/likelihoods/gaussian.jl

src/likelihoods/link.jl

theogf · 2020-09-07T10:41:29Z

> Is it possible to keep links separated from the likelihoods? That is, not have it part of every likelihood? We could think of this as an optional step between the GP outputs and likelihood function.
>
> We could also define LinkedLikelihood similar to transforms in KernelFunctions.jl such that it is just another likelihood.

I don't think it is possible to separate links and likelihoods. Unlike in KernelFunctions, for some likelihoods links/transformations are necessary. I do like the idea of a LinkedLikelihood for likelihoods that don't need a link but for which it could be interesting to have one.

sharanry · 2020-09-07T10:46:11Z

> Unlike in KernelFunctions, for some Likelihoods links/transformations are necessary.

Okay that makes sense. 🙂 But we should always have a sensible default link in such cases.

theogf · 2020-09-07T10:47:35Z

Actually, one change that could be needed is the naming. Should it be InvLink or Link? For example, in GPflow they use invlink: https://github.com/GPflow/GPflow/blob/5a945d67b37120610880c3323224a4e86404ae1d/gpflow/likelihoods/scalar_discrete.py#L16
Additionally here: https://en.wikipedia.org/wiki/Generalized_linear_model#Link_function, the link is defined as logit and not logistic. Never really understood why btw...

theogf · 2021-04-14T16:45:15Z

As an alternative (I feel I am rewriting existing code), we could use Bijectors.jl to create the links?
I think that for the most important ones, a bijector and its inverse already exist.

devmotion · 2021-04-14T17:41:05Z

IMO we shouldn't use Bijectors.jl, mainly because it's a very heavy dependency and, more importantly, because my impression is that its design is very much focused on Turing and AdvancedHMC. In particular, all inputs and outputs have to be arrays and have to be of the same dimension and size. IMO this is quite limiting in more general applications where you want to map a submanifold of lower dimension (e.g. matrices with a special structure or the simplex). There are some open issues, but I don't think the design will be changed in the near future since it would be very breaking for Turing.
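
To illustrate the kind of mapping where input and output sizes differ, here is a sketch that sends n-1 unconstrained reals to a point on the simplex with n entries by appending a fixed zero logit and applying a softmax. The function name simplex_invlink is hypothetical, and this is only one possible construction, not necessarily the one used in the PR.

```julia
# Map n-1 unconstrained latent values to n probabilities that sum to one.
function simplex_invlink(f::AbstractVector{<:Real})
    logits = vcat(f, zero(eltype(f)))  # fix the last logit to zero
    m = maximum(logits)                # subtract the maximum for numerical stability
    w = exp.(logits .- m)
    return w ./ sum(w)
end

p = simplex_invlink([0.0, 0.0])  # two inputs, three outputs
```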

However, I also think it would be good to use existing functionality in other packages, if possible. Maybe TransformVariables could be helpful? IIRC it supports more flexible mappings and input and output types and would be a lighter dependency.

theogf · 2021-04-14T19:43:24Z

I had a look at TransformVariables and it looks great indeed; I only think we would need to use the low-level API, since the `as` API is too limiting. Funnily enough they have a transform function 😂. Sounds like the change for KernelFunctions.jl comes right in time!

src/GPLikelihoods.jl

st-- · 2021-04-19T11:32:52Z

> Actually, one change that could be needed is the naming. Should it be InvLink or Link? For example, in GPflow they use invlink: https://github.com/GPflow/GPflow/blob/5a945d67b37120610880c3323224a4e86404ae1d/gpflow/likelihoods/scalar_discrete.py#L16
> Additionally here: https://en.wikipedia.org/wiki/Generalized_linear_model#Link_function, the link is defined as logit and not logistic. Never really understood why btw...

Depends on which way around you define the bijector. In the statistics community, the link function is well defined to be the object that satisfies link(y) = f, where y is your observations/likelihood parameters, and f is your latent (generally assumed linear) model. So y = invlink(f). In implementing likelihoods for our GP models we generally care about the f -> y direction only, so we define the invlink explicitly.
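
The convention described above can be checked numerically for the Bernoulli case, where the link is logit and the inverse link is logistic. This is a plain-Julia sketch with both functions defined locally rather than imported from a package.

```julia
# Link: from the likelihood parameter (a probability) to the latent value.
logit(p::Real) = log(p / (1 - p))

# Inverse link: from the latent value to the likelihood parameter.
logistic(f::Real) = 1 / (1 + exp(-f))

f = 0.7
y = logistic(f)        # y = invlink(f)
f_back = logit(y)      # link(y) recovers f up to rounding error
```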

theogf · 2021-04-19T14:06:08Z

@st-- Thanks for the explanation. I was indeed not familiar with the statistics nomenclature. The names have already been changed accordingly. One can switch from one representation to the other (when possible) with inv.

theogf · 2021-07-07T13:24:36Z

Can I get a fresh review on this @devmotion @st-- @willtebbutt? Happy to bring it back from the dead.

willtebbutt

Just done a quick pass.

src/GPLikelihoods.jl

src/likelihoods/bernoulli.jl

src/likelihoods/categorical.jl

codecov-commenter · 2021-07-21T14:40:50Z

Codecov Report

Merging #14 (e308d72) into master (c46ad70) will increase coverage by 12.97%.
The diff coverage is 85.00%.

@@             Coverage Diff             @@
##           master      #14       +/-   ##
===========================================
+ Coverage   69.56%   82.53%   +12.97%     
===========================================
  Files           6        7        +1     
  Lines          23       63       +40     
===========================================
+ Hits           16       52       +36     
- Misses          7       11        +4

| Impacted Files | Coverage Δ |
|---|---|
| src/likelihoods/gamma.jl | `60.00% <66.66%> (+10.00%)` ⬆️ |
| src/likelihoods/bernoulli.jl | `75.00% <75.00%> (+8.33%)` ⬆️ |
| src/likelihoods/exponential.jl | `75.00% <75.00%> (+8.33%)` ⬆️ |
| src/likelihoods/poisson.jl | `75.00% <75.00%> (+8.33%)` ⬆️ |
| src/likelihoods/gaussian.jl | `77.77% <87.50%> (+6.34%)` ⬆️ |
| src/links.jl | `87.87% <87.87%> (ø)` |
| src/likelihoods/categorical.jl | `100.00% <100.00%> (ø)` |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

theogf · 2021-07-21T14:56:20Z

@st-- @willtebbutt
Ready for another pass

willtebbutt

Broadly looks really nice -- a number of style things / comments on docstrings, and a couple of small design things.

src/likelihoods/categorical.jl

src/likelihoods/exponential.jl

src/likelihoods/gamma.jl

src/likelihoods/gaussian.jl

src/links.jl

test/links.jl

willtebbutt

I'm happy with this now.

theogf added 5 commits September 6, 2020 13:55

Adding the AbstractLink / Link objects and use it in PoissonLikelihood

a2447bb

Add tests

763bd9d

Use vector for the GaussianLikelihood for inplace changes

50ddf5e

Fixing various bugs

3952a43

Add functor to PoissonLikelihood and export Link and LogisticLink

6b6dcf2

theogf mentioned this pull request Sep 7, 2020

Add Bernoulli likelihood #15

Merged

sharanry reviewed Sep 7, 2020

src/likelihoods/gaussian.jl (resolved)

src/likelihoods/link.jl (outdated, resolved)

willtebbutt mentioned this pull request Sep 22, 2020

TestUtils #18

Closed

theogf added 3 commits February 15, 2021 10:49

Merge branch 'master' into create_link

b23ff5f

Moved file and added more definitions

63d10a3

Use links directly in the likelihoods

22d018f

devmotion reviewed Apr 14, 2021

src/GPLikelihoods.jl (outdated, resolved)

More beautiful stuff added

e35d509

willtebbutt reviewed Jul 7, 2021

src/GPLikelihoods.jl (outdated, resolved)

src/GPLikelihoods.jl (resolved)

src/likelihoods/bernoulli.jl (resolved)

src/likelihoods/categorical.jl (outdated, resolved)

theogf added 5 commits July 21, 2021 14:44

Ignore vscode

df4403b

Fixing default links for likelihoods

dafd38e

Merge branch 'master' into create_link

d686693

Use LogExpFunctions instead of StatsFuns

b0072e2

Readds StatsFuns

954fd50

Added links for exponential and Gamma

8db3e70

theogf added 2 commits July 21, 2021 16:46

Add more tests

5054cee

Fixed errors in the links

e308d72

willtebbutt reviewed Jul 21, 2021

theogf added 5 commits July 21, 2021 18:43

Apply formatting (on src)

2f8f7e0

Fix docstrings

93455a4

Correct more docstrings

55a5021

Add inv(inv(x)) == x tests

17fdb3b

remove apply and correct simplex definition

d6723c2

willtebbutt approved these changes Jul 21, 2021

theogf merged commit 56531dd into master Jul 21, 2021

theogf deleted the create_link branch October 14, 2021 08:35
