add combine_terms option to exact MLL #1863
Conversation
@jacobrgardner @gpleiss any thoughts?

Yeah, this would be awesome to add!
(Branch force-pushed from ba187db to 328ebd0.)
@gpleiss how does everything look?
```diff
@@ -203,12 +203,12 @@ def get_base_samples(self, sample_shape=torch.Size()):
             return base_samples.view(new_shape).transpose(-1, -2).contiguous()
         return base_samples.view(*sample_shape, *self._output_shape)

-    def log_prob(self, value):
+    def log_prob(self, value, combine_terms=True):
```
In general, I don't think we want to be adding flags to the standard `log_prob` call here, to maintain compatibility with the MVN API in PyTorch. Let's have this be a `_log_prob` method, with `log_prob` just calling `_log_prob(value=value, combine_terms=True)`?
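A minimal sketch of that wrapper pattern, using a toy stand-in for the MVN class (the class name and its internals here are illustrative assumptions, not the PR's actual code):

```python
import math
import torch

class _ToyMVN:
    """Toy stand-in for gpytorch's MultivariateNormal, illustrating the
    suggested private-helper pattern (all names here are hypothetical)."""

    def __init__(self, mean, covar):
        self.mean = mean
        self.covar = covar  # dense covariance matrix, for simplicity

    def _log_prob(self, value, combine_terms=True):
        diff = value - self.mean
        inv_quad = diff @ torch.linalg.solve(self.covar, diff)
        logdet = torch.logdet(self.covar)
        norm_const = torch.tensor(value.size(-1) * math.log(2 * math.pi))
        terms = [-0.5 * inv_quad, -0.5 * logdet, -0.5 * norm_const]
        return sum(terms) if combine_terms else terms

    def log_prob(self, value):
        # Public signature stays compatible with torch.distributions.
        return self._log_prob(value=value, combine_terms=True)
```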
```diff
@@ -142,7 +146,7 @@ def lazy_covariance_matrix(self):
         else:
             return lazify(super().covariance_matrix)

-    def log_prob(self, value):
+    def log_prob(self, value, combine_terms=True):
```
Same here, change to `_log_prob`?
Looks like the failing unit test was flaky.
```diff
@@ -59,9 +62,17 @@ def forward(self, function_dist, target, *params):

         # Get the log prob of the marginal distribution
         output = self.likelihood(function_dist, *params)
-        res = output.log_prob(target)
-        res = self._add_other_terms(res, params)
+        res = output.log_prob(target, combine_terms=self.combine_terms)
```
Stylistically, the proposed change from `log_prob` to `_log_prob` is problematic here, because you would essentially be calling a "private" method publicly. More generally, I think the `combine_terms` option is broadly useful, and burying it inside the class makes it harder to use.

Personally, I don't see why the GPyTorch `log_prob` API can't allow optional keyword arguments like `combine_terms`, as long as the default behavior is consistent.
@gpleiss @jacobrgardner care to weigh in?
A compromise would be to just call it `log_prob_terms` instead of `_log_prob`.
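Reusing the toy class from the sketch above, that compromise might look like this (again, the names apart from `log_prob_terms` itself are hypothetical):

```python
class _ToyMVNWithTerms(_ToyMVN):
    """Variant following the log_prob_terms compromise."""

    def log_prob_terms(self, value):
        # Public, discoverable entry point for the split terms.
        return self._log_prob(value, combine_terms=False)

    def log_prob(self, value):
        # The combined value is just the sum of the split terms.
        return sum(self.log_prob_terms(value))
```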
```python
        split_terms = [inv_quad, logdet, norm_const]
        split_terms = [-0.5 * term for term in split_terms]
```
Suggested change:

```diff
-        split_terms = [inv_quad, logdet, norm_const]
-        split_terms = [-0.5 * term for term in split_terms]
+        split_terms = [-0.5 * inv_quad, logdet, -0.5 * norm_const]
```
```diff
@@ -17,6 +19,7 @@ class ExactMarginalLogLikelihood(MarginalLogLikelihood):

     :param ~gpytorch.likelihoods.GaussianLikelihood likelihood: The Gaussian likelihood for the model
     :param ~gpytorch.models.ExactGP model: The exact GP model
+    :param ~bool combine_terms (optional): If `False`, the MLL call returns each MLL term separately
```
Should probably also describe what happens if there are "other terms" (i.e., that they are added to the returned elements).
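For illustration, a hedged sketch of how `forward` might surface those other terms when `combine_terms=False` (the `_add_other_terms` and `log_prob_terms` names are taken from the diffs above; everything else is an assumption, not the PR's actual code):

```python
import torch

def exact_mll_forward(mll, function_dist, target, *params):
    """Sketch only: the split path returns the MLL terms plus the
    accumulated "other terms" (priors, added-loss terms) as a final element."""
    output = mll.likelihood(function_dist, *params)
    if mll.combine_terms:
        res = output.log_prob(target)
        return mll._add_other_terms(res, params)
    terms = output.log_prob_terms(target)
    # Accumulate priors/added-loss terms onto zero so they come back separately.
    other = mll._add_other_terms(torch.zeros_like(terms[0]), params)
    return [*terms, other]
```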
```diff
         actual = TMultivariateNormal(mean, torch.eye(4, device=device, dtype=dtype) * var).log_prob(values)
         self.assertLess((res - actual).div(res).abs().item(), 1e-2)

+        res2 = mvn.log_prob_terms(values)
+        assert len(res2) == 3
```
Suggested change:

```diff
-        assert len(res2) == 3
+        self.assertEqual(len(res2), 3)
```
Also in other places in the tests below.
I've found logging the inv_quad and logdet terms separately (rather than just the train loss) to be very helpful for debugging. Right now, classes like `VariationalELBO` have a `combine_terms` option that allows the user to sum the terms after the MLL call. This is a nice feature, since otherwise you essentially have to pay for an extra training step just to log the terms separately.

In this PR I've demonstrated how we could go about adding this option to the subclasses of `MarginalLogLikelihood`, starting with the Gaussian likelihood case. There are a few unit tests that aren't passing yet, but I wanted to check whether this feature would be approved before fixing it up.
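For context, here is the kind of training-loop logging this feature would enable (a sketch against the proposed API; `model`, `likelihood`, `optimizer`, and the training data are assumed to already exist):

```python
import gpytorch

# Hypothetical usage of the proposed combine_terms=False option.
mll = gpytorch.mlls.ExactMarginalLogLikelihood(likelihood, model, combine_terms=False)

for i in range(50):
    optimizer.zero_grad()
    output = model(train_x)
    # Terms come back separately (already scaled), plus any "other terms".
    inv_quad, logdet, norm_const, *other = mll(output, train_y)
    loss = -sum([inv_quad, logdet, norm_const, *other])
    loss.backward()
    optimizer.step()
    print(f"iter {i}: inv_quad={inv_quad.item():.3f}, logdet={logdet.item():.3f}")
```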