Add an AutoStructured guide and StructuredReparam #2812
Conversation
Looks clean to me! I can confirm that this is equivalent to learning the arrowhead matrix directly:
A = [[L @ L.T + w @ D @ w.T, w @ D], [D @ w.T, D]]
where L is the scale_tril of x_aux, D is the variance of y_aux, and w is dep.weight.
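As a sanity check on that identity, here is a small numerical sketch (not from the PR; the interpretation x = x_aux + w @ y_aux, y = y_aux and all shapes are illustrative assumptions):

```python
import torch

n, m = 3, 2                      # sizes of x_aux and y_aux (arbitrary)
L = torch.randn(n, n).tril()     # stands in for the scale_tril of x_aux
L.diagonal().abs_()              # keep the diagonal positive
d = torch.rand(m) + 0.1          # stands in for the variance of y_aux
D = torch.diag(d)
w = torch.randn(n, m)            # stands in for dep.weight

# Covariance of (x, y) with x = x_aux + w @ y_aux and y = y_aux:
A = torch.cat([
    torch.cat([L @ L.T + w @ D @ w.T, w @ D], dim=-1),
    torch.cat([D @ w.T, D], dim=-1),
], dim=-2)
assert torch.allclose(A, A.T)    # the arrowhead matrix is symmetric
torch.linalg.cholesky(A)         # and positive definite, so a valid covariance
```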
pyro/infer/autoguide/guides.py (outdated)
```python
scale_tril = scale[..., None] * scale_tril
aux_value = pyro.sample(
    name + "_aux",
    dist.MultivariateNormal(zero, scale_tril=scale_tril),
)
```
If we factor this out to scale_tril @ Normal(0, 1), I guess HMC will be a bit happier.
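For illustration, a minimal sketch of that factorization (site name, shapes, and values here are placeholders, not the PR's code):

```python
import torch
import pyro
import pyro.distributions as dist

name = "x"                       # illustrative site name
scale_tril = torch.eye(3)        # stands in for the learned Cholesky factor
zero = torch.zeros(3)

# Sample white noise at the auxiliary site ...
white = pyro.sample(name + "_aux", dist.Normal(zero, 1.0).to_event(1))
# ... then apply the affine map by hand. The distribution of aux_value is the
# same, but HMC operates on the better-conditioned whitened variable.
aux_value = (scale_tril @ white.unsqueeze(-1)).squeeze(-1)
# Note: inside a guide this change of variables also needs a log|det J|
# correction, which is what the follow-up comments discuss.
```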
Great point, I guess that is equivalent to reparametrizing. I've also changed Normal(0, scale) to Normal(0, 1) * scale in the "normal" case.
I think you will need to add the logdet of those affine transforms. How about using dist.TransformedDistribution(dist.Normal(...), LowerCholeskyAffine(...)) so that we can use TransformReparam in the reparam?
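A rough sketch of that alternative, assuming placeholder shapes and site name (only the TransformedDistribution / LowerCholeskyAffine wiring is the point here):

```python
import torch
import pyro
import pyro.distributions as dist
from pyro.distributions.transforms import LowerCholeskyAffine

name = "x"                       # illustrative site name
zero = torch.zeros(3)
scale_tril = torch.eye(3)        # stands in for the learned Cholesky factor

base = dist.Normal(zero, 1.0).to_event(1)
aux_dist = dist.TransformedDistribution(base, LowerCholeskyAffine(zero, scale_tril))
aux_value = pyro.sample(name + "_aux", aux_dist)
# A site written this way can be handled by TransformReparam, which takes care
# of the log|det J| bookkeeping instead of adding the terms by hand.
```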
Thanks, I've added the logdet terms by hand here since it's simpler. Does it look right now?
Yes, it looks correct to me.
Hmm, I'm seeing very different results with the two versions, and this change seems to have broken my SVI inference. I've been staring at these two versions and I can't seem to see the difference:
```python
# Version 1. This works.
aux_value = pyro.sample(..., Normal(zero, scale).to_event(1))

# Version 2. This is in pyro dev, but no longer works.
aux_value = pyro.sample(..., Normal(zero, 1).to_event(1))
aux_value = aux_value * scale
log_density = log_density - scale.log().sum(-1)
```
Any ideas @fehiepsi?
I believe the two versions are equivalent... Not sure what's going on. Let me play with some tests to see if the ELBO is the same for the two versions.
Thanks, I'll do the same, at least to create a unit test I can run locally (not on some huge model on a GPU cloud machine).
I think I found the issue. Here log_density is calculated as the sum over all dimensions of the site. However, the ldj term, which is used to compute the logdet of the unconstrained->constrained transform, keeps the batch dimension. So summing them will give the wrong result if this site is under some plate. I guess we should use pyro.factor for those log_density terms. What do you think?
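A rough sketch of the pyro.factor idea (placeholder shapes; not the fix that was actually committed): sum the correction only over the event dimension and register it as a factor, so any leading plate dimensions are reduced by Pyro itself rather than being collapsed prematurely.

```python
import torch
import pyro

name = "x"                       # illustrative site name
scale = torch.ones(4, 3)         # e.g. a (plate, event)-shaped scale parameter

# Sum only over the event dimension; the leading plate dimension survives,
# so the enclosing plate can account for it instead of double counting.
pyro.factor(name + "_aux_log_density", -scale.log().sum(-1))
```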
@fehiepsi thanks, yes I now see the error. I'll think about this and submit a fix ASAP.
This is great! I'll try to find some time next week to port this to NumPyro (hopefully it will be straightforward).
Addresses #2813
This adds a flexible AutoStructured guide that allows a variety of distributions to model each latent site (Delta, Normal, or MultivariateNormal), together with a mechanism to declare (link-)linear dependencies between latent variables. As discussed with @fehiepsi, this aims to (1) generalize guides with arrowhead covariance structure while (2) learning parameters that can be cheaply used to precondition NUTS via a reparameterizer, StructuredReparam.

This also adds a simple StructuredReparam that uses a trained AutoStructured guide to precondition a model for use in HMC. This new (guide, reparam) pair can be seen as a structured version of the monolithic (AutoContinuous, NeuTraReparam) pair, in the same sense that AutoNormal is a structured version of the monolithic AutoDiagonalNormal guide.

My main motivation is to use this for high-dimensional models (e.g. 300000 latent variables) with a structured precision matrix, and then use that structured precision matrix as a preconditioner for NUTS.

(Note this does not implement Automatic structured variational inference, a variational family whose structure is strictly limited to dependencies in the model. Nor does this first PR implement automatic suggestion of the guide structure as in Faithful inversion of generative models for effective amortized inference.)
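A hedged usage sketch of the (guide, reparam) pair described above, loosely following the NeuTraReparam workflow. Here model and data are user-defined placeholders, and the constructor arguments (conditionals, dependencies) as well as the .reparam(...) call are assumptions about the API rather than quotes from this PR:

```python
import pyro
import pyro.optim
from pyro.infer import MCMC, NUTS, SVI, Trace_ELBO
from pyro.infer.autoguide import AutoStructured
from pyro.infer.reparam import StructuredReparam

# Step 1: train the structured guide with SVI.
guide = AutoStructured(
    model,                                      # user-defined model (assumed)
    conditionals={"x": "mvn", "y": "normal"},   # per-site distribution family
    dependencies={"y": {"x": "linear"}},        # (link-)linear dependency y <- x
)
svi = SVI(model, guide, pyro.optim.Adam({"lr": 0.01}), Trace_ELBO())
for _ in range(1000):
    svi.step(data)                              # user-provided data (assumed)

# Step 2: reuse the learned structure to precondition NUTS.
# (The .reparam(model) call mirrors NeuTraReparam and may differ in the final API.)
reparam_model = StructuredReparam(guide).reparam(model)
mcmc = MCMC(NUTS(reparam_model), num_samples=500)
mcmc.run(data)
```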
Tested:
- AutoStructured
- StructuredReparam