Test for whether `TransformModule`s work for density estimation #2544
Conversation
I worked out that the bug was that several transforms didn't update their parameters on each call of the forward or inverse operation. This is a problem after you've taken a gradient step during learning... We now test that density estimation is possible, and it passes for all transforms!
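For context, here is a minimal sketch of the kind of density-estimation smoke test being described; the exact test in this PR may differ, and the `Spline` usage below assumes pyro's released transforms API:

```py
import torch
import pyro.distributions as dist
from pyro.distributions.transforms import Spline

# Fit a spline flow to toy data and check that gradient steps actually
# change the density, i.e. that parameters are re-read on every call.
base = dist.Normal(torch.zeros(2), torch.ones(2))
transform = Spline(2)
flow = dist.TransformedDistribution(base, [transform])
optimizer = torch.optim.Adam(transform.parameters(), lr=1e-2)

x = torch.randn(100, 2)  # toy data
for _ in range(10):
    optimizer.zero_grad()
    loss = -flow.log_prob(x).mean()
    loss.backward()
    optimizer.step()
    flow.clear_cache()  # drop values cached under the pre-step parameters
```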
```py
# u_unnormed ~ (count_transforms, input_dim)
# Hence, input_dim must divide
u_unnormed = self.nn(context)
if self.count_transforms == 1:
```
Not part of this PR, but it is an antipattern to make return type depend on a scalar value. Doing so breaks the ability to write generic code using this interface. Has this behavior been released yet?
Yes, I'm afraid it's been released... Can you elaborate on why this would break the ability to write generic code? I think the underlying problem is actually how `DenseNN` shapes its outputs, and this is something we should think about more carefully during a major refactoring.
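To make the shape behavior under discussion concrete, here is a small illustration; it assumes I'm reading the released `pyro.nn.DenseNN` behavior correctly:

```py
import torch
from pyro.nn import DenseNN

context = torch.randn(5, 3)

# With a single entry in param_dims, the net returns a bare tensor...
net1 = DenseNN(3, [16], param_dims=[4])
out1 = net1(context)  # Tensor of shape (5, 4)

# ...but with several entries it returns a tuple of tensors.
net2 = DenseNN(3, [16], param_dims=[4, 4])
out2 = net2(context)  # tuple of two (5, 4) tensors
```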
warning: the following is highly subjective
An example of some generic code you might like to write is, say, a `ConditionalCatTransform` that concatenates two conditional transforms. You might want to fuse their neural nets to save on tensor ops and share statistical strength. How would you do this? Maybe
```py
class ConditionalCatTransform(...):
    def __init__(self, parts):
        super().__init__(cache_size=1)
        self.parts = parts  # ignoring ModuleList magic for demo purposes
        self.nn = memoize(cat_dense_nn([part.nn for part in self.parts]))
        end = 0
        for part in self.parts:
            beg, end = end, end + part.count_transforms
            # The following line assumes DenseNN returns a tuple; beg and end
            # are bound as default args to avoid Python's late-binding pitfall:
            part.nn = lambda context, beg=beg, end=end: self.nn(context)[beg:end]
```
then we could define helpers `memoize = functools.lru_cache(maxsize=1)` and
```py
def cat_dense_nn(parts):
    input_dims = parts[0].input_dims
    hidden_dims = parts[0].hidden_dims
    param_dims = sum([part.param_dims for part in parts], [])
    return DenseNN(input_dims, hidden_dims, param_dims)
```
That's what we could have written if the types were consistent; it would have been pretty simple generic code. But it looks like `DenseNN` returns an output type that depends on an int value, so we would need to complicate our wrapper with extra logic:
```diff
 class ConditionalCatTransform(...):
     def __init__(self, parts):
         ...
         for part in self.parts:
             beg, end = end, end + part.count_transforms
-            part.nn = lambda context, beg=beg, end=end: self.nn(context)[beg:end]
+            if beg + 1 == end:
+                part.nn = lambda context, beg=beg: self.nn(context)[beg]
+            else:
+                part.nn = lambda context, beg=beg, end=end: self.nn(context)[beg:end]
```
Now that's just a little more complex. But I feel, as an author of abstract code, that the complexity tax is best paid by one-off code, so that abstractions can be built tax free. I think this tax structure is one of the main differences between programming languages, e.g. R and MATLAB tend to tax library code, whereas Python and C tend to tax one-off code.
Okay, let's think about this when we have a chance to overhaul `DenseNN`.

@fritzo I'm not sure what's wrong with the AIR tutorial in Travis...
@neerajprad Do you have any ideas why the AIR tutorial might be failing? I haven't been able to reproduce locally.

I think #2549 should temporarily silence the FutureWarning from jupyter_client that's causing the tutorial tests to fail.

@stefanwebb Can you try merging in dev and pushing to trigger CI?

Thanks @neerajprad! It all passes now 😄
I've added a test for whether `TransformModule`s work for density estimation after discovering that `Spline` doesn't... I'll fix the bug in `Spline` in this PR.
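To illustrate the class of bug being fixed, here is a hypothetical minimal `TransformModule` (not pyro's `Spline`, and not code from this PR) written the safe way: the parameter is read inside `_call`/`_inverse`, so every forward or inverse pass sees the post-gradient-step value rather than one cached at construction time.

```py
import torch
from torch import nn
from torch.distributions import constraints
from pyro.distributions.torch_transform import TransformModule

class LearnableScale(TransformModule):
    """Hypothetical elementwise transform y = exp(log_scale) * x."""
    domain = constraints.real
    codomain = constraints.real
    bijective = True

    def __init__(self):
        super().__init__(cache_size=1)
        self.log_scale = nn.Parameter(torch.zeros(1))

    def _call(self, x):
        # Read the parameter at call time; caching exp(log_scale) in
        # __init__ would freeze the transform after the first optimizer step.
        return x * self.log_scale.exp()

    def _inverse(self, y):
        return y / self.log_scale.exp()

    def log_abs_det_jacobian(self, x, y):
        return self.log_scale.expand(x.shape)
```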