
Stabilize autoguide scale parameters via SoftplusTransform #2767

Merged
merged 25 commits on Feb 28, 2021

Conversation

@fritzo (Member) commented Feb 19, 2021

Resolves #2766
Blocked by #2753
Ports pytorch/pytorch#52300

This switches to using softplus transforms for autoguide scale parameters (point 2 in pyro-ppl/numpyro#855 (comment)), and adds relevant machinery:

  • constraints.softplus_positive and SoftplusTransform
  • constraints.softplus_lower_cholesky and SoftplusLowerCholeskyTransform

This does not use softplus transforms for latent variables that are scales (point 1 in pyro-ppl/numpyro#855 (comment)). While this PR adds machinery to declare and perform those transforms, it remains to detect which positive latent variables should be softplus-transformed rather than exp-transformed.
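
For readers unfamiliar with this machinery, here is a minimal usage sketch (not code from this PR; only the name constraints.softplus_positive comes from the PR, the parameter name and shape are illustrative) of declaring a positive parameter that is optimized through a softplus rather than an exp bijection:

# Hypothetical usage sketch: a positive parameter whose unconstrained value
# is transformed by softplus instead of exp during optimization.
import torch
import pyro
import pyro.distributions.constraints as constraints

scale = pyro.param("my_scale", torch.ones(3),
                   constraint=constraints.softplus_positive)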

cc @vitkl @fehiepsi

Tested

  • needed to tweak parameters in a couple of inference tests (tests now actually run a bit faster)

@fritzo changed the title Stable autoguide scale → Stablize autoguide scale Feb 19, 2021
@fritzo changed the title Stablize autoguide scale → Stabilize autoguide scale Feb 19, 2021
@fritzo changed the title Stabilize autoguide scale → Stabilize autoguide scale parameters Feb 19, 2021
@fehiepsi (Member) commented Feb 19, 2021

Just curious, what makes you think this will be more stable? (is it numerical or theoretical?)

@fritzo (Member, Author) commented Feb 19, 2021

Just curious, what makes you think this will be more stable?

I don't understand why this PR would be more stable 🤷 I'm just trying to implement @vitkl's point 2 in pyro-ppl/numpyro#855 (comment), which he claimed was more stable. It's plausible I've misunderstood something, and also plausible that a better solution would be to use ClippedAdam or a lower learning rate.

@vitkl can you confirm (1) this PR implements roughly what you're requesting, (2) that you've shown it is more numerically stable, and (3) that you tried simpler solutions like ClippedAdam or tweaking optimizer parameters?
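
For context on the stability claim under discussion, here is a small standalone illustration (not part of this PR; it assumes only plain PyTorch) of how the two positive bijections behave at a large unconstrained value: exp and its gradient blow up together, while softplus grows roughly linearly with a gradient bounded by 1.

# Illustration only: compare exp vs softplus as positive-constraining bijections.
import torch
import torch.nn.functional as F

x = torch.tensor(20.0, requires_grad=True)
y_exp = torch.exp(x)   # ~4.9e8; d/dx exp(x) = exp(x), equally huge
y_sp = F.softplus(x)   # ~20.0;  d/dx softplus(x) = sigmoid(x) < 1

(g_exp,) = torch.autograd.grad(y_exp, x)
(g_sp,) = torch.autograd.grad(y_sp, x)
print(float(g_exp), float(g_sp))  # huge gradient vs. gradient just under 1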

@fritzo fritzo changed the title Stabilize autoguide scale parameters Stabilize autoguide scale parameters via SoftplusTransform Feb 21, 2021
@fritzo (Member, Author) commented Feb 21, 2021

@fehiepsi do you think we should name these constraints softplus_positive and softplus_lower_cholesky? That seemed like mixing metaphors to me, but I suppose it's better than introducing yet another word 'stable'. I appreciate your help in choosing good names.

@fehiepsi (Member) commented:

@fritzo I don't have a better solution. Those softplus_foo names look good to me. In the TFP wrapper, because distributions there do not have constraints, I created a class named BijectorConstraint(bijector), which is similar to what we have here (:D): Bijector <-> softplus, Constraint <-> positive.

@vitkl (Contributor) commented Feb 21, 2021

Hi @fehiepsi @fritzo

Thanks for this.

  1. This implements exactly what I requested - however in the numpyro issue (Softplus transform as a more numerically stable way to enforce positive constraint numpyro#855 (comment)) you also discussed making that an option rather than the default:
class AutoNormal(..., use_softplus=False):

    if use_softplus:
        _deep_setattr(self.scales, name,
                      PyroParam(init_scale, constraints.stable_positive, event_dim))
    else:
        _deep_setattr(self.scales, name,
                      PyroParam(init_scale, constraints.positive, event_dim))
  2. It is more numerically stable for the cell2location model (tested in NumPyro). Softplus is also used by PyMC3.

  3. We see that ClippedAdam and a reduced learning rate (0.002 -> 0.0002) do not help (again tested in NumPyro). We have not yet done an exhaustive search of training hyperparameters.

I am planning to add a benchmark (to https://github.com/pyro-ppl/sandbox) that shows the impact of this modification on stability and accuracy for our model. To simplify the analysis, it would help to have the switch option AutoNormal(..., use_softplus=False) in both Pyro and NumPyro.

@fritzo removed the discussion label Feb 22, 2021
@fritzo marked this pull request as ready for review February 22, 2021 20:09
@fritzo changed the base branch from pytorch-nightly to dev February 22, 2021 20:09
@fritzo (Member, Author) commented Feb 22, 2021

Hi @vitkl, I believe this should be ready to go.

making that an option rather than the default

Hmm, I'm hesitant to add interface complexity to our so-called AutoGuides, especially since I suspect your experiments will show softplus should be the default. At the same time I'd like to make it easy for you to run experiments. WDYT of this compromise: we make the constraint hackable but not publicly configurable? That is, we create a class-level variable

class AutoNormal(AutoGuide):
    scale_constraint = constraints.softplus_positive    # <--- hackable but not documented
    def __init__(self, model, *,
                 init_loc_fn=init_to_feasible,
                 init_scale=0.1,
                 create_plates=None):
        ...

Then in your experiments you can override this via

guide_exp = AutoNormal(model)
guide_exp.scale_constraint = constraints.positive

guide_softplus = AutoNormal(model)
guide_softplus.scale_constraint = constraints.softplus_positive

Hope I'm not overthinking this, I'd just like to avoid interface bloat 😄 @fehiepsi does this seem ok to you?

@fehiepsi (Member) commented:

Sure! This only changes how we optimize the scale parameters. It will probably affect some users' current inference code, but I think that is easy to fix... Btw, we should mention why we made this change in the next release notes. :)

@fehiepsi (Member) previously approved these changes Feb 22, 2021 and left a review comment:

LGTM, pending passing tests and exposing those new transforms in the docs. Thanks for addressing this issue, @fritzo!

@vitkl (Contributor) commented Feb 22, 2021

Thanks a lot for implementing this. The class variable approach works great. We are in the middle of resubmitting the paper revisions this week, but I will try to do the testing soon!

@fehiepsi (Member) previously approved these changes Feb 28, 2021 and left a review comment:

LGTM, thanks @fritzo! I'll port this to NumPyro soon. Do you want to expose those transforms in docs?

Successfully merging this pull request may close these issues:

Softplus transform for AutoNormal scales [feature request]