Use approximate log_beta() in .fit(), .predict() #2502

fritzo · 2020-05-24T23:29:20Z

Addresses #2426
Blocking #2498 (from which this simpler PR was extracted)

This PR:

Fixes a NaN gradient issue in ExtendedBetaBinomial related to pytorch/pytorch#15506 ("nan propagates through backward pass even when not accessed"). This may have caused spurious rejections during heuristic initialization.
Starts using the fast log_beta() in CompartmentalModel.fit() and .predict() now that gradients are safe. This reduces likelihood compute cost by about 14% in SuperspreadingSEIRModel, and will reduce it further in #2498 ("Add overdispersed models to contrib.epidemiology").
Moves safe_log() to pyro.ops.special, which seems like a better home. (This change is non-breaking since safe_log() has not yet been released.)
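
The description above does not spell out how the approximate log_beta() works. As a rough illustration of why an approximation can be cheaper than three exact log-gamma evaluations, here is a plain-Python sketch based on Stirling's series. The function names are hypothetical and this is not Pyro's implementation.

```python
import math


def log_beta_exact(x, y):
    """Reference value: log B(x, y) via three log-gamma evaluations."""
    return math.lgamma(x) + math.lgamma(y) - math.lgamma(x + y)


def log_beta_stirling(x, y):
    """Illustrative approximation (hypothetical helper, not Pyro's code).

    Uses lgamma(z) ~ (z - 0.5) * log(z) - z + 0.5 * log(2 * pi) + 1 / (12 * z).
    The linear "- z" terms cancel in lgamma(x) + lgamma(y) - lgamma(x + y).
    """
    leading = (
        (x - 0.5) * math.log(x)
        + (y - 0.5) * math.log(y)
        - (x + y - 0.5) * math.log(x + y)
        + 0.5 * math.log(2 * math.pi)
    )
    correction = (1 / x + 1 / y - 1 / (x + y)) / 12
    return leading + correction


# Absolute error for moderately large and for small arguments.
error_large = abs(log_beta_stirling(10.0, 20.0) - log_beta_exact(10.0, 20.0))
error_small = abs(log_beta_stirling(0.5, 0.5) - log_beta_exact(0.5, 0.5))
```

In this sketch the error is tiny for moderately large arguments and much larger for small ones, so a practical version would need some way to handle small arguments.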

Tested

added unit tests for nan gradients in ExtendedBinomial and ExtendedBetaBinomial
ran locally to verify sane behavior

fritzo · 2020-05-24T23:30:07Z

pyro/distributions/extended.py

+        n = total_count.clamp(min=0)
+        k = value.masked_fill(invalid, 0)


This is the crux of the NaN gradient fix.
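
To illustrate the general pattern (not the actual ExtendedBetaBinomial code), here is a minimal PyTorch sketch in which torch.log stands in for the log-probability terms. Masking only the output leaves a NaN in the gradient, while also replacing the invalid inputs before the computation keeps the gradient finite. The helper names are hypothetical.

```python
import torch


def masked_log_naive(x):
    # Mask only the *output*: the forward pass looks fine, but the backward
    # pass still differentiates log() at the invalid input.
    invalid = x <= 0
    return torch.where(invalid, torch.zeros_like(x), torch.log(x))


def masked_log_safe(x):
    # Also replace invalid *inputs* with a harmless value before the
    # computation, analogous to value.masked_fill(invalid, 0) in the diff.
    invalid = x <= 0
    x_safe = x.masked_fill(invalid, 1.0)
    return torch.where(invalid, torch.zeros_like(x), torch.log(x_safe))


def grad_of_sum(fn, values):
    # Gradient of fn(x).sum() with respect to x.
    x = torch.tensor(values, requires_grad=True)
    fn(x).sum().backward()
    return x.grad


grad_naive = grad_of_sum(masked_log_naive, [0.0, 1.0, 2.0])
grad_safe = grad_of_sum(masked_log_safe, [0.0, 1.0, 2.0])
```

Here the first entry of grad_naive is NaN, because the zero upstream gradient is divided by the zero input inside log's backward pass, whereas every entry of grad_safe is finite.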

martinjankowiak · 2020-05-24T23:49:44Z

pyro/distributions/extended.py

@@ -23,7 +23,7 @@ class ExtendedBinomial(Binomial):

    def log_prob(self, value):
        result = super().log_prob(value)
-        invalid = ~super().support.check(value)
+        invalid = (value < 0) | (value > self.total_count)


curious: why this change?

The new version is a little cheaper. The old integer_interval version additionally checks value % 1 == 0, but that is already checked in validation by the above line, and it doesn't affect numerical stability.

fritzo · 2020-05-25T18:40:51Z

Thanks for reviewing!

fritzo added 2 commits May 24, 2020 16:10

Fix ExtendedBetaBinomial gradient issue

26041fc

Use approximate log_beta in CompartmentalModel

9d524da

fritzo added the awaiting review label May 24, 2020

fritzo requested a review from martinjankowiak May 24, 2020 23:29

martinjankowiak previously approved these changes May 24, 2020

Revive change that had been lost in merge conflict

ca2e084

fritzo dismissed martinjankowiak’s stale review via ca2e084 May 25, 2020 01:40

Fix another merge conflict error

9d2351d

martinjankowiak approved these changes May 25, 2020

martinjankowiak merged commit aa40beb into dev May 25, 2020

fritzo deleted the sir-approx branch June 5, 2020 15:31
