Revised distributions for stochasticmux #171

bmcfee · 2024-02-01T03:14:37Z

This PR implements several changes described in #148

New distributions for stochasticmux: const and binomial (new default)
Adjusted the poisson mode so that the expected value is actually rate and not rate+1

I've also relaxed the uniform convergence unit tests. A p-value of >=0.95 was probably overkill for the sample size we were drawing, and I've reduced it to 0.5. Strangely, poisson was giving me the most trouble here, while const and binomial were behaving better. It's probably an artifact of setting rate=2 in the test.

codecov · 2024-02-01T03:17:21Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 97.67%. Comparing base (9ad3511) to head (1f480e5).

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #171      +/-   ##
==========================================
- Coverage   97.78%   97.67%   -0.12%     
==========================================
  Files           8        8              
  Lines         542      559      +17     
==========================================
+ Hits          530      546      +16     
- Misses         12       13       +1

Flag	Coverage Δ
unittests	`97.67% <100.00%> (-0.12%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

bmcfee · 2024-02-01T03:31:30Z

There's a bit of weirdness here in the initialization with binomial mode. (Will come back to this later...)

The binomials are parametrized by Bin((rate-1)/(1-p), 1-p) where p is the probability of selecting the streamer from the active set. The reason for this dependence is subtle, but it ensures that the replacement times for streamers don't concentrate too much; a streamer gets replaced on average every rate * N_active samples, with variance like rate * N_active * (N_active-1). (At least, assuming uniform weights on the streamers.)

The weirdness arises when we initialize streamers on at a time: the weight distribution is not fully known until the first batch of active streamers are fully initialized, so the calculations for p above are generally going to be wrong. In the extreme case, the first active streamer will use the poisson approximation (since p=1 when there are no other streamers active yet); the second streamer will have bin((rate-1)/0.5, 0.5) (assuming uniform weights again), the third will have bin((rate-1)/0.666, 0.666), and so on. In all cases, the expected values will be the same, but the first few streamers will have higher rate variance than the later ones. Specifically, the variance sequence will look like (again, assuming uniform streamer weights) of (rate-1)/n (for n=1,2, ..., n_active).

Now, we could hack around this by pre-determining the weights so that everything is primed properly. However, I think it might actually be beneficial to leave it as is because it injects more randomness in the rate distributions early on, which ought to have a comparable effect to having random offsets in a burn-in phase as @ejhumphrey suggested in #132 .

It's a bit weird, but given the potentially dynamic nature of the active stream distribution (especially in exhaustive mode), it won't be possible to always ensure that the rate distribution for a streamer is "correct" over time. The best we can do is sample the rate value according to whatever the distribution will be at the time the streamer is activated.

bmcfee · 2024-02-01T14:58:14Z

Having slept on it, i think a better solution here is to initialize the active set weights by a random draw from the weights array instead of with zeros. This won't have any effect on const or poisson, but it will put the binomial mode in a less quirky position at initialization time.

bmcfee · 2024-03-08T16:00:53Z

@cjacoby 👋 I know it's been a gajillion years, but do you have any interest in looking this over? I think it's basically good to go, but it does have some kinda breaky behavior relative to older versions that I'd like to get another set of eyes on.

Quick TLDR is summarized in #148 (comment)

cjacoby

lgtm other than I think a small comment improvement would improve it (for me in 6mo to a year when I come back and can't remember what this is about).

pescador/mux.py

bmcfee · 2024-03-11T19:47:38Z

Ok, doc section is added and back-link is included. I tried to clean it up a bit from my original notebook (4 years ago!) and put in some expository text. Hopefully it makes sense?

implemented #148

dae9d93

bmcfee added enhancement API labels Feb 1, 2024

bmcfee added this to the 3.0.0 milestone Feb 1, 2024

linting

d0b0e3b

bmcfee added 6 commits February 1, 2024 10:47

warm-start the active set distribution for binomial mode consistency

4cfb069

smoothing out edge cases in new distribution modes

a0fc545

linting

2eb45de

updated codecov action

839ee2b

trying without directory info

a1091ef

updating test comments

1d79e1e

cjacoby requested changes Mar 8, 2024

View reviewed changes

pescador/mux.py Show resolved Hide resolved

bmcfee added 2 commits March 11, 2024 11:35

adding mux analysis doc stub

5c7a453

updated documentation for stochastic mux

1f480e5

cjacoby approved these changes Mar 11, 2024

View reviewed changes

cjacoby merged commit 19a3f37 into main Mar 11, 2024
11 of 12 checks passed

cjacoby deleted the distributions branch March 11, 2024 23:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revised distributions for stochasticmux #171

Revised distributions for stochasticmux #171

bmcfee commented Feb 1, 2024

codecov bot commented Feb 1, 2024 •

edited

Loading

bmcfee commented Feb 1, 2024

bmcfee commented Feb 1, 2024

bmcfee commented Mar 8, 2024

cjacoby left a comment

bmcfee commented Mar 11, 2024

Revised distributions for stochasticmux #171

Revised distributions for stochasticmux #171

Conversation

bmcfee commented Feb 1, 2024

codecov bot commented Feb 1, 2024 • edited Loading

Codecov Report

bmcfee commented Feb 1, 2024

bmcfee commented Feb 1, 2024

bmcfee commented Mar 8, 2024

cjacoby left a comment

Choose a reason for hiding this comment

bmcfee commented Mar 11, 2024

codecov bot commented Feb 1, 2024 •

edited

Loading