Improve performance for large sample sizes #30

longouyang · 2016-06-14T04:58:49Z

No description provided.

hawkrobe · 2016-10-13T01:09:53Z

Any ideas on how to start on this? Calling viz.marginals on a probmods example with >100,000 samples (and four variables in the joint distribution) takes an order of magnitude more time than inference

longouyang · 2016-10-13T02:37:01Z

There are a couple of bottlenecks:

The projected-out distributions are computed in a concise but inefficient way
Density estimation for continuous data is implemented naively. There are various tricks (e.g., FFT and tree-based computation, other stuff) to make this faster but it might not be worth the effort, given that we'll probably want kernel-based aggregators in core webppl anyway (Thoughts on design of aggregators / marginal ERPs webppl#369). Also, if I had to guess, the main bottleneck is probably the projection, not kde.

hawkrobe · 2016-10-13T03:20:37Z

Thanks for the tips -- I might take a look.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance for large sample sizes #30

Improve performance for large sample sizes #30

longouyang commented Jun 14, 2016

hawkrobe commented Oct 13, 2016

longouyang commented Oct 13, 2016

hawkrobe commented Oct 13, 2016

Improve performance for large sample sizes #30

Improve performance for large sample sizes #30

Comments

longouyang commented Jun 14, 2016

hawkrobe commented Oct 13, 2016

longouyang commented Oct 13, 2016

hawkrobe commented Oct 13, 2016