New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Add probability tests to check vectorized/scalar lpmfs/lpdfs are the same #1989

Closed

bbbales2 wants to merge 57 commits into develop from bugfix/issue-1861-scalars-vs-vectors

Member

bbbales2 commented Jul 28, 2020 •

edited

Loading

Summary

This addresses the missing tests from #1861

Edit: Breaking up this pull into pieces. I'm gonna leave this here until it's done.

So far #2039, #2041, and #2042

Tests

Side Effects

Release notes

Added extra tests to check lpdfs/lpmfs evaluated with vectors produce the same results as with scalars

Checklist

Math issue Probability test framework didn't catch bug with vectorization #1861
Copyright holder: Columbia University

The copyright holder is typically you or your assignee, such as a university or company. By submitting this pull request, the copyright holder is agreeing to the license the submitted work under the following licenses:
- Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
- Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/)
the basic tests are passing
- unit tests pass (to run, use: ./runTests.py test/unit)
- header checks pass, (make test-headers)
- dependencies checks pass, (make test-math-dependencies)
- docs build, (make doxygen)
- code passes the built in C++ standards checks (make cpplint)
the code is written in idiomatic C++ and changes are documented in the doxygen
the new changes are tested

bbbales2 and others added 3 commits

July 27, 2020 21:11


          Added test to check that vectorized lpdfs/lpmfs the same as evaluatin…

d6c5225

…g them in a non-vectorized way (Issue #1861)


          Re-indenting code (Issue #1861)

bd2ebc1


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

ea07b9e

…4.1 (tags/RELEASE_600/final)

bbbales2 changed the title ~~Bugfix/issue 1861 scalars vs vectors~~ Add probability tests to check vectorized/scalar lpmfs/lpdfs are the same

bbbales2 and others added 4 commits

July 28, 2020 14:08


          Changed how memory is handled to avoid segfaults. Fixed vector handli…

0aa88c1

…ng in as_scalars_vs_as_vectors to be like repeat_as_vectors (Issue #1861)


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

fdc7757

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          Merge commit 'fdf5db851ea6f2c5dc59fcb9e9aa45b24b202afe' into HEAD

5db698a


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

35b74c4

…4.1 (tags/RELEASE_600/final)

Member Author

bbbales2 commented Jul 28, 2020

I also intend to fix: #1978 with this pull request. And similarly there should be tests for the lccdfs and the cdfs. These should basically be copy-paste to add once the lpdf/lpmf code is there.

bbbales2 and others added 21 commits

July 28, 2020 15:16


          Finished merge (Issue #1861)

6ebcaad


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

0ef320e

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

778280f

…4.1 (tags/RELEASE_600/final)


          Fixed handling of Eigen matrices of fvar<T> types in test framework (…

95b4288

…Issue #1861)


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

a631664

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

f2d7504

…4.1 (tags/RELEASE_600/final)


          Fixed Frechet distribution for higher order autodiff (Issue #1861)

a080c63


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

84310eb

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          Added Frechet test in mix (Issue #1861)

3f2c5ea


          Merge commit 'd34f10a67df9affb3e12af4b7f2a7fd4d6f757d3' into HEAD

69045e7


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

acb7a3f

…4.1 (tags/RELEASE_600/final)


          Switched test framework to use equality checks from unit tests which …

9304cf9

…work with things near zero better (Issue #1861)


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

79231ca

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

e68772a

…4.1 (tags/RELEASE_600/final)


          Fixed Gumbel test distribution implementation (Issue #1861)

70de131


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

e84ff0a

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          Replaced the comparisons in probability distribution comparisons with…

8e4886c

… expect_near_rel from test/unit (Issue #1861)


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

791d99c

…4.1 (tags/RELEASE_600/final)


          Adjusted tolerances for finite difference comparison (Issue #1861)

bd880ee


          Merge branch 'bugfix/issue-1861-scalars-vs-vectors' of https://github…

e0277ca

….com/stan-dev/math into bugfix/issue-1861-scalars-vs-vectors


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

f934306

…4.1 (tags/RELEASE_600/final)

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

-                  }
-                  plus[n] += e;
-                  minus[n] -= e;
+                  auto f_wrap = [&](const Eigen::VectorXd& e) {

Member Author

bbbales2 Aug 10, 2020

The stan math finite difference function is higher order and easier to use, so I defer to it.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

@@ @@ -257,8 +260,8 @@ class AgradCcdfLogTestFixture : public ::testing::Test { @@
                 // works for <var>
                 double calculate_gradients_1storder(vector<double>& grad, var& ccdf_log,
                                                     vector<var>& x) {
+                  stan::math::set_zero_all_adjoints();

Member Author

bbbales2 Aug 10, 2020

These gradient functions get called a lot in a sequence. If we do stan::math::recover_memory we clear the autodiff stack and then the tests aren't meaningful. I switched the recover_memory s to set_zero_all_adjoint s and put recover_memory calls in the tests that use the calculate_gradients_* functions.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

@@ @@ -273,7 +276,7 @@ class AgradCcdfLogTestFixture : public ::testing::Test { @@
                 // works for fvar<double>
                 double calculate_gradients_1storder(vector<double>& grad,
                                                     fvar<double>& ccdf_log, vector<var>& x) {
-                  x.push_back(ccdf_log.d_);
+                  grad.push_back(ccdf_log.d_);

Member Author

bbbales2 Aug 10, 2020

Pushing stuff into x doesn't do anything. I think this was a bug. grad is the thing that gets checked.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

+                           << "  grads:        " << gradients;
+                    stan::test::expect_near_rel(stream.str(), finite_dif[i], gradients[i],
+                                                stan::test::relative_tolerance(1e-4, 1e-7));

Member Author

bbbales2 Aug 10, 2020

1e-4 relative error (this is what the unit tests use for gradients, see here: https://github.com/stan-dev/math/blob/develop/test/unit/math/ad_tolerances.hpp)

I think I used 1e-7 for the minimum error tolerance cause 1e-8 didn't work for some function. I'm fuzzy on this.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

-                    test_gradients_equal(expected_gradients1, gradients1);
-                    test_gradients_equal(expected_gradients2, gradients2);
-                    test_gradients_equal(expected_gradients3, gradients3);
+                    test_gradients_equal(expected_gradients1, gradients1, 1e-3);

Member Author

bbbales2 Aug 10, 2020

The reference implementation gradients are quite bad. I had to use a relative tolerance of 1e-3 to get them to pass (finite difference worked with 1e-4).

It's stuff like gamma_p and gamma_q I think (#2006).

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

-                    add_vars(s2, p0_, p1_, p2_, p3_, p4_, p5_);
-                    add_vars(s3, p0_, p1_, p2_, p3_, p4_, p5_);
+                    vector<var> scalar_vars;
+                    add_vars(scalar_vars, p0_, p1_, p2_, p3_, p4_, p5_);

Member Author

bbbales2 Aug 10, 2020

s1, s2, and s3 seemed like duplicates so I simplified things.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

-                    calculate_gradients_1storder(multiple_gradients3, multiple_ccdf_log, x1);
+                    calculate_gradients_1storder(multiple_gradients1, multiple_ccdf_log,
+                                                 vector_vars);
+                    calculate_gradients_2ndorder(multiple_gradients2, multiple_ccdf_log,

Member Author

bbbales2 Aug 10, 2020

Previously we were only computing 1st order gradients.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_ccdf_log.hpp

+                  }
+                }
+                void test_as_scalars_vs_as_vector() {

Member Author

bbbales2 Aug 10, 2020

This test should catch errors like #1978 and #1861 for lccdfs.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_cdf.hpp

                   }
                 }
+                void test_as_scalars_vs_as_vector() {

Member Author

bbbales2 Aug 10, 2020

This test should catch errors like #1978 and #1861 for cdfs.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_cdf.hpp

@@ @@ -3,6 +3,7 @@ @@
               #include <stan/math/rev.hpp>
               #include <test/prob/utility.hpp>
+              #include <test/unit/math/expect_near_rel.hpp>

Member Author

bbbales2 Aug 10, 2020

All the changes in this file are similar to the equivalent ones in the lccdf file.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_cdf_log.hpp

@@ @@ -3,6 +3,7 @@ @@
               #include <stan/math/rev.hpp>
               #include <test/prob/utility.hpp>
+              #include <test/unit/math/expect_near_rel.hpp>

Member Author

bbbales2 Aug 10, 2020

This should fix #1978. The changes in this file are similar to the ones in the lccdf and cdf checks.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_cdf.hpp

-                    T_return_type cdf
-                        = TestClass.template cdf<Scalar0, Scalar1, Scalar2, Scalar3, Scalar4,
-                                                 Scalar5>(p0_, p1_, p2_, p3_, p4_, p5_);
+                    T_return_type single_cdf = pow(

Member Author

bbbales2 Aug 10, 2020

You'll notice a pow here. In the old version we were comparing gradients of something like grad(x) with the gradients of something like grad(x^r). To do this there was extra compare logic in test_multiple_gradient_values.

As I recall this was confusing for higher order things, so I just added a pow here so we're comparing grad(x^r) directly against grad(x^r) computed another way and there's no confusion.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_distr.hpp

@@ @@ -3,6 +3,7 @@ @@
               #include <stan/math/mix.hpp>
               #include <test/prob/utility.hpp>
+              #include <test/unit/math/expect_near_rel.hpp>

Member Author

bbbales2 Aug 10, 2020

This should basically be the same as the lccdf, cdf, and lcdf checks.

bbbales2 commented

View reviewed changes

test/prob/test_fixture_distr.hpp

                   }
                 }
+                void test_as_scalars_vs_as_vector() {

Member Author

bbbales2 Aug 10, 2020

This should fix #1861.

bbbales2 commented

View reviewed changes

test/prob/utility.hpp

               }  // namespace std
               // ------------------------------------------------------------
+              template <typename T>

Member Author

bbbales2 Aug 10, 2020

I moved these definitions up here so they are visible to get_params.

bbbales2 commented

View reviewed changes

test/prob/utility.hpp

    
              // default template handles Eigen::Matrix

              template <typename T>

              T get_params(const vector<vector<double>>& parameters, const size_t p) {

                T param(parameters.size());

                for (size_t n = 0; n < parameters.size(); n++)

                  if (p < parameters[0].size())

                    param(n) = parameters[n][p];

                    param(n) = get_param<stan::scalar_type_t<T>>(parameters[n], p);

Member Author

bbbales2 Aug 10, 2020

For higher order autodiff types, we need to initialize each of the params with something more than just casting up a double.

For vars, we assign the derivative term to 1.0, for instance.

If we don't do this code that depends on get_params and get_param getting the same params fails.

bbbales2 commented

View reviewed changes

test/prob/von_mises/von_mises_test.hpp

                   param[0] = boost::math::constants::third_pi<double>();
                   param[1] = boost::math::constants::sixth_pi<double>();
-                  param[2] = 1e-8;
+                  param[2] = 1e-2;

Member Author

bbbales2 Aug 10, 2020

Larger test value to avoid finite difference out of range errors

bbbales2 commented

View reviewed changes

test/unit/math/mix/prob/frechet_test.cpp

+                  return stan::math::frechet_lpdf<false>(y, alpha, beta);
+                };
+                stan::test::expect_ad(f, 2.0, 1.0, 1.0);

Member Author

bbbales2 Aug 10, 2020

I added this test since the higher order Frechet stuff was failing. I think I could remove it but it seems fine to me.

bbbales2 commented

View reviewed changes

test/unit/math/prim/functor/ode_rk45_prim_test.cpp

@@ @@ -1,68 +1,10 @@ @@
               #include <stan/math/prim.hpp>
               #include <gtest/gtest.h>
               #include <test/unit/util.hpp>
+              #include <test/unit/math/prim/functor/ode_test_functors.hpp>

Member Author

bbbales2 Aug 10, 2020

All the changes in the ODE files were an alternate fix to a problem that popped up here: #1993 (comment)

They don't change behavior. They just rearrange test code a bit so that the jumbo tests (#1965) build correctly.

Member Author

bbbales2 commented Aug 10, 2020

This is ready to review. There's a ton of different sorts of changes in here. I went through I tried to explain each of them, cause some of them probably look pretty weird. When I got into the testing framework and started pulling threads I ended up working on a lot more things than I intended to.

Member Author

bbbales2 commented Aug 17, 2020

@syclik you think you'd have a chance to review this in like the next week or so? If not I'll grab someone else.

Member Author

bbbales2 commented Aug 20, 2020

@t4c1 yo can you review this? There were a few of problems with the testing framework. I wanna get these in before we make all the expression-compatibility changes.

Contributor

t4c1 commented Aug 21, 2020

This is a huge PR. Could you split it into 2 or 3 smaller ones?

There is a lot of math stuff in here. I am not sure I feel comfortable reviewing that.

This was referenced Aug 24, 2020

Generalize cauchy #1944

Merged

Reduced some duplicate code in ODE tests #2039

Merged

bbbales2 marked this pull request as draft

August 26, 2020 20:41

bbbales2 mentioned this pull request

Fix problems with higher order gradients in probability test framework #2042

Merged

5 tasks

bbbales2 mentioned this pull request

Add probability tests to check vectorized/scalar lpmfs/lpdfs are the same #2085

Merged

5 tasks

Member Author

bbbales2 commented Oct 12, 2020

Closed by #2085

bbbales2 closed this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet