
Improve floating point accuracy in percentile #4617

Merged — 7 commits merged into cupy:master on Feb 14, 2021

Conversation

@wphicks (Contributor) commented on Feb 3, 2021

Improve the floating point accuracy of the linear interpolation formula used in the percentile_weightnening kernel of cupy.percentile.
Resolves #4607
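For context, the accuracy problem comes from the naive interpolation form `a + (b - a) * t`, which can round to a value slightly outside `[a, b]` near `t == 1` and break monotonicity of the percentile outputs. A minimal pure-Python sketch of the two-sided interpolation approach (illustrative only — not cupy's actual kernel code) looks like this:

```python
def lerp(a, b, t):
    """Linear interpolation between a and b at fraction t in [0, 1].

    Interpolating from the nearer endpoint keeps the result exact at
    both t == 0 and t == 1, unlike the naive a + (b - a) * t, whose
    rounding error near t == 1 can make percentile outputs
    non-monotonic.
    """
    diff = b - a
    if t < 0.5:
        # Near a: accumulate the offset from a.
        return a + diff * t
    # Near b: accumulate the (small) offset back from b instead.
    return b - diff * (1.0 - t)
```

With this form, `lerp(a, b, 1.0)` is exactly `b`, whereas `a + (b - a) * 1.0` may differ from `b` in the last bit for some inputs.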

@wphicks wphicks marked this pull request as draft February 3, 2021 19:24
@wphicks (Contributor, Author) commented on Feb 3, 2021

The fix here is relatively straightforward and exactly mirrors the fix used in numpy. The associated unit test is a little more annoying. In numpy, Hypothesis was used to create a more general test of this problem, but since it looks like we aren't using Hypothesis in cupy, I used Hypothesis to find a "magic value" that would produce the error and then just hardcoded that value into the test that I've submitted here. Does that seem reasonable or do folks have some other idea on how this might be tested?

@wphicks wphicks marked this pull request as ready for review February 3, 2021 20:03
@wphicks wphicks changed the title [WIP] Improve floating point accuracy in percentile Improve floating point accuracy in percentile Feb 3, 2021
@leofang (Member) left a comment

> The fix here is relatively straightforward and exactly mirrors the fix used in numpy.

Any chance you could give a pointer to the Numpy counterpart for the record (either issue/PR or link to actual code is fine)?

> The associated unit test is a little more annoying. In numpy, Hypothesis was used to create a more general test of this problem, but since it looks like we aren't using Hypothesis in cupy, I used Hypothesis to find a "magic value" that would produce the error and then just hardcoded that value into the test that I've submitted here. Does that seem reasonable or do folks have some other idea on how this might be tested?

Well, even Hypothesis itself is fairly new to NumPy. It landed in NumPy only about a year ago (numpy/numpy#15189), so don't feel annoyed that it's not adopted here 😄

Could you add a few more magic values here, and also test against NumPy's outputs (as done in all other tests throughout the codebase)? You could return the output in the test function, decorated by @testing.numpy_cupy_allclose(). This way we get a stronger confidence in the results.
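The mechanics of that decorator pattern can be illustrated with a small, self-contained stand-in (the `backend_allclose` name and the simplifications here are hypothetical — cupy's real `@testing.numpy_cupy_allclose()` also handles arrays, dtypes, and host/device transfers):

```python
import math

def backend_allclose(backends, rtol=1e-7):
    """Hypothetical minimal stand-in for @testing.numpy_cupy_allclose():
    run the decorated test once per backend module and require the
    returned sequences of floats to agree within rtol."""
    def deco(test):
        def wrapper():
            results = [test(xp) for xp in backends]
            ref = results[0]
            for res in results[1:]:
                assert len(res) == len(ref)
                assert all(math.isclose(x, y, rel_tol=rtol)
                           for x, y in zip(ref, res))
            return ref
        return wrapper
    return deco

# Usage sketch: with cupy installed the backends would be numpy and
# cupy; using the math module twice here just shows the mechanics.
@backend_allclose([math, math])
def test_sqrt(xp):
    return [xp.sqrt(v) for v in (1.0, 2.0, 4.0)]
```

The key design point is that the test body is written once against the `xp` parameter, and the framework supplies each backend in turn and compares the returned outputs.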

cupy/_statistics/order.py (review comment, resolved)
@wphicks (Contributor, Author) commented on Feb 3, 2021

> Any chance you could give a pointer to the Numpy counterpart for the record (either issue/PR or link to actual code is fine)?

Oh yes! Sorry about that; I included it in the issue but forgot to repost it here. The numpy PR is numpy/numpy#16273, and it addressed numpy/numpy#14685.

> Well, even Hypothesis itself is fairly new to NumPy. It landed in NumPy only about a year ago (numpy/numpy#15189) so don't feel annoyed that it's not adopted here 😄

Haha! No worries; I was more annoyed with myself that I hadn't found a more elegant test without Hypothesis...

> Could you add a few more magic values here, and also test against NumPy's outputs (as done in all other tests throughout the codebase)? You could return the output in the test function, decorated by @testing.numpy_cupy_allclose(). This way we get a stronger confidence in the results.

Absolutely! Will update momentarily.

@wphicks (Contributor, Author) commented on Feb 3, 2021

All right, got the test updated and posted some benchmark data. Let me know if I can tweak anything further or if you'd like to see something more from that test.

@asi1024 (Member) commented on Feb 8, 2021

Jenkins, test this please.

@asi1024 asi1024 added this to the v9.0.0b3 milestone Feb 8, 2021
@chainer-ci (Member) commented: Jenkins CI test (for commit 10e03d7, target branch master) failed with status FAILURE.

@asi1024 (Member) commented on Feb 9, 2021

@wphicks Could you check the test failure?

@wphicks (Contributor, Author) commented on Feb 9, 2021

Ah, interesting. It looks like numpy is violating the percentile monotonicity test in CI. I couldn't reproduce this with numpy==1.20.0, but earlier versions (which don't include numpy/numpy#16273) unsurprisingly demonstrate this issue. I wasn't immediately able to determine which numpy version is used in CI to confirm that this is definitely the problem, but it seems likely. Does someone with better familiarity with the CI system know of a good way to check that version?

In terms of a fix, we can either add a requirement for numpy>=1.20.0 to the test section of setup.py, or skip the test when we detect an old version of numpy. I'm going to push a commit that changes the test requirement, but if that seems like too heavy-handed a change, we can go with the skip instead.

@asi1024 (Member) commented on Feb 9, 2021

I see! Could you use the decorator @testing.with_requires('numpy>=1.20')?

Commits:
- "This reverts commit 55b37e9 in favor of using the testing.with_requires decorator"
- "Skip test when numpy version is old enough to violate the percentile monotonicity constraint itself"
@wphicks (Contributor, Author) commented on Feb 9, 2021

> I see! Could you use decorator @testing.with_requires('numpy>=1.20')?

Done! Thank you. I reverted the change to setup.py and added the decorator.
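For readers unfamiliar with it, the behavior of a version-gating decorator like `testing.with_requires` can be sketched with a minimal stand-in built on `unittest.skipUnless` (the name `with_requires_sketch` and the simplified `(major, minor)` version parsing are assumptions here — the real cupy decorator accepts full requirement strings like `'numpy>=1.20'`):

```python
import unittest

def with_requires_sketch(pkg, minimum):
    """Hypothetical minimal equivalent of cupy's testing.with_requires:
    skip the decorated test unless `pkg` is importable and its
    (major, minor) version is at least `minimum`."""
    try:
        mod = __import__(pkg)
        version = getattr(mod, '__version__', '0.0')
        # Crude parse: take the first two numeric components.
        ok = tuple(int(p) for p in version.split('.')[:2]) >= minimum
    except (ImportError, ValueError):
        ok = False
    # skipUnless marks the test as skipped when the condition is false.
    return unittest.skipUnless(ok, f'{pkg}>={minimum} required')
```

A test decorated with `with_requires_sketch('numpy', (1, 20))` is then skipped on environments with an older numpy instead of failing, which is exactly the effect wanted for the monotonicity test here.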

@wphicks (Contributor, Author) commented on Feb 9, 2021

Looks like GitHub Actions is having some issues right now, which seems to be responsible for the latest failure.

@jakirkham (Member) commented on Feb 9, 2021

Probably from this incident (https://www.githubstatus.com/incidents/5qdkkyg958vy). Maybe try pushing an empty commit? They say it has since been resolved.

@leofang (Member) commented on Feb 9, 2021

Power-cycle (close and reopen) should restart it.

@wphicks wphicks closed this Feb 9, 2021
@wphicks wphicks reopened this Feb 9, 2021
@leofang (Member) commented on Feb 9, 2021

Jenkins, test this please

@chainer-ci (Member) commented: Jenkins CI test (for commit 949a25c, target branch master) succeeded!

@jakirkham (Member) commented:
This shows up in the Windows CI log. Does cuTENSOR get installed on Windows typically? If so, is this just an issue of a download timing out or something?

FAILED tests/cupyx_tests/tools_tests/test_install_library.py::TestInstallLibrary::test_install_cutensor

https://ci.preferred.jp/cupy.experimental.win.cuda100/68442/

@leofang (Member) commented on Feb 10, 2021

There are still tons of errors in the Windows CI, so hopefully we can address them all before v9 is out. For now, it's expected to see failures on Windows.

For this specific case, it's tracked in #4601. The issue is that cuTENSOR for Windows is packaged as an .exe, which cannot be extracted without installing third-party tools like 7zip, so this step cannot be automated and requires user action. On conda-forge we don't have this problem because 7zip was made a dependency on Windows.

@jakirkham Perhaps it's easier if you could request the cuTENSOR team to upload a different, machine-readable format? 🙂 At least cuDNN does not have this issue.

@leofang (Member) commented on Feb 10, 2021

(btw all tests in test_order.py passed on Windows.)

@jakirkham (Member) commented:
Ok let's move the Windows discussion to that issue then 🙂

@asi1024 (Member) commented on Feb 14, 2021

The CI failure is unrelated to this PR. LGTM! Thank you for the PR!

@asi1024 asi1024 merged commit 9fed304 into cupy:master Feb 14, 2021
Labels: cat:numpy-compat (Follow the NumPy/SciPy spec changes), prio:high
Projects: None yet
Development: Successfully merging this pull request may close these issues:
- Percentile output non-monotonic for monotonically increasing percentiles (#4607)

6 participants