Fixes numerical accuracy issues in quantile. #16572

simonbyrne · 2016-05-25T08:28:34Z

Fixes issue JuliaStats/StatsBase.jl#164, and another when p < eps().

Fixes issue JuliaStats/StatsBase.jl#164, and another when `p < eps()`.

simonbyrne · 2016-05-26T09:48:39Z

cc @andreasnoack can you review and merge?

andreasnoack · 2016-05-26T13:04:49Z

base/statistics.jl


-    indlo = floor(index)
-    i = trunc(Int,indlo)
+    i = trunc(Int,t0) + 1


To avoid confusion, maybe just t0 = Int(f0) since f0 has already been truncated.

trunc(Int, t0) is technically faster, as it avoids the check of an integer.

Makes sense.

andreasnoack · 2016-05-26T13:05:28Z

Except for the minor comment, it looks good.

The `a + γ*(b-a)` introduced by JuliaLang/julia#16572 has the advantage that it increases with `γ` even when `a` and `b` are very close, but it has the drawback that it is not robust to overflow. This is likely to happen in practice with small integer and floating point types. Conversely, the `(1-γ)*a + γ*b` which is currently used only for non-finite quantities is robust to overflow but may not always increase with `γ` as when `a` and `b` are very close or (more frequently) equal since precision loss can give a slightly smaller value for a larger `γ`. This can be problematic as it breaks an expected invariant. So keep using the `a + γ*(b-a)` formula when `a ≈ b`, in which case it's almost like returning either `a` or `b` but less arbitrary.

Fixes numerical accuracy issues in quantile.

4e81d34

Fixes issue JuliaStats/StatsBase.jl#164, and another when `p < eps()`.

andreasnoack reviewed May 26, 2016
View reviewed changes

andreasnoack merged commit 5e32883 into master May 26, 2016

andreasnoack deleted the sb/quantile-acc branch May 26, 2016 13:43

simonbyrne mentioned this pull request Dec 15, 2016

Incorrect computation of quantiles of vector of infinities #19542

Closed

nalimilan mentioned this pull request Jun 17, 2023

Incorrect quantiles for floating-point and integer arrays JuliaStats/Statistics.jl#144

Closed

nalimilan mentioned this pull request Jul 1, 2023

Fix overflows in quantile JuliaStats/Statistics.jl#145

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixes numerical accuracy issues in quantile. #16572

Fixes numerical accuracy issues in quantile. #16572

simonbyrne commented May 25, 2016

simonbyrne commented May 26, 2016

andreasnoack May 26, 2016

simonbyrne May 26, 2016

andreasnoack May 26, 2016

andreasnoack commented May 26, 2016

Fixes numerical accuracy issues in quantile. #16572

Fixes numerical accuracy issues in quantile. #16572

Conversation

simonbyrne commented May 25, 2016

simonbyrne commented May 26, 2016

andreasnoack May 26, 2016

Choose a reason for hiding this comment

simonbyrne May 26, 2016

Choose a reason for hiding this comment

andreasnoack May 26, 2016

Choose a reason for hiding this comment

andreasnoack commented May 26, 2016