rayon_1_0_0::sort perf regression #56283

meven · 2018-11-27T14:45:08Z

According to the work done on lolbench there was a perf regression on
rayon_1_0_0::sort::demo_merge_sort_descending and rayon_1_0_0::sort::par_sort_descending between nightly 2018-11-03 and 2018-11-04.

The benchmarks grew 100%, and close to 200 % in branch_instructions / iteration.

cc @anp

https://lolbench.rs/benchmarks/rayon-1-0-0-sort-demo-merge-sort-descending.html
https://lolbench.rs/benchmarks/rayon-1-0-0-sort-par-sort-descending.html

pnkfelix · 2018-11-29T15:40:54Z

tagging as P-high for the initial investigation of the scope of the performance regression.

nikic · 2018-11-29T16:07:37Z

Commit range is 8b09631...04fdb44, which notably includes the jemalloc removal. Most likely this is just an allocator regression and can be recovered by explicitly enabling jemalloc.

pnkfelix · 2018-12-03T12:58:55Z

I can observe the ~2.0x slowdown regression on my Linux desktop.

My mac does not exhibit it nearly as cleanly.

I'm going to see if switching on jemallocator manually (by adding it as a #[global_allocator] to rayon-demo's main.rs) eliminates the regression.

Update: yes, turning on #[global_allocator] static ALLOC: jemallocator::Jemalloc = jemallocator::Jemalloc; eliminates the regression.

pnkfelix · 2018-12-03T13:04:49Z

Okay so I think that confirms the hypothesis from @nikic

You can see from the description on PR #55238 that switching from jemalloc to glibc alloc was expected to cause regressions. That included the note that you would get that performance back (and then some) by switching to jemalloc-sys from crates.io. The Rust stdlib don't do it by default, but as far as I can tell it is what we are recommending as the path for clients who want to maximize performance on this axis?

pnkfelix · 2018-12-03T13:08:05Z

In any case my personal recommendation is that Rayon should consider turning on jemalloc for its demo code.

Here is the necessary diff:

diff --git a/rayon-demo/Cargo.toml b/rayon-demo/Cargo.toml
index 897b5c8..e1aba34 100644
--- a/rayon-demo/Cargo.toml
+++ b/rayon-demo/Cargo.toml
@@ -5,6 +5,7 @@ authors = ["Niko Matsakis <niko@alum.mit.edu>"]
 publish = false

 [dependencies]
+jemallocator = "0.1.8"
 rayon = { path = "../" }
 cgmath = "0.16"
 docopt = "1"
diff --git a/rayon-demo/src/main.rs b/rayon-demo/src/main.rs
index 84619c0..e21d7b0 100644
--- a/rayon-demo/src/main.rs
+++ b/rayon-demo/src/main.rs
@@ -1,5 +1,9 @@
 #![cfg_attr(test, feature(test))]

+extern crate jemallocator;
+#[global_allocator]
+static ALLOC: jemallocator::Jemalloc = jemallocator::Jemalloc;
+
 use std::env;
 use std::io;
 use std::io::prelude::*;

pnkfelix · 2018-12-03T13:12:25Z

(I think there isn't much more to investigate here with respect to the scope of the regression nor its source. Downgrading from P-high to P-medium, and nominating for discussion at next compiler meeting with inclination to close as "not-a-bug"/"wont-fix")

anp · 2018-12-03T17:02:08Z

FWIW I hope to support measuring against multiple different allocators in lolbench in the future, just not there yet.

pnkfelix · 2018-12-06T15:32:42Z

closing as wontfix.

Centril added the I-slow Issue: Problems and improvements with respect to performance of generated code. label Nov 27, 2018

nagisa added regression-from-stable-to-nightly Performance or correctness regression from stable to nightly. T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. labels Nov 27, 2018

pnkfelix self-assigned this Nov 29, 2018

pnkfelix added the P-high High priority label Nov 29, 2018

pnkfelix added I-nominated P-medium Medium priority and removed P-high High priority labels Dec 3, 2018

pnkfelix closed this as completed Dec 6, 2018

nikic mentioned this issue Dec 7, 2018

performance regression introduced in nightly-2018-11-04 #56592

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rayon_1_0_0::sort perf regression #56283

rayon_1_0_0::sort perf regression #56283

meven commented Nov 27, 2018

pnkfelix commented Nov 29, 2018

nikic commented Nov 29, 2018

pnkfelix commented Dec 3, 2018 •

edited

Loading

pnkfelix commented Dec 3, 2018

pnkfelix commented Dec 3, 2018

pnkfelix commented Dec 3, 2018

anp commented Dec 3, 2018

pnkfelix commented Dec 6, 2018

rayon_1_0_0::sort perf regression #56283

rayon_1_0_0::sort perf regression #56283

Comments

meven commented Nov 27, 2018

pnkfelix commented Nov 29, 2018

nikic commented Nov 29, 2018

pnkfelix commented Dec 3, 2018 • edited Loading

pnkfelix commented Dec 3, 2018

pnkfelix commented Dec 3, 2018

pnkfelix commented Dec 3, 2018

anp commented Dec 3, 2018

pnkfelix commented Dec 6, 2018

pnkfelix commented Dec 3, 2018 •

edited

Loading