Avoid unnecessary allocations in `float_lit` and `integer_lit`. #55384

nnethercote · 2018-10-26T11:09:22Z

This commit avoids an allocation when parsing any float and integer
literals that don't involved underscores.

This reduces the number of allocations done for the tuple-stress
benchmark by 10%, reducing its instruction count by just under 1%.

This commit avoids an allocation when parsing any float and integer literals that don't involved underscores. This reduces the number of allocations done for the `tuple-stress` benchmark by 10%, reducing its instruction count by just under 1%.

rust-highfive · 2018-10-26T11:09:32Z

r? @michaelwoerister

(rust_highfive has picked a reviewer for you, use r? to override)

Mark-Simulacrum · 2018-10-26T12:52:15Z

src/libsyntax/parse/mod.rs

+    // Strip underscores without allocating a new String unless necessary.
+    let s2;
+    let s = if s.chars().any(|c| c == '_') {
+        s2 = s.chars().filter(|&c| c != '_').collect::<String>();


Hm, since underscores are in ASCII, couldn't we into_bytes, retain, and then String::from_utf8, maybe unchecked?

That would work, but it's not clear to me that it would be any faster; it might even be slower because there are two allocations involved? (Because it converts to a vector, and then back to a new String?)

Besides, this is the cold path, so I'm satisfied with reusing the existing code, which until now had been considered good enough for the hot path :)

Just FYI, String::from_utf8 consumes the vector given to it, so no additional allocation would happen.

michaelwoerister · 2018-10-26T13:21:48Z

src/libsyntax/parse/mod.rs

+
+    // Strip underscores without allocating a new String unless necessary.
+    let s2;
+    let s = if s.chars().any(|c| c == '_') {


It would be interesting to see if std::slice::memchr::memchr(b'_', s.as_bytes()).is_some() is faster than s.chars().any().

I tried it. Cachegrind says it's marginally more instructions: 22,526,559,505 up from 22,516,683,388. So I think I'll stick with the original.

nnethercote · 2018-10-29T06:54:07Z

Although I haven't changed the code, IMO I've addressed the comments above, so this is ready for re-review.

michaelwoerister · 2018-10-29T08:33:34Z

Thanks, @nnethercote!

@bors r+

bors · 2018-10-29T08:33:35Z

📌 Commit eb637d2 has been approved by michaelwoerister

…t_lit, r=michaelwoerister Avoid unnecessary allocations in `float_lit` and `integer_lit`. This commit avoids an allocation when parsing any float and integer literals that don't involved underscores. This reduces the number of allocations done for the `tuple-stress` benchmark by 10%, reducing its instruction count by just under 1%.

@ghost

Rollup of 9 pull requests Successful merges: - #54965 (update tcp stream documentation) - #55269 (fix typos in various places) - #55384 (Avoid unnecessary allocations in `float_lit` and `integer_lit`.) - #55423 (back out bogus `Ok`-wrapping suggestion on `?` arm type mismatch) - #55426 (Make a bunch of trivial methods of NonNull be `#[inline]`) - #55438 (Avoid directly catching BaseException in bootstrap configure script) - #55439 (Remove unused sys import from generate-deriving-span-tests) - #55440 (Remove unreachable code in hasClass function in Rustdoc) - #55447 (Fix invalid path in generate-deriving-span-tests.py.) Failed merges: r? @ghost

rust-highfive assigned michaelwoerister Oct 26, 2018

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Oct 26, 2018

Mark-Simulacrum reviewed Oct 26, 2018

View reviewed changes

michaelwoerister reviewed Oct 26, 2018

View reviewed changes

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Oct 29, 2018

pietroalbini mentioned this pull request Oct 29, 2018

Rollup of 9 pull requests #55462

Merged

bors merged commit eb637d2 into rust-lang:master Oct 29, 2018

nnethercote deleted the better-integer_lit-float_lit branch October 29, 2018 21:47

nnethercote mentioned this pull request Oct 29, 2018

Avoid allocating when parsing \u{...} literals. #50052

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid unnecessary allocations in `float_lit` and `integer_lit`. #55384

Avoid unnecessary allocations in `float_lit` and `integer_lit`. #55384

nnethercote commented Oct 26, 2018

rust-highfive commented Oct 26, 2018

Mark-Simulacrum Oct 26, 2018

nnethercote Oct 28, 2018

michaelwoerister Oct 29, 2018

michaelwoerister Oct 26, 2018

nnethercote Oct 29, 2018

nnethercote commented Oct 29, 2018

michaelwoerister commented Oct 29, 2018

bors commented Oct 29, 2018

Avoid unnecessary allocations in float_lit and integer_lit. #55384

Avoid unnecessary allocations in float_lit and integer_lit. #55384

Conversation

nnethercote commented Oct 26, 2018

rust-highfive commented Oct 26, 2018

Mark-Simulacrum Oct 26, 2018

Choose a reason for hiding this comment

nnethercote Oct 28, 2018

Choose a reason for hiding this comment

michaelwoerister Oct 29, 2018

Choose a reason for hiding this comment

michaelwoerister Oct 26, 2018

Choose a reason for hiding this comment

nnethercote Oct 29, 2018

Choose a reason for hiding this comment

nnethercote commented Oct 29, 2018

michaelwoerister commented Oct 29, 2018

bors commented Oct 29, 2018

Avoid unnecessary allocations in `float_lit` and `integer_lit`. #55384

Avoid unnecessary allocations in `float_lit` and `integer_lit`. #55384