Speed up `TokenStream` concatenation #65198

Conversation
Currently, this function creates a new empty stream, and then appends the elements from each given stream onto that stream. This can cause quadratic behaviour. This commit changes the function so that it modifies the first stream (which can be long) by extending it with the elements from the subsequent streams (which are almost always short), which avoids the quadratic behaviour.
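As a rough sketch of the idea (using plain `Vec<u32>` in place of rustc's actual token-tree types, so all names here are illustrative only):

```rust
// Illustrative sketch only: `Vec<u32>` stands in for the real stream type.
// Extending the first (long) stream in place means each element is copied
// at most once, instead of re-copying the growing result once per
// appended stream, which is what caused the quadratic behaviour.
fn concat(streams: Vec<Vec<u32>>) -> Vec<u32> {
    let mut iter = streams.into_iter();
    // Start from the first stream (or an empty one if there are none).
    let mut first = iter.next().unwrap_or_default();
    for stream in iter {
        // The subsequent streams are almost always short.
        first.extend(stream);
    }
    first
}
```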
Currently, when two tokens must be glued together, this function duplicates large chunks of the existing streams. This can cause quadratic behaviour. This commit changes the function so that it overwrites the last token with a glued token, which avoids the quadratic behaviour. This removes the need for `TokenStreamBuilder::push_all_but_{first,last}_tree`. The commit also restructures `push` somewhat, by removing `TokenStream::{first_tree_and_joint,last_tree_if_joint}` in favour of more pattern matching and some comments. This makes the code shorter, and in my opinion, more readable.
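A minimal sketch of the gluing change (the `Spacing` enum and string tokens below are stand-ins for rustc's actual types, chosen only to illustrate the overwrite-in-place idea):

```rust
// Illustrative sketch: `Joint` marks a token that must be glued to the next.
#[derive(Clone, Copy, PartialEq, Debug)]
enum Spacing {
    Alone,
    Joint,
}

// Overwrite the last token with the glued token in place, instead of
// rebuilding the stream around it (which would copy the whole prefix).
fn push_token(stream: &mut Vec<(String, Spacing)>, tok: (String, Spacing)) {
    match stream.last_mut() {
        Some(last) if last.1 == Spacing::Joint => {
            // Glue: append the new token's text and take over its spacing.
            last.0.push_str(&tok.0);
            last.1 = tok.1;
        }
        _ => stream.push(tok),
    }
}
```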
In my local measurements, this provided a very small (<1%) improvement on some of the standard benchmarks.

@bors try @rust-timer queue

Awaiting bors try build completion
Speed up `TokenStream` concatenation

This PR fixes the quadratic behaviour identified in #65080.

r? @Mark-Simulacrum
```rust
// Get the first stream. If it's `None`, create an empty
// stream.
let mut iter = streams.drain();
let mut first_stream_lrc = match iter.next().unwrap().0 {
```
Suggested change:

```diff
- let mut first_stream_lrc = match iter.next().unwrap().0 {
+ let mut first_stream_lrc = iter.next().unwrap().0.unwrap_or_default();
```
```rust
// space for them.
let first_vec_mut = Lrc::make_mut(&mut first_stream_lrc);
first_vec_mut.reserve(num_appends);
for stream in iter {
```
I think this would be cleaner using `.filter_map(|x| x)` and `.flat_map(|s| s.iter().cloned())`.
The current version is easy to read, and the iterator alternatives use more machinery to do the same thing; IMO the current version is preferable.
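For comparison, the reviewer's iterator-based shape would look roughly like this (again with `Vec<u32>` standing in for the real token-tree types, and `concat_with_iterators` being a hypothetical name):

```rust
// Sketch of the suggested alternative: drop the `None` streams with
// `filter_map`, then splice the remaining streams together with `flat_map`.
fn concat_with_iterators(streams: Vec<Option<Vec<u32>>>) -> Vec<u32> {
    streams
        .into_iter()
        .filter_map(|s| s)
        .flat_map(|s| s.into_iter())
        .collect()
}
```

This trades the explicit loop-and-`extend` for adapter chaining; both are linear, so the choice is purely stylistic.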
☀️ Try build successful - checks-azure

Queued c255427 with parent d304f5c, future comparison URL.
Code changes look good to me. I'd like to get @petrochenkov or perhaps @matklad to sign off too though since I'm not too familiar with this code.
LGTM

@bors r+ rollup=never

📌 Commit 75e0078 has been approved by
The perf run didn't work. Let's try it the old-fashioned way: @rust-timer build c255427

Queued c255427 with parent d304f5c, future comparison URL.

@nnethercote it worked but still hasn't finished; you can see it here: https://perf.rust-lang.org/status.html That 700000% wall time regression made the queue a bit long...

Finished benchmarking try commit c255427, comparison URL.

Lots of small (<1%) improvements, mostly in the short-running benchmarks.
☀️ Test successful - checks-azure