Improve computation of offset in `EscapeUnicode` #31253

ranma42 · 2016-01-28T09:03:50Z

Unify the computation of offset and use leading_zeros instead of manually scanning the bits.
This PR removes some duplicated code and makes it a little simpler .
The computation of offset is also faster, but it is unlikely to have an impact on actual code.

(split from #31049)

rust-highfive · 2016-01-28T09:04:02Z

r? @brson

(rust_highfive has picked a reviewer for you, use r? to override)

nagisa · 2016-01-28T12:22:11Z

src/libcore/char.rs

+        // digit should be printed and (which is the same) avoids the
+        // (31 - 32) underflow
+        let msb = 31 - (c | 1).leading_zeros();
+        let msdigit = msb / 4;


This code is pretty confusing (I had to re-interpret it several times), so bear with me…

Should this variable be named nibbles/hex_digits (to-be-output) or something along the lines?

I am afraid that could be misleading, because it is not the number of (hex)digits, but the index of the most significant one.
I hoped that the parallel with MSB would guide the reader and I thought that that presenting it as number of hex_digits would make them believe that an hex_digits = 0 would not be acceptable, while it is the correct offset for any c between 0 and 0xf.

The `offset` value was computed both in `next` and in `size_hint`; computing it in a single place ensures consistency and makes it easier to apply improvements. The value is now computed as soon as the iterator is constructed. This means that the time to compute it is spent immediately and cannot be avoided, but it also guarantees that it is only spent once.

Instead of iteratively scanning the bits, use `leading_zeros`.

ranma42 · 2016-01-28T14:15:53Z

I tried to improve the naming (offset -> hex_digit_idx, msdigit -> ms_hex_digit) and to add some comments.

brson · 2016-01-29T22:14:06Z

I don't feel qualified to review this. Anybody know who is? @SimonSapin maybe?

SimonSapin · 2016-02-03T19:30:54Z

I believe this code is equivalent to the one before, and it does look nicer. I haven’t benchmarked it to check if it’s faster.

brson · 2016-04-20T01:14:39Z

@bors r+

bors · 2016-04-20T01:14:41Z

📌 Commit 8984242 has been approved by brson

bors · 2016-04-20T02:17:09Z

⌛ Testing commit 8984242 with merge 9cf6fba...

Improve computation of offset in `EscapeUnicode` Unify the computation of `offset` and use `leading_zeros` instead of manually scanning the bits. This PR removes some duplicated code and makes it a little simpler . The computation of `offset` is also faster, but it is unlikely to have an impact on actual code. (split from #31049)

bors · 2016-04-20T04:41:28Z

rust-highfive assigned brson Jan 28, 2016

nagisa reviewed Jan 28, 2016
View reviewed changes

ranma42 added 3 commits January 28, 2016 15:13

Improve computation of EscapeUnicode offset field

7b33d39

Instead of iteratively scanning the bits, use `leading_zeros`.

Improve naming and explanations

79dfa25

ranma42 force-pushed the improve-unicode-iter-offset branch from 283f3b8 to 79dfa25 Compare January 28, 2016 14:13

Fix make tidy and name what is being computed

8984242

bors merged commit 8984242 into rust-lang:master Apr 20, 2016

ranma42 deleted the improve-unicode-iter-offset branch April 20, 2016 08:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve computation of offset in `EscapeUnicode` #31253

Improve computation of offset in `EscapeUnicode` #31253

ranma42 commented Jan 28, 2016

rust-highfive commented Jan 28, 2016

nagisa Jan 28, 2016

ranma42 Jan 28, 2016

ranma42 commented Jan 28, 2016

brson commented Jan 29, 2016

SimonSapin commented Feb 3, 2016

brson commented Apr 20, 2016

bors commented Apr 20, 2016

bors commented Apr 20, 2016

bors commented Apr 20, 2016

Improve computation of offset in EscapeUnicode #31253

Improve computation of offset in EscapeUnicode #31253

Conversation

ranma42 commented Jan 28, 2016

rust-highfive commented Jan 28, 2016

nagisa Jan 28, 2016

Choose a reason for hiding this comment

ranma42 Jan 28, 2016

Choose a reason for hiding this comment

ranma42 commented Jan 28, 2016

brson commented Jan 29, 2016

SimonSapin commented Feb 3, 2016

brson commented Apr 20, 2016

bors commented Apr 20, 2016

bors commented Apr 20, 2016

bors commented Apr 20, 2016

Improve computation of offset in `EscapeUnicode` #31253

Improve computation of offset in `EscapeUnicode` #31253