Optimize counting digits in line numbers during error reporting #82248

nhwn · 2021-02-18T06:34:36Z

Replaces .to_string().len() with simple loop and integer division, which avoids an unnecessary allocation.

Although I couldn't figure out how to directly profile rustc's error reporting, I ran a microbenchmark on my machine (2.9 GHz Dual-Core Intel Core i5) on the two strategies for 0..100_000, and the results seem promising:

test to_string_len ... bench:  12,124,792 ns/iter (+/- 700,652)
test while_loop    ... bench:      30,333 ns/iter (+/- 562)

The x86_64 disassembly reduces integer division to a multiplication + shift, so I don't think there's any problems with using integer division.

For more (micro)optimization, it would be nice if we could avoid the initial check to see if the line number is nonzero, but I don't think self.get_max_line_num(span, children) guarantees a nonzero line number.

rust-highfive · 2021-02-18T06:34:39Z

r? @varkor

(rust-highfive has picked a reviewer for you, use r? to override)

leonardo-m · 2021-02-18T09:00:26Z

An alternative version:

let mut num_digits = 0;
loop {
    num_digits += 1;
    n /= 10;
    if n == 0 { break; }
}
num_digits

nhwn · 2021-02-18T09:08:51Z

An alternative version:

let mut num_digits = 0;
loop {
    num_digits += 1;
    n /= 10;
    if n == 0 { break; }
}
num_digits

That's a much better implementation than the original one, thanks!

varkor · 2021-02-18T13:10:14Z

compiler/rustc_errors/src/emitter.rs

@@ -1713,7 +1713,15 @@ impl EmitterWriter {
        let max_line_num_len = if self.ui_testing {
            ANONYMIZED_LINE_NUM.len()
        } else {
-            self.get_max_line_num(span, children).to_string().len()
+            let mut n = self.get_max_line_num(span, children);


Could you add a comment explaining why we calculate the number of digits this way? (To make sure someone doesn't revert this change without realising it's intentional, later.)

Added an explanatory comment.

varkor · 2021-02-18T13:10:42Z

Could you squash the commits? Thanks.

varkor · 2021-02-18T13:58:20Z

Thanks!

@bors r+ rollup

bors · 2021-02-18T13:58:22Z

📌 Commit 2cd7a5148ce00e403f030e1b9437bd99e93cb08e has been approved by varkor

nhwn · 2021-02-18T14:25:39Z

It seems my tidy hook didn't auto-run when I rebased, so a pesky space slipped through in the comment. Sorry about that.

varkor · 2021-02-18T14:34:27Z

@bors r+ rollup

bors · 2021-02-18T14:34:29Z

📌 Commit 8a5c568 has been approved by varkor

…rkor Optimize counting digits in line numbers during error reporting Replaces `.to_string().len()` with simple loop and integer division, which avoids an unnecessary allocation. Although I couldn't figure out how to directly profile `rustc`'s error reporting, I ran a microbenchmark on my machine (2.9 GHz Dual-Core Intel Core i5) on the two strategies for `0..100_000`, and the results seem promising: ``` test to_string_len ... bench: 12,124,792 ns/iter (+/- 700,652) test while_loop ... bench: 30,333 ns/iter (+/- 562) ``` The x86_64 disassembly reduces integer division to a multiplication + shift, so I don't think there's any problems with using integer division. For more (micro)optimization, it would be nice if we could avoid the initial check to see if the line number is nonzero, but I don't think `self.get_max_line_num(span, children)` _guarantees_ a nonzero line number.

Rollup of 10 pull requests Successful merges: - rust-lang#81546 ([libtest] Run the test synchronously when hitting thread limit) - rust-lang#82066 (Ensure valid TraitRefs are created for GATs) - rust-lang#82112 (const_generics: Dont evaluate array length const when handling yet another error ) - rust-lang#82194 (In some limited cases, suggest `where` bounds for non-type params) - rust-lang#82215 (Replace if-let and while-let with `if let` and `while let`) - rust-lang#82218 (Make sure pdbs are copied along with exe and dlls when bootstrapping) - rust-lang#82236 (avoid converting types into themselves (clippy::useless_conversion)) - rust-lang#82246 (Add long explanation for E0549) - rust-lang#82248 (Optimize counting digits in line numbers during error reporting) - rust-lang#82256 (Print -Ztime-passes (and misc stats/logs) on stderr, not stdout.) Failed merges: r? `@ghost` `@rustbot` modify labels: rollup

matthieu-m · 2021-02-25T10:35:07Z

I actually just realized there's doesn't appear to be a fast way to compute the number of (decimal) digits that an integer would take.

I find it surprising, as I would have expected integer formatting to make use of this. For reference, a simple version is to switch on the number of bits (playground):

fn count_digits(i: u32) -> u32 {
    if i < 10 {
        return 1;
    }
    if i < 100 {
        return 2;
    }

    match 32 - i.leading_zeros() {
        0 | 1 | 2 | 3 | 4 | 5 | 6
            //  100 takes 7 bits, and anything < 100 is already handled.
            => unsafe { hint::unreachable_unchecked() },
        7 | 8 | 9 | 10
            => 3 + if i < 1_000 { 0 } else { 1 },
        11 | 12 | 13 | 14
            => 4 + if i < 10_000 { 0 } else { 1 },
        15 | 16 | 17
            => 5 + if i < 100_000 { 0 } else { 1 },
        18 | 19 | 20
            => 6 + if i < 1_000_000 { 0 } else { 1 },
        21 | 22 | 23 | 24
            => 7 + if i < 10_000_000 { 0 } else { 1 },
        25 | 26 | 27
            => 8 + if i < 100_000_000 { 0 } else { 1 },
        28 | 29 | 30
            => 9 + if i < 1_000_000_000 { 0 } else { 1 },
        31 | 32 => 10,
        //  There are not more than 32 bits in a 32 bits integer
        _ => unsafe { hint::unreachable_unchecked() },
    }
}

It may be a useful addition to the standard library.

toothbrush7777777 · 2021-02-26T16:57:04Z

I actually just realized there's doesn't appear to be a fast way to compute the number of (decimal) digits that an integer would take.
[…]

Basically, you want ceil(log10(n)).

Optimize counting digits in line numbers during error reporting further This one-ups rust-lang#82248 by switching the strategy: Instead of dividing the value by 10 repeatedly, we compare with a limit that we multiply by 10 repeatedly. In my benchmarks, this took between 50% and 25% of the original time. The reasons for being faster are: 1. While LLVM is able to replace a division by constant with a multiply + shift, a plain multiplication is still faster. However, this doesn't even factor, because 2. Multiplication, unlike division, is const. We also use a simple for-loop instead of a more complex loop + break, which allows 3. rustc to const-fold the whole loop, and indeed the assembly output simply shows a series of comparisons.

rust-highfive assigned varkor Feb 18, 2021

rust-highfive added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Feb 18, 2021

This comment has been minimized.

Sign in to view

varkor reviewed Feb 18, 2021

View reviewed changes

nhwn force-pushed the optimize-counting-digits branch from 4eb41e5 to 2cd7a51 Compare February 18, 2021 13:53

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Feb 18, 2021

This comment has been minimized.

Sign in to view

nhwn: optimize counting digits in line numbers

8a5c568

nhwn force-pushed the optimize-counting-digits branch from 2cd7a51 to 8a5c568 Compare February 18, 2021 14:23

This was referenced Feb 18, 2021

Rollup of 10 pull requests #82262

Closed

Rollup of 10 pull requests #82263

Merged

bors merged commit 555db2d into rust-lang:master Feb 18, 2021

rustbot added this to the 1.52.0 milestone Feb 18, 2021

nhwn deleted the optimize-counting-digits branch February 18, 2021 22:39

llogiq mentioned this pull request Feb 26, 2021

Optimize counting digits in line numbers during error reporting further #82562

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize counting digits in line numbers during error reporting #82248

Optimize counting digits in line numbers during error reporting #82248

nhwn commented Feb 18, 2021

rust-highfive commented Feb 18, 2021

This comment has been minimized.

leonardo-m commented Feb 18, 2021

nhwn commented Feb 18, 2021

varkor Feb 18, 2021

nhwn Feb 18, 2021

varkor commented Feb 18, 2021

varkor commented Feb 18, 2021

bors commented Feb 18, 2021

This comment has been minimized.

nhwn commented Feb 18, 2021

varkor commented Feb 18, 2021

bors commented Feb 18, 2021

matthieu-m commented Feb 25, 2021

toothbrush7777777 commented Feb 26, 2021

Optimize counting digits in line numbers during error reporting #82248

Optimize counting digits in line numbers during error reporting #82248

Conversation

nhwn commented Feb 18, 2021

rust-highfive commented Feb 18, 2021

This comment has been minimized.

leonardo-m commented Feb 18, 2021

nhwn commented Feb 18, 2021

varkor Feb 18, 2021

Choose a reason for hiding this comment

nhwn Feb 18, 2021

Choose a reason for hiding this comment

varkor commented Feb 18, 2021

varkor commented Feb 18, 2021

bors commented Feb 18, 2021

This comment has been minimized.

nhwn commented Feb 18, 2021

varkor commented Feb 18, 2021

bors commented Feb 18, 2021

matthieu-m commented Feb 25, 2021

toothbrush7777777 commented Feb 26, 2021