net: Don't use checked arithmetic when parsing numbers with known max digits #121428

okaneco · 2024-02-22T03:26:12Z

Add a branch to Parser::read_number that determines whether checked or regular arithmetic is used.

If max_digits.is_some(), then we know we are parsing a u8 or u16 because read_number is only called with Some(3) or Some(4). Both types fit within a u32 without risk of overflow. Thus, we can use plain arithmetic to avoid extra instructions from checked_mul and checked_add.
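The branch described above can be sketched roughly as follows. This is a simplified, standalone illustration and not the actual `Parser::read_number` from `library/core/src/net/parser.rs`: the function name `parse_number`, its signature, and the choice to check the digit count before doing the arithmetic are assumptions made for this example.

```rust
/// Hypothetical stand-in for the parser's digit loop (name and signature are
/// assumptions for illustration, not the standard library's code).
/// Reads digits off the front of `input`, stopping at the first non-digit.
fn parse_number(input: &str, radix: u32, max_digits: Option<usize>) -> Option<u32> {
    let mut result: u32 = 0;
    let mut digit_count = 0usize;

    for c in input.chars() {
        // Stop at the first character that is not a digit in this radix.
        let Some(digit) = c.to_digit(radix) else { break };
        digit_count += 1;

        match max_digits {
            Some(max) => {
                // Reject over-long input before touching the accumulator, so
                // it never holds more than `max` digits.
                if digit_count > max {
                    return None;
                }
                // Bounded case: with at most 3 decimal or 4 hex digits the
                // value stays far below u32::MAX, so plain arithmetic is safe.
                result = result * radix + digit;
            }
            None => {
                // Unbounded case: overflow is possible, so keep the checks.
                result = result.checked_mul(radix)?.checked_add(digit)?;
            }
        }
    }

    // An empty digit run is not a number.
    if digit_count == 0 { None } else { Some(result) }
}
```

With `Some(3)` in radix 10 the accumulator never exceeds 999, and with `Some(4)` in radix 16 it never exceeds 0xFFFF, which is why the unchecked path cannot overflow for the two call sites named in the description.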

Add benches for IpAddr, Ipv4Addr, Ipv6Addr, SocketAddr, SocketAddrV4, and SocketAddrV6 parsing

rustbot · 2024-02-22T03:26:20Z

rustbot has assigned @cuviper.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

okaneco · 2024-02-22T03:31:20Z

I wasn't sure whether to add the functionality to the current parsing function or create a new function. I chose this way to reduce call-site churn.

For the benchmarks, I took the values from the tests folder. If there are better suggestions, I have no problem using those instead.
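For readers who want to try a similar measurement outside the standard library's bench harness, here is a rough sketch using only stable APIs. The helper name `time_parse` and the iteration count are assumptions made for this example; the PR's own benches live in the library's bench suite and are not reproduced here.

```rust
use std::hint::black_box;
use std::str::FromStr;
use std::time::{Duration, Instant};

/// Hypothetical helper: average wall-clock time of parsing `input` as `T`.
/// This is a crude timing loop, not a statistically rigorous benchmark.
fn time_parse<T: FromStr>(input: &str, iters: u32) -> Duration {
    assert!(iters > 0, "need at least one iteration");
    let start = Instant::now();
    for _ in 0..iters {
        // black_box discourages the optimizer from removing the parse call.
        let _ = black_box(black_box(input).parse::<T>());
    }
    start.elapsed() / iters
}
```

A call such as `time_parse::<std::net::Ipv4Addr>("192.168.0.1", 1_000_000)` returns the mean duration per parse. Results from a loop like this are noisy and depend on the machine and build profile, so they are only useful for comparing two builds on the same system.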

I saw a ~20% improvement on benchmarks on my laptop.

Before

benchmarks:
    net::addr_parser::bench_parse_ipaddr_v4           62.00ns/iter  +/- 2.00ns
    net::addr_parser::bench_parse_ipaddr_v6_compress 211.00ns/iter  +/- 9.00ns
    net::addr_parser::bench_parse_ipaddr_v6_full     290.00ns/iter  +/- 6.00ns
    net::addr_parser::bench_parse_ipaddr_v6_v4       194.00ns/iter  +/- 6.00ns
    net::addr_parser::bench_parse_ipv4                61.00ns/iter  +/- 3.00ns
    net::addr_parser::bench_parse_ipv6_compress      193.00ns/iter +/- 29.00ns
    net::addr_parser::bench_parse_ipv6_full          284.00ns/iter +/- 15.00ns
    net::addr_parser::bench_parse_ipv6_v4            183.00ns/iter  +/- 6.00ns
    net::addr_parser::bench_parse_socket_v4           85.00ns/iter  +/- 2.00ns
    net::addr_parser::bench_parse_socket_v6          235.00ns/iter +/- 13.00ns
    net::addr_parser::bench_parse_socket_v6_scope_id 248.00ns/iter +/- 31.00ns
    net::addr_parser::bench_parse_socketaddr_v4       87.00ns/iter  +/- 4.00ns
    net::addr_parser::bench_parse_socketaddr_v6      254.00ns/iter +/- 10.00ns

After

benchmarks:
    net::addr_parser::bench_parse_ipaddr_v4           48.00ns/iter  +/- 1.00ns
    net::addr_parser::bench_parse_ipaddr_v6_compress 170.00ns/iter  +/- 4.00ns
    net::addr_parser::bench_parse_ipaddr_v6_full     227.00ns/iter  +/- 4.00ns
    net::addr_parser::bench_parse_ipaddr_v6_v4       158.00ns/iter  +/- 3.00ns
    net::addr_parser::bench_parse_ipv4                45.00ns/iter  +/- 2.00ns
    net::addr_parser::bench_parse_ipv6_compress      160.00ns/iter +/- 12.00ns
    net::addr_parser::bench_parse_ipv6_full          225.00ns/iter  +/- 7.00ns
    net::addr_parser::bench_parse_ipv6_v4            146.00ns/iter  +/- 3.00ns
    net::addr_parser::bench_parse_socket_v4           69.00ns/iter  +/- 6.00ns
    net::addr_parser::bench_parse_socket_v6          210.00ns/iter +/- 11.00ns
    net::addr_parser::bench_parse_socket_v6_scope_id 218.00ns/iter +/- 24.00ns
    net::addr_parser::bench_parse_socketaddr_v4       71.00ns/iter  +/- 3.00ns
    net::addr_parser::bench_parse_socketaddr_v6      216.00ns/iter  +/- 5.00ns
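The relative change for any pair of rows in the two listings can be checked with a small helper. The function name `improvement_percent` is an assumption made for this example; the inputs in the usage note are copied from the Before and After listings above.

```rust
/// Hypothetical helper: percentage reduction in time per iteration.
/// A positive result means the "after" measurement is faster.
fn improvement_percent(before_ns: f64, after_ns: f64) -> f64 {
    (before_ns - after_ns) / before_ns * 100.0
}
```

For example, `improvement_percent(62.0, 48.0)` for `bench_parse_ipaddr_v4` comes out at roughly 22.6%, while `improvement_percent(235.0, 210.0)` for `bench_parse_socket_v6` is closer to 10.6%, so the gain varies by benchmark around the quoted figure.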

cuviper · 2024-03-04T23:11:56Z

library/core/src/net/parser.rs

@@ -104,36 +104,62 @@ impl<'a> Parser<'a> {
    // Read a number off the front of the input in the given radix, stopping
    // at the first non-digit character or eof. Fails if the number has more
    // digits than max_digits or if there is no number.
-    fn read_number<T: ReadNumberHelper>(
+    //
+    // INVARIANT: `max_digits` must be less than or equal to the number of


cuviper · Mar 4, 2024

It must be strictly less, because you can overflow u32 with the same number of digits as u32::MAX.

Maybe we should also add a debug_assert! for this? Just for future proofing to be checked in CI, without affecting release builds.

okaneco · Mar 4, 2024

Changed the invariant comment and added a debug_assert! checking that max_digits is below 10.

I could have used max_digits <= u32::MAX.ilog10() but that seems less clear for future readers as that rounds down to 9.
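The digit bound discussed in this thread can be checked with a short sketch for the decimal case. The helper names `max_decimal_value` and `fits_in_u32` are assumptions made for this example, and the reasoning covers radix 10 only; other radixes have different digit limits.

```rust
/// Largest value writable with `digits` decimal digits, computed in u64 so
/// that this helper itself cannot overflow for the small inputs used here
/// (valid for `digits` up to 19).
fn max_decimal_value(digits: u32) -> u64 {
    10u64.pow(digits) - 1
}

/// True when every decimal number with `digits` digits fits in a `u32`.
fn fits_in_u32(digits: u32) -> bool {
    max_decimal_value(digits) <= u64::from(u32::MAX)
}
```

`u32::MAX` is 4_294_967_295, which has 10 digits, yet `fits_in_u32(10)` is false because 9_999_999_999 has the same digit count and does not fit. `fits_in_u32(9)` is true, which matches both the `max_digits < 10` assertion and the fact that `u32::MAX.ilog10()` evaluates to 9.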

cuviper · 2024-03-04T23:54:54Z

Thanks!

@bors r+

bors · 2024-03-04T23:54:57Z

📌 Commit 69637c9 has been approved by cuviper

It is now in the queue for this repository.

jhpratt · 2024-03-05T02:44:06Z

@bors rollup=never

(due to perf implications)

bors · 2024-03-05T15:29:23Z

⌛ Testing commit 69637c9 with merge 96561a8...

bors · 2024-03-05T17:57:58Z

☀️ Test successful - checks-actions
Approved by: cuviper
Pushing 96561a8 to master...

rust-timer · 2024-03-05T19:14:58Z

Finished benchmarking commit (96561a8): comparison URL.

Overall result: no relevant changes - no action needed

@rustbot label: -perf-regression

Instruction count

This benchmark run did not return any relevant results for this metric.

Max RSS (memory usage)

This benchmark run did not return any relevant results for this metric.

Cycles

Results

This is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.

                               mean     range             count
Regressions ❌ (primary)        -        -                 0
Regressions ❌ (secondary)      2.1%     [2.1%, 2.1%]      1
Improvements ✅ (primary)       -        -                 0
Improvements ✅ (secondary)     -2.9%    [-2.9%, -2.9%]    1
All ❌✅ (primary)               -        -                 0

Binary size

This benchmark run did not return any relevant results for this metric.

Bootstrap: 644.654s -> 644.518s (-0.02%)
Artifact size: 175.03 MiB -> 175.01 MiB (-0.01%)

rustbot assigned cuviper Feb 22, 2024

rustbot added S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-libs Relevant to the library team, which will review and decide on the PR/issue. labels Feb 22, 2024

cuviper reviewed Mar 4, 2024

okaneco added 2 commits March 4, 2024 18:46

Add benches for net parsing

69637c9

Add benches for IpAddr, Ipv4Addr, Ipv6Addr, SocketAddr, SocketAddrV4, and SocketAddrV6 parsing

okaneco force-pushed the ipaddr_parse branch from 30b67bc to 69637c9 Compare March 4, 2024 23:50

bors added S-waiting-on-bors Status: Waiting on bors to run and complete tests. Bors will change the label on completion. and removed S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Mar 4, 2024

bors mentioned this pull request Mar 5, 2024

Move generic NonZero rustc_layout_scalar_valid_range_start attribute to inner type. #121885

Merged

bors added the merged-by-bors This PR was explicitly merged by bors. label Mar 5, 2024

bors merged commit 96561a8 into rust-lang:master Mar 5, 2024
12 checks passed

rustbot added this to the 1.78.0 milestone Mar 5, 2024

okaneco deleted the ipaddr_parse branch March 5, 2024 18:15
