Allow additionally parsing \r newlines in PEM files #30

complexspaces · 2023-11-08T06:56:00Z

We (1Password) had some Linux users run into problems using custom certificate roots that had been added to the "main" system CA bundle on their devices. The root cause turned out to be that rustls-pemfile only supports UNIX line endings (\n) in the files it parses. The certificate bundles users sent us, however, contained one or more PEM objects with \r line endings interspersed with ones using \n.

This PR proposes fixing the incompatibility described above by switching to a more lenient line-parsing function. BufReader::read_until was insufficient for this purpose because it only supports looking for a single delimiter byte. Instead, I ported the implementation of read_until out of the standard library and tweaked it slightly to look for more than one character. While this uses the lower-level BufReader functions, it's been well-tested already and has correct error handling. Thanks to @Ralith for helping point me in the right direction.
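For readers who want a picture of the approach, here is a minimal sketch of a read_until variant that stops at either \n or \r. It is an illustration only, not the code from this PR: the function name `read_until_newline` is hypothetical, and it is written against the std `BufRead` trait (`fill_buf` and `consume`), so it is not no_std compatible.

```rust
use std::io::{self, BufRead, ErrorKind};

/// Hypothetical helper (not the PR's actual code): appends bytes from `r`
/// to `buf` up to and including the first `\n` or `\r`, or until EOF.
/// Returns the number of bytes consumed from the reader.
fn read_until_newline<R: BufRead + ?Sized>(r: &mut R, buf: &mut Vec<u8>) -> io::Result<usize> {
    let mut read = 0;
    loop {
        let (done, used) = {
            // Borrow the reader's internal buffer, retrying on interruption.
            let available = match r.fill_buf() {
                Ok(bytes) => bytes,
                Err(ref e) if e.kind() == ErrorKind::Interrupted => continue,
                Err(e) => return Err(e),
            };
            // Look for either line terminator instead of a single needle.
            match available.iter().position(|&b| b == b'\n' || b == b'\r') {
                Some(i) => {
                    buf.extend_from_slice(&available[..=i]);
                    (true, i + 1)
                }
                None => {
                    buf.extend_from_slice(available);
                    (false, available.len())
                }
            }
        };
        // Mark the copied bytes as consumed so the next fill_buf moves on.
        r.consume(used);
        read += used;
        // Stop at a terminator, or at EOF (an empty buffer means no more data).
        if done || used == 0 {
            return Ok(read);
        }
    }
}
```

One consequence of treating \r and \n as independent terminators is that a CRLF pair yields an extra line containing only "\n", so a caller following this sketch would need to tolerate blank lines.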

Additionally, I believe this is technically more correct based on RFC 7468 (assuming this is the right one 😓), which calls out support for multiple line endings:

Furthermore, parsers SHOULD ignore whitespace and other non-base64 characters and MUST handle different newline conventions.

A full regression test is included as well. To confirm it, undo the changes to src/pemfile.rs and run cargo test; the test should then panic with a bogus error. The data file consists of the same Amazon root CA repeated 4 times: 2 objects using LF (\n) line endings and 2 using \r.

AFAICT, the only downside of this change is that it has a performance impact, even if almost imperceptible:

| Version | Performance |
| --- | --- |
| `main` | 1,471 ns/iter |
| Slow custom read_until, no \r support | 1,867 ns/iter (+/- 46) |
| This implementation | 2,842 ns/iter (+/- 146) |
| memchr2 custom read_until, with \r support | 1,491 ns/iter (+/- 12) |
| libc memchr custom read_until, with \r support | 1,517 ns/iter (+/- 15) |

A user would need to parse an absurd number of PEM objects to notice the difference, and the only no_std-compatible way of regaining the lost performance is adding a dependency on the memchr crate, which does not seem worthwhile.

ctz · 2023-11-08T11:16:02Z

I was thinking this might be a regression from ff11ce1, but actually std's read_line is also obsessed with \n: https://doc.rust-lang.org/stable/src/std/io/mod.rs.html#2235-2240 🤨

complexspaces · 2023-11-08T15:26:57Z

That was my conclusion as well. You have to go out of your way to support other line endings in most functions, with the exception of str::lines which supports CRLF as well.
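As a small illustration of that point, the sketch below shows how str::lines treats the different endings: \n and \r\n both end a line, while a bare \r in the middle of the input does not. The helper name `split_lines` is only for demonstration.

```rust
/// Demonstration helper (hypothetical name): collects the lines that
/// `str::lines` yields for the given input.
fn split_lines(input: &str) -> Vec<&str> {
    input.lines().collect()
}

fn demo() {
    // LF and CRLF are both recognized, and the terminator is stripped.
    assert_eq!(split_lines("a\r\nb\nc"), vec!["a", "b", "c"]);
    // A bare \r between two characters is not a line ending, so the
    // input comes back as a single line with the \r still embedded.
    assert_eq!(split_lines("a\rb"), vec!["a\rb"]);
}
```

This is why a PEM object written with bare \r endings looks like one long line to any parser built on these std helpers.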

cpu

Thank you! This is a really nice pull request. I appreciate the thorough description + testing methodology.

Allow additionally parsing \r newlines in PEM files

b5c61eb

ctz approved these changes Nov 8, 2023


cpu approved these changes Nov 8, 2023


djc approved these changes Nov 9, 2023


djc merged commit 4cade3f into rustls:main Nov 9, 2023
8 checks passed

complexspaces mentioned this pull request Nov 9, 2023

Backport: Allow additionally parsing \r newlines in PEM files #31

Merged

complexspaces deleted the parsing-other-line-endings branch November 9, 2023 18:27
