Introduce `get_checksum_bytes` method and improvements #671

evanlinjin · 2022-07-16T12:22:01Z

Description

get_checksum_bytes() returns a descriptor checksum as [u8; 8] instead of String, potentially improving performance and memory usage.

In addition to this, since descriptors only use US-ASCII characters, each of which fits in a single 8-bit UTF-8 code unit, there is no need to use the char type (which is 4 bytes). This can also potentially bring in some performance and memory-usage benefits.
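
To make the size argument concrete, here is a minimal sketch that uses only the standard library. The function names are hypothetical and exist purely for illustration; they are not part of this PR.

```rust
use std::mem::size_of;

/// Size in bytes of an 8-symbol checksum stored as `char`s (4 bytes each).
fn checksum_size_as_chars() -> usize {
    size_of::<[char; 8]>()
}

/// Size in bytes of the same checksum stored as raw bytes (1 byte each).
fn checksum_size_as_bytes() -> usize {
    size_of::<[u8; 8]>()
}

/// An ASCII-only string encodes every character as exactly one UTF-8 byte,
/// so iterating over `as_bytes()` visits the same symbols as `chars()`.
fn is_single_byte_encodable(desc: &str) -> bool {
    desc.is_ascii()
}
```

A `[u8; 8]` is also `Copy` and lives entirely on the stack, whereas a `String` needs a heap allocation in addition to its own pointer, length, and capacity fields.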

Notes to the reviewers

This is useful because we will be using descriptor checksums for indexing operations in the near future (multi-descriptor wallets #486).
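
As a rough illustration of what such indexing could look like, the sketch below keys a `HashMap` by `[u8; 8]`. The `ChecksumIndex` alias and `build_index` function are assumptions made for this example and do not describe the planned multi-descriptor design.

```rust
use std::collections::HashMap;

/// Hypothetical index from a descriptor checksum to the position of the
/// descriptor in some wallet-local list.
type ChecksumIndex = HashMap<[u8; 8], usize>;

/// Builds the index from a slice of checksums; a fixed-size byte array is
/// `Copy`, `Eq`, and `Hash`, so it can be used as a key without allocating.
fn build_index(checksums: &[[u8; 8]]) -> ChecksumIndex {
    checksums
        .iter()
        .enumerate()
        .map(|(position, checksum)| (*checksum, position))
        .collect()
}
```

Lookups can then pass a byte string literal directly, for example `index.get(b"qpzry9x8")`, where the 8 bytes are an arbitrary placeholder rather than a real checksum.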

Refer to comments by @afilini :

Checklists

All Submissions:

I've signed all my commits
I followed the contribution guidelines
I ran cargo fmt and cargo clippy before committing

New Features:

I've added tests for the new feature
I've added docs for the new feature
I've updated CHANGELOG.md

notmandatory · 2022-07-16T20:25:13Z

src/descriptor/checksum.rs

@@ -43,15 +41,17 @@ fn poly_mod(mut c: u64, val: u64) -> u64 {
    c
 }

-/// Compute the checksum of a descriptor
-pub fn get_checksum(desc: &str) -> Result<String, DescriptorError> {


I don't know if anyone is using get_checksum() but since it's public our policy is to deprecate it for at least one release before completely removing it. We haven't been super strict about it, but we do need to be more consistent. I think you could make a simple function to call your new get_checksum_bytes and convert to a String.

@notmandatory I haven't removed it at all! And yes, get_checksum calls get_checksum_bytes 😅 (line 79)
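
The relationship described in this reply can be sketched as follows. This is not the PR's exact code: `ChecksumError` is a hypothetical stand-in for `DescriptorError`, and the `poly_mod` body and its generator constants follow the descriptor checksum algorithm as implemented in Bitcoin Core, so check them against `src/descriptor/checksum.rs` before relying on them.

```rust
const INPUT_CHARSET: &[u8] = b"0123456789()[],'/*abcdefgh@:$%{}IJKLMNOPQRSTUVWXYZ&+-.;<=>?!^_|~ijklmnopqrstuvwxyzABCDEFGH`#\"\\ ";
const CHECKSUM_CHARSET: &[u8] = b"qpzry9x8gf2tvdw0s3jn54khce6mua7l";

/// Hypothetical error type used only in this sketch.
#[derive(Debug, PartialEq)]
pub enum ChecksumError {
    InvalidCharacter(u8),
}

/// One step of the checksum polynomial (constants assumed from Bitcoin Core).
fn poly_mod(mut c: u64, val: u64) -> u64 {
    let c0 = c >> 35;
    c = ((c & 0x7ffffffff) << 5) ^ val;
    if c0 & 1 != 0 { c ^= 0xf5dee51989; }
    if c0 & 2 != 0 { c ^= 0xa9fdca3312; }
    if c0 & 4 != 0 { c ^= 0x1bab10e32d; }
    if c0 & 8 != 0 { c ^= 0x3706b1677a; }
    if c0 & 16 != 0 { c ^= 0x644d626ffd; }
    c
}

/// Computes the checksum as 8 ASCII bytes, working on bytes throughout.
pub fn get_checksum_bytes(desc: &str) -> Result<[u8; 8], ChecksumError> {
    let mut c: u64 = 1;
    let mut cls: u64 = 0;
    let mut clscount = 0;

    for &ch in desc.as_bytes() {
        // Position of the byte in the input charset; unknown bytes are errors.
        let pos = INPUT_CHARSET
            .iter()
            .position(|&b| b == ch)
            .ok_or(ChecksumError::InvalidCharacter(ch))? as u64;
        c = poly_mod(c, pos & 31);
        cls = cls * 3 + (pos >> 5);
        clscount += 1;
        if clscount == 3 {
            c = poly_mod(c, cls);
            cls = 0;
            clscount = 0;
        }
    }
    if clscount > 0 {
        c = poly_mod(c, cls);
    }
    for _ in 0..8 {
        c = poly_mod(c, 0);
    }
    c ^= 1;

    // Map each 5-bit group of the result to a checksum character.
    let mut checksum = [0u8; 8];
    for (j, slot) in checksum.iter_mut().enumerate() {
        *slot = CHECKSUM_CHARSET[((c >> (5 * (7 - j))) & 31) as usize];
    }
    Ok(checksum)
}

/// Kept for existing callers: a thin wrapper that converts the bytes to a `String`.
pub fn get_checksum(desc: &str) -> Result<String, ChecksumError> {
    get_checksum_bytes(desc)
        .map(|bytes| String::from_utf8(bytes.to_vec()).expect("checksum charset is ASCII"))
}
```

Keeping `get_checksum` as a wrapper means the public API is unchanged for current users, while new code can call `get_checksum_bytes` and skip the allocation.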

vladimirfomene

Tested ACK

afilini

Concept ACK, just a small nit

afilini · 2022-07-19T12:38:22Z

src/descriptor/checksum.rs

+const INPUT_CHARSET: &[u8] = "0123456789()[],'/*abcdefgh@:$%{}IJKLMNOPQRSTUVWXYZ&+-.;<=>?!^_|~ijklmnopqrstuvwxyzABCDEFGH`#\"\\ ".as_bytes();
+const CHECKSUM_CHARSET: &[u8] = "qpzry9x8gf2tvdw0s3jn54khce6mua7l".as_bytes();


On both those lines I think you can use byte string literals: https://doc.rust-lang.org/reference/tokens.html#byte-string-literals

It should make the code more portable as it works on older rust versions as well (as_bytes() is a const fn only since 1.39) and it will also cause a compile error if we accidentally put a non-ascii character in there: https://play.rust-lang.org/?version=stable&mode=debug&edition=2021&gist=81fadd81957dbc1c56844a279a9d239f
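
A minimal sketch of the suggestion, using the shorter of the two constants. The name `CHECKSUM_CHARSET_STR` is hypothetical and only there to compare the two forms side by side.

```rust
// Byte string literal: the value is already a byte slice, so no
// `as_bytes()` call is needed in the const initializer.
const CHECKSUM_CHARSET: &[u8] = b"qpzry9x8gf2tvdw0s3jn54khce6mua7l";

// Same bytes, but this form depends on `str::as_bytes` being a const fn.
const CHECKSUM_CHARSET_STR: &[u8] = "qpzry9x8gf2tvdw0s3jn54khce6mua7l".as_bytes();

// A byte string literal only accepts ASCII characters and escapes, so a stray
// non-ASCII character is a compile error. Uncommenting the next line is
// expected to break the build:
// const BAD_CHARSET: &[u8] = b"é";
```

Both constants hold the same 32 bytes; the difference is only in which compiler versions accept them and in the compile-time ASCII check.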

@afilini I just updated this! Hopefully the tests pass.

`get_checksum_bytes` returns a descriptor checksum as `[u8; 8]` instead of `String`, potentially improving performance and memory usage. In addition to this, since descriptors only use characters that fit within a UTF-8 8-bit code unit, there is no need to use the `char` type (which is 4 bytes). This can also potentially bring in some performance and memory-usage benefits.

evanlinjin · 2022-07-20T05:55:48Z

@afilini @notmandatory This is ready to go imo. 😃

afilini

ACK 6db5b4a

evanlinjin force-pushed the checksum_module_additions branch from 7555ff6 to a96c7da Compare July 16, 2022 12:25

notmandatory assigned evanlinjin Jul 16, 2022

notmandatory added the new feature New feature or request label Jul 16, 2022

notmandatory reviewed Jul 16, 2022

This was referenced Jul 16, 2022

Multi descriptor: Decouple transaction building logic #647

Closed

W29 BDK Library Team Call bitcoindevkit/.github#14

Closed

vladimirfomene reviewed Jul 19, 2022

afilini reviewed Jul 19, 2022

notmandatory added this to the Release 0.21.0 Feature Freeze milestone Jul 19, 2022

evanlinjin force-pushed the checksum_module_additions branch from a96c7da to 6db5b4a Compare July 19, 2022 14:02

evanlinjin requested review from afilini and notmandatory July 20, 2022 05:54

afilini approved these changes Jul 20, 2022

afilini merged commit 45a4ae5 into bitcoindevkit:master Jul 20, 2022
