feat: continuously publish provider records #175
Conversation
p2p/src/node.rs
Outdated
@@ -250,6 +264,17 @@ where
                }
            }
        }
        provide_records = self.publisher.next() => {
            if let Some(kad) = self.swarm.behaviour_mut().kad.as_mut() {
                if let Some(provide_records) = provide_records {
Suggested change: the line `if let Some(provide_records) = provide_records {` has an extra/missing whitespace issue (the suggested replacement differs only in whitespace).
For some reason, my eyes are very good at catching extra/missing whitespaces 😂
Hmm, I would expect `cargo fmt` to catch this. I'll take a look.
`cargo fmt` can't always format code inside of a macro, and this code block lives inside a `select!` macro invocation. This is why I generally like to put as little logic inside the macro as possible and just call out to functions in the bodies of the branches.
Fixed
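The "thin branches" pattern described above can be sketched without the async machinery. This is a hedged, dependency-free illustration (the `Event` enum and handler names are hypothetical, and in the real code the branches live inside tokio's `select!` macro, which rustfmt cannot always reach); the point is that each branch body reduces to a single function call, so all the real logic stays in code rustfmt can format:

```rust
// Stand-in for the kinds of events a `select!` loop would multiplex.
// `Option<Vec<u64>>` mimics an optional batch of kademlia keys.
enum Event {
    ProvideRecords(Option<Vec<u64>>),
    Shutdown,
}

struct Node {
    published: usize,
}

impl Node {
    // All real logic lives in an ordinary function that rustfmt can format.
    fn handle_provide_records(&mut self, records: Option<Vec<u64>>) {
        if let Some(records) = records {
            self.published += records.len();
        }
    }

    fn run(&mut self, events: impl IntoIterator<Item = Event>) {
        for event in events {
            // In the async version, each `select!` arm would be just a call:
            match event {
                Event::ProvideRecords(r) => self.handle_provide_records(r),
                Event::Shutdown => break,
            }
        }
    }
}
```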
one/src/lib.rs
Outdated
@@ -78,7 +78,7 @@ struct DaemonOpts {
     #[arg(
         short,
         long,
-        default_value = "127.0.0.1:9090",
+        default_value = "127.0.0.1:9464",
Why this number vs any other?
9464 is the standard port for hosting metrics; 9090 is the standard port for hosting Prometheus itself. I got them mixed up initially. This changes to the more standard port.
    .set_parallelism(config.kademlia_parallelism)
    .set_query_timeout(config.kademlia_query_timeout)
    .set_provider_record_ttl(config.kademlia_provider_record_ttl)
    // Disable record (re)-replication and (re)-publication
Maybe we should reference here that we do this elsewhere. As written, this looks like we are not publishing provider records at all.
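For context, a hedged sketch (not the PR's actual code) of how libp2p-kad's automatic (re)publication is typically switched off so an external publisher owns the schedule. The setter names come from libp2p-kad's `KademliaConfig`; exact names can differ between libp2p versions, and `provider_ttl` is an assumed parameter:

```rust
use std::time::Duration;

use libp2p::kad::KademliaConfig;

// Hypothetical helper: kademlia still answers queries from its stores,
// but all (re)publication timers are disabled so our Publisher drives
// the publication schedule instead.
fn kad_config(provider_ttl: Duration) -> KademliaConfig {
    let mut cfg = KademliaConfig::default();
    cfg.set_provider_record_ttl(Some(provider_ttl));
    // Disable kademlia's own record/provider republication timers.
    cfg.set_replication_interval(None);
    cfg.set_publication_interval(None);
    cfg.set_provider_publication_interval(None);
    cfg
}
```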
@@ -1321,12 +1345,14 @@ mod tests {
        // Using an in-memory DB for the tests; for realistic benchmarks a disk DB is needed.
        let sql_pool = SqlitePool::connect("sqlite::memory:").await?;

        let metrics = Metrics::register(&mut prometheus_client::registry::Registry::default());
Is this recording metrics in tests?
Is this because we want the metrics from the test or just to make the tests more like release?
This is because I didn't abstract over optional metrics in the code. The types require a Metrics instance in order to be constructed. So we do not validate metrics in tests yet, but this is how we could.
This adds a mechanism to continuously publish provider records. The behavior is that publishes are spread out over the entire publication interval with retries for any failed publishes. The effect is that each block in the store will have its hash published as a provider record at least once per interval.
Implementation Details
Some things of note about how this was implemented:
The Publisher type implements the Stream trait, continuously emitting batches of keys that need to be published. The node then tells kademlia to publish them and informs the Publisher of the results. When a batch has completed, i.e. the publisher knows the pass/fail result of each key, it starts a new batch.
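The batching contract described above can be sketched in dependency-free form. This is a hedged illustration, not the PR's code: the real Publisher implements the async `Stream` trait, while here `next_batch`/`report` and the `String` keys are hypothetical stand-ins. Keys reported as failed go back on the queue and are retried in a later batch:

```rust
use std::collections::VecDeque;

struct Publisher {
    pending: VecDeque<String>, // keys still to publish this interval
    batch_size: usize,
    in_flight: Vec<String>,    // the batch awaiting pass/fail results
}

impl Publisher {
    fn new(keys: impl IntoIterator<Item = String>, batch_size: usize) -> Self {
        Self {
            pending: keys.into_iter().collect(),
            batch_size,
            in_flight: Vec::new(),
        }
    }

    // Emit the next batch of keys; only called once the previous
    // batch's results have been reported back.
    fn next_batch(&mut self) -> Vec<String> {
        let mut batch = Vec::new();
        for _ in 0..self.batch_size {
            match self.pending.pop_front() {
                Some(key) => batch.push(key),
                None => break,
            }
        }
        self.in_flight = batch.clone();
        batch
    }

    // The node reports per-key results; failed keys are re-queued
    // so they are retried in a later batch.
    fn report(&mut self, failed: &[String]) {
        for key in std::mem::take(&mut self.in_flight) {
            if failed.contains(&key) {
                self.pending.push_back(key);
            }
        }
    }
}
```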
Between each batch the publisher may sleep/delay if it's ahead of its projected deadline (the end of the interval). Structured logs report whether the publisher is ahead or behind.
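The ahead/behind decision can be sketched as a small pacing function. The name and exact formula here are assumptions, not the PR's code; the idea is to compare how far through the key set we are against how far through the publication interval we are:

```rust
use std::time::Duration;

// Decide whether to sleep before the next batch. Returns Some(delay)
// when the publisher is ahead of its projected deadline, None when it
// is behind (publish the next batch immediately; a structured log
// could report the lag).
fn delay_before_next_batch(
    interval: Duration, // total publication interval
    elapsed: Duration,  // time spent so far in this interval
    keys_done: usize,   // keys already published this interval
    keys_total: usize,  // keys that must be published by the deadline
) -> Option<Duration> {
    if keys_total == 0 || keys_done == 0 {
        return None;
    }
    // Projected time by which `keys_done` keys should have been
    // published if work were spread evenly across the interval.
    let target = interval.mul_f64(keys_done as f64 / keys_total as f64);
    // Ahead of schedule: sleep until the projected point. Behind:
    // checked_sub underflows to None, so we continue immediately.
    target.checked_sub(elapsed)
}
```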
Metrics have been added to monitor the success/fail rate of publish events.
The kademlia protocol itself has been configured to not do any of its own republishing and to store all provider records in memory. This way it can answer queries from other peers. This means the data is duplicated on disk and in memory, which is not ideal but seems to be the best we can do for now.
Performance
On my local system I was able to publish records at a very high rate, 500K in a few hours. This seems to be because we iterate the records in order of hash, so we talk to adjacent sets of peers for large numbers of records and then slowly move through the key space. The publisher has some settings we can tweak to adjust for production workloads as needed.