Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow skipping block history download in warp sync #2710

Closed

Conversation

tmpolaczyk
Copy link
Contributor

Adds a new sync mode --warp-no-block-history that is exactly the same as --warp, but doesn't download block history.

Close #8

@paritytech-cicd-pr
Copy link

The CI pipeline was cancelled due to failure one of the required jobs.
Job name: cargo-clippy
Logs: https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/4724646

@tomaka
Copy link
Contributor

tomaka commented Dec 18, 2023

I don't know whether it is already the case as I'm not familiar with the sync code, but we must make sure that, no matter what, the block headers and justifications of session change blocks are downloaded anyway.

Otherwise it won't be possible for other nodes to warp sync from your node, which means that if enough people run with this option we could completely break warp syncing on the entire network.

@dmitry-markin
Copy link
Contributor

but we must make sure that, no matter what, the block headers and justifications of session change blocks are downloaded anyway.

As far as I can read the code, that's not the case — as a result of warp sync we only import target block body, state, and justifications. So it won't be possible to warp sync from a node that skipped block history phase.

Also, it needs to be carefully evaluated if adding nodes not able to provide syncing from genesis is safe for network. And likely address #523 first.

So, right now adding an option to turn off gap sync (block history phase) sounds risky.

@tmpolaczyk
Copy link
Contributor Author

Thanks for your comments. Is the justifications problem something that affects only the relaychain, or does it affects parachains as well? Because I only need this feature for syncing parachains, so I could modify the pr to error if trying to sync a relaychain.

Otherwise is there any way to make this a hidden sync mode or something? Because for my use case it doesn't matter if other nodes are not able to sync from this one.

@tomaka
Copy link
Contributor

tomaka commented Dec 18, 2023

Also, it needs to be carefully evaluated if adding nodes not able to provide syncing from genesis is safe for network. And likely address #523 first.

I've recently noticed that more and more nodes refuse to serve warp sync requests. Currently, around 20% of warp sync requests fail on Westend, and after a small investigation it seems that it's because they target nodes that have blocks pruning enable.

While it's not alarming yet, it could become alarming if people enable the mode that this PR adds.

@tomaka
Copy link
Contributor

tomaka commented Dec 18, 2023

Because for my use case it doesn't matter if other nodes are not able to sync from this one.

The problem that we're talking about is a tragedy of the commons type of problem.
If you alone enable this mode then there's no problem, but the more people enable it (and I get that many people would want to enable it) the worse the syncing will become for everyone.

@girazoki
Copy link
Contributor

girazoki commented Dec 18, 2023

can we make it under a feature flag? Our usecase is rotating-collators that warp synch to a full-node that has the history of a chain. The collators are not intended to store the history.

Now using full-sync but then enabling block-prunning has the same implications as this PR right?

@tomaka
Copy link
Contributor

tomaka commented Dec 18, 2023

An alternative way is that nodes that have this mode enabled do not participate in the DHT at all.

Light clients currently do not participate in the DHT at all because they would otherwise be polluting the network similar to how nodes that have this mode enabled would be polluting the network.

Now using full-sync but then enabling block-prunning has the same implications as this PR right?

Yes, but that doesn't make it correct. See #2733.

bkontur added a commit that referenced this pull request Dec 18, 2023
68d8650 Bump thiserror from 1.0.50 to 1.0.51
009c989 remove no longer valid check from the ensure_weights_are_correct (#2740)
94c44a7 Added Rococo BH <> Rococo Bulletin bridge (#2724)
5fe0f2f Bump tokio from 1.34.0 to 1.35.0
25f8251 Grafana update stuff (#2733)
06fbe8b Improved `ExportXcm::validate` implementation for BridgeHubs - step 1 (#2727)
390e836 Select header that will be fully refunded in on-demand batch finality relay (#2729)
ce701dd separate constants for average and worst case relay headers (#2728)
09215c5 Backport from `polkadot-sdk` + bump (#2725)
6327261 Bump serde from 1.0.192 to 1.0.193
fff9ddd Bump sysinfo from 0.29.10 to 0.29.11
4be99fe Monitoring and alerts for Rococo/Westend (#2710)
67a683a Bump ed25519-dalek from 2.0.0 to 2.1.0
8e0e794 quick and dirty fix for the `wait -p` and older distros (#2712)
3ab6562 Add withdraw reserve assets to zombienet tests (#2711)
c2c409b increase init timeouts in zombienet tests (#2706)
a8c60b4 fix lane id and bridged chain id (#2705)
9ac0f26 removed bp-asset-hub-kusama and bp-asset-hub-polkadot (#2703)
4916475 Some fixes for zombienet tests (polkadot-staging) (#2704)
6f9a147 zombienet from Wococo to Westend (#2699)
3ba7910 Porting changes from polkadot-sdk to polkadot-staging - before update subtree with removed wococo stuff (#2696)
653448f Remove Woococo related stuff (#2692)
03aaab2 Gitspiegel polkadot staging (#2695)
702a4c1 Drop Rialto <> Millau bridges (#2663) (#2694)
6a63b5f Start version guards for the ED loop (#2678)
896b9a9 typo (#2690)
671d27c Bump serde from 1.0.190 to 1.0.192
991b229 Bump clap from 4.4.7 to 4.4.8
ec267ec Bump env_logger from 0.10.0 to 0.10.1
592e407 Bump tokio from 1.33.0 to 1.34.0
c49ce3d Bump serde_json from 1.0.107 to 1.0.108
04b3319 Update subxt-codegen version (#2674)
03f9804 backport #2139 (#2673)
49245dd removed unused PARACHAINS_FINALITY_PALLET_NAME constant (#2670)
658a3f5 BHR/BHWE spec_version according to the `polkadot-sdk` (#2668)
7666b94 Nit from `polkadot-sdk` (#2665)
b5c43bb Adjusted constant because for measuring we used mistakenly rococo constants (#2664)
062449d Add Rococo<>Westend bridge support/relay (#2647)
55eb44e Add basic zombienet test to be used in the future (#2649) (#2660)
93b6b3f Bump clap from 4.4.6 to 4.4.7
4c01ab0 Bump futures from 0.3.28 to 0.3.29
a31a6c0 Bump tempfile from 3.8.0 to 3.8.1
bcdfe83 Bump serde from 1.0.189 to 1.0.190
f7433b0 Port #2648 to polkadot-staging (#2651)
3896738 Bump scale-info from 2.9.0 to 2.10.0
12d62c5 Bump thiserror from 1.0.49 to 1.0.50
1d78aa1 Backport from `polkadot-sdk` with actual master (#2633)
ab4de94 Grandpa justifications: Avoid duplicate vote ancestries (#2634) (#2635)
465562a add missing crate descriptions (#2629)
28d3680 Bump fixed-hash
67528c4 Bump serde from 1.0.188 to 1.0.189
d450c47 Bump time from 0.3.29 to 0.3.30
6a19f83 Bump async-trait from 0.1.73 to 0.1.74
a92d213 Millau, Rialto: accept equivocation reports (#2614) (#2617)
a61f777 Bump tokio from 1.32.0 to 1.33.0
0052f64 Bump subxt from 0.32.0 to 0.32.1
ccc849d Bump num-traits from 0.2.16 to 0.2.17
22f2752 apply late suggestions for #2600 (#2603)
0320172 actualize check_obsolete_call comment (#2601)
5cbbd25 Reject transactions if bridge pallets are halted (#2600)
ca4dfe3 Bump subxt from 0.31.0 to 0.32.0
8bf7b58 Bump clap from 4.4.4 to 4.4.6
88b0b99 Bump thiserror from 1.0.48 to 1.0.49
263833b https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/3833103 (#2589)
4f44968 Backport changes from polkadot-sdk (#2588)
7200ed1 fiox overflow when computing priority boost (#2587)
e02cbd3 Bump time from 0.3.28 to 0.3.29
a097dd2 Bump clap from 4.4.3 to 4.4.4
801ce88 Merge bulletin chain changes into polkadot staging (#2574)
a3803ce Add unit tests for the equivocation detection loop (#2571)
26dfc31 Bump clap from 4.4.2 to 4.4.3
66a8beb Bump serde_json from 1.0.106 to 1.0.107
18c50da Bump trie-db from 0.27.1 to 0.28.0
4c4fa92 Equivocation detection loop: Reorganize block checking logic as state machine (#2555) (#2557)
6bd317a Bump serde_json from 1.0.105 to 1.0.106
a7e6bfd Backport for polkadot-sdk#1446 (#2546)
d9f8050 Bump sysinfo from 0.29.9 to 0.29.10
901f44c Bump thiserror from 1.0.47 to 1.0.48
82eeb50 Bump sysinfo from 0.29.8 to 0.29.9
a0c934b Bump strum from 0.24.1 to 0.25.0
1064fbf Bump subxt from 0.28.0 to 0.31.0
e50398d bridges subtree fixes (#2528)
99af075 Markdown linter (#1309) (#2526)
733ff0f `polkadot-staging` branch: Use polkadot-sdk dependencies (#2524)
e8a59f1 Fix benchmark with new XCM::V3 `MAX_INSTRUCTIONS_TO_DECODE` (#2514)
62b185d Backport `polkadot-sdk` changes to `polkadot-staging` (#2518)
d9658f4 Fix equivocation detection containers startup (#2516) (#2517)
d65db28 Backport: building images from locally built binaries (#2513)
5fdbaf4 Start the equivocation detection loop from the complex relayer (#2507) (#2512)
7fbb67d Backport: Implement basic equivocations detection loop (#2375)
cb7efe2 Manually update deps in polkadot staging (#2371)
d17981f #2351 to polkadot-staging (#2359)

git-subtree-dir: bridges
git-subtree-split: 68d8650
Copy link
Member

@bkchr bkchr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This will first require: #2738 and #2733

I would also not add a new warp sync mode and instead this should be decided based on the blocks-pruning/state pruning flags.

@tmpolaczyk
Copy link
Contributor Author

@bkchr regarding block prunning, I was thinking the same. I just tested that running --sync warp --blocks-pruning=100 does still download the full block history. So in that case we want to only download HEADER | BODY | JUSTIFICATION of the latest 100 blocks, and HEADER | JUSTIFICATION of the full block history. Is that correct?

@bkchr
Copy link
Member

bkchr commented Dec 18, 2023

So in that case we want to only download HEADER | BODY | JUSTIFICATION of the latest 100 blocks

Not sure I would care that much about this part, but it would be for sure the cleanest implementation. We will need some kind of implementation that handles the "dynamic nature", as the node will continue to import and finalize new blocks. This means, the "ultimate" last block to download moves all the time. However, it should not be that complicated to implement and also downloading one more block is probably fine.

and HEADER | JUSTIFICATION of the full block history.

The beauty of #2738 would be that we do not need to download them, as we already downloaded them as part of the warp proof. When we have support for syncing from a checkpoint, we will not have all the justifications back from the genesis anymore. We could then discuss to download them or we would expect that all nodes always use some recent checkpoint.

@dmitry-markin
Copy link
Contributor

To give a heads up — as the syncing is currently being heavily refactored in preparation for Sync 2.0, it's better to wait at least for #2467 and follow-up warp sync strategy PRs to be merged before implementing improvements to warp sync. Otherwise we'll end up with a lot of conflicts.

@tmpolaczyk
Copy link
Contributor Author

Ok, I will close this PR given that it needs to be rewritten anyway. If after sync 2.0 this functionality is still missing I will open a new PR. Thank you all for your input!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

warp sync without download histrocial blocks
6 participants