peer, main, netsync, blockchain: parallel block downloads #2226

kcalvinalvin · 2024-08-07T11:35:09Z

This PR modifies netsync.Manager so that all the header-first blocks downloaded before the last checkpoint is done out of order by utilizing query.WorkManager from neutrino.

Gonna put it in draft for now as testing is sorta difficult and I'm not convinced it's downloading blocks faster for mainnet. By my testing it works just fine in testnet but mainnet seems to be slow when downloading blocks. Still identifying where the bottleneck is and will make adjustments accordingly.

If anyone else would like to give this a try please let me know if you see speed ups or slow downs from this PR.

coveralls · 2024-08-07T11:38:32Z

Pull Request Test Coverage Report for Build 10733178355

Details

13 of 392 (3.32%) changed or added relevant lines in 6 files are covered.
56 unchanged lines in 5 files lost coverage.
Overall coverage decreased (-0.4%) to 56.802%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
peer/peer.go	2	18	11.11%
server.go	0	18	0.0%
blockchain/accept.go	10	33	30.3%
netsync/blocklogger.go	0	54	0.0%
netsync/manager.go	0	268	0.0%

Files with Coverage Reduction	New Missed Lines	%
connmgr/connmanager.go	1	86.27%
peer/peer.go	9	72.68%
netsync/manager.go	10	0.0%
wire/msgaddrv2.go	16	51.72%
wire/netaddressv2.go	20	74.45%

Totals
Change from base Build 10555907676:	-0.4%
Covered Lines:	29808
Relevant Lines:	52477

💛 - Coveralls

kcalvinalvin · 2024-08-08T07:55:39Z

Seems like the slowdowns are coming from a single peer that's slowing down the block processing. peerWorkManager having capabilities to completely disconnect a peer would help in this.

Roasbeef · 2024-08-13T02:06:39Z

Seems like the slowdowns are coming from a single peer that's slowing down the block processing. peerWorkManager having capabilities to completely disconnect a peer would help in this.

Interesting, is the issue that the single peer is assigned blocks uniformly and is always the last one to send (haven't looked in PR at detail yet). I have this tracking issue in the neutrino repo for adding stuff like dynamic tuning, better work assignment, and also work stealing. With work stealing, the faster peers would steal the block request from the work queue of the slow peer, with faster peers helping us to be as slow as the fastest peer.

query.Peer is used for downloading blocks out of order during headers first download. Methods SubscribeRecvMsg() and OnDisconnect() are added to abide by the interface.

ConnectedPeers returns all the currently connected peers. This is used to provide the query.WorkManager with all the currently connected peers from the netsync package.

checkpointedBlocksQuery is a helper to create []*query.Request which can be passed off to query.Workmanager to query for wire.Messages to multiple peers. This is useful for downloading blocks out of order from multiple peers during ibd.

handleBlockMsg used to check that the block header is both valid and then process the blocks as they come in. It's now refactored so that it also handles blocks that are not in order. For out of order block downloads handleBlockMsg would mark the block as an orphan but it's now refactored to handle those cases. Whenever a block that's not the next from the chain tip is received, it's now temporarily stored in memory until the next block from the chain tip is received. And then all the blocks that are in sequence are processed.

peerSubscription is added to Manager which will allow it subscribers to receive peers through the channel whenever the Manager is aware of a new peer that it's been connected to. This is useful to alert query.Workmanager that a new peer that's been connected to is eligible to download blocks from.

ConnectedPeers returns all the currently connected peers and any new peer that's additionally connected through the returned channel. This method is required for query.Workmanager as it needs ot receive peers that it can request blocks from.

The blocks that were requested from headers are now sent over to query.Workmanager where it will rank peers based on their speed and request blocks from them accordingly. This allows for quicker block downloads as: 1: Workmanager will prioritize faster peers. 2: Workmanager is able to ask from multiple peers.

Storing block happens before the block validation is done and this can be a bottleneck on computers with slow disks. Allowing for concurrent block storage saves time as the disk operation can be done in parallel with the cpu operations of verifying the block.

headers-first block download

Resetting the requestedBlocks state in headersFirst is problematic since we may be banning peers that are still good.

saubyk · 2024-10-03T15:26:31Z

cc: @Crypt-iQ @ProofOfKeags for review

kcalvinalvin added 2 commits August 20, 2024 15:18

peer: make peer meet query.Peer interface

782dd2f

query.Peer is used for downloading blocks out of order during headers first download. Methods SubscribeRecvMsg() and OnDisconnect() are added to abide by the interface.

main: add ConnectedPeers() to server

510083d

ConnectedPeers returns all the currently connected peers. This is used to provide the query.WorkManager with all the currently connected peers from the netsync package.

kcalvinalvin force-pushed the 2024-04-01-parallel-ibd branch from 9d1c6a2 to c51d31a Compare September 5, 2024 14:05

kcalvinalvin added 9 commits September 6, 2024 14:34

netsync: add checkpointedBlocksQuery

eebcd60

checkpointedBlocksQuery is a helper to create []*query.Request which can be passed off to query.Workmanager to query for wire.Messages to multiple peers. This is useful for downloading blocks out of order from multiple peers during ibd.

netsync, main: add ConnectedPeers to Manager

ee0f49d

ConnectedPeers returns all the currently connected peers and any new peer that's additionally connected through the returned channel. This method is required for query.Workmanager as it needs ot receive peers that it can request blocks from.

netsync: add logger for blocks downloaded from different peers during

7c1b80f

headers-first block download

netsync: don't reset the requestedBlocks in headersFirst

d8d8dd7

Resetting the requestedBlocks state in headersFirst is problematic since we may be banning peers that are still good.

main: include query logging

2cbb562

kcalvinalvin force-pushed the 2024-04-01-parallel-ibd branch from c51d31a to 2cbb562 Compare September 6, 2024 05:51

kcalvinalvin marked this pull request as ready for review September 9, 2024 23:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

peer, main, netsync, blockchain: parallel block downloads #2226

peer, main, netsync, blockchain: parallel block downloads #2226

kcalvinalvin commented Aug 7, 2024

coveralls commented Aug 7, 2024 •

edited

Loading

kcalvinalvin commented Aug 8, 2024

Roasbeef commented Aug 13, 2024

saubyk commented Oct 3, 2024

peer, main, netsync, blockchain: parallel block downloads #2226

Are you sure you want to change the base?

peer, main, netsync, blockchain: parallel block downloads #2226

Conversation

kcalvinalvin commented Aug 7, 2024

coveralls commented Aug 7, 2024 • edited Loading

Pull Request Test Coverage Report for Build 10733178355

Details

💛 - Coveralls

kcalvinalvin commented Aug 8, 2024

Roasbeef commented Aug 13, 2024

saubyk commented Oct 3, 2024

coveralls commented Aug 7, 2024 •

edited

Loading