[Merged by Bors] - sync2: add sqlstore #6405
Conversation
Given that after recent item sync is done (if it's needed at all), the range set reconciliation algorithm no longer depends on newly received items being added to the set, we can save memory by not adding the received items during reconciliation. During real sync, the received items will be sent to the respective handlers, and after the corresponding data are fetched and validated, they will be added to the database, without the need to add them to the cloned OrderedSets which are used to sync against particular peers.
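As a rough sketch of that flow (hypothetical names throughout; this is not the actual go-spacemesh API), a handler might fetch and validate each received key's data before committing it to the database, leaving the per-peer cloned sets untouched:

```go
package sketch

import "fmt"

type KeyBytes []byte

// handleReceived illustrates the flow described above: keys discovered
// during reconciliation bypass the cloned OrderedSet and reach the
// database only after their data is fetched and validated.
func handleReceived(
	keys []KeyBytes,
	fetchAndValidate func(KeyBytes) error, // fetch the data and validate it
	commit func(KeyBytes) error, // add to the database, not to the set clone
) error {
	for _, k := range keys {
		if err := fetchAndValidate(k); err != nil {
			return fmt.Errorf("fetch/validate %x: %w", k, err)
		}
		if err := commit(k); err != nil {
			return fmt.Errorf("commit %x: %w", k, err)
		}
	}
	return nil
}
```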
Codecov Report

Additional details and impacted files:

```
@@           Coverage Diff            @@
##           develop   #6405   +/-   ##
=========================================
- Coverage     79.9%   79.8%   -0.1%
=========================================
  Files          335     341      +6
  Lines        43656   44204    +548
=========================================
+ Hits         34892   35296    +404
- Misses        6800    6915    +115
- Partials      1964    1993     +29
```
This adds multi-peer synchronization support. When the local set differs too much from the remote sets, "torrent-style" split sync is attempted, which splits the set into sub-ranges and syncs each sub-range against a separate peer. Otherwise, a full sync is done, syncing the whole set against each of the synchronization peers. A full sync is also done after each split sync run. The local set can be considered synchronized after the specified number of full syncs has happened. The approach is loosely based on the paper [SREP: Out-Of-Band Sync of Transaction Pools for Large-Scale Blockchains](https://people.bu.edu/staro/2023-ICBC-Novak.pdf) by Novak Boškov, Sevval Simsek, Ari Trachtenberg, and David Starobinski.
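For illustration, a minimal sketch of that strategy might look as follows; the types, thresholds, and method names here are hypothetical, not the actual implementation:

```go
package multipeer

type peer string

type syncer struct {
	peers           []peer
	fullSyncsNeeded int     // consider the set synced after this many full syncs
	maxDiffForFull  float64 // above this difference, try split sync first
}

// run keeps syncing until enough full syncs have completed.
func (s *syncer) run(estimateDiff func() float64) {
	for fullSyncs := 0; fullSyncs < s.fullSyncsNeeded; fullSyncs++ {
		if estimateDiff() > s.maxDiffForFull {
			// "torrent-style" split sync: one sub-range per peer.
			for i, p := range s.peers {
				s.syncSubrange(p, i, len(s.peers))
			}
		}
		// A full sync against every peer follows each split sync run
		// (and is the default path when the sets are close already).
		for _, p := range s.peers {
			s.syncSubrange(p, 0, 1)
		}
	}
}

// syncSubrange would reconcile sub-range idx of total with peer p
// using range set reconciliation (stubbed out here).
func (s *syncer) syncSubrange(p peer, idx, total int) {}
```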
The `sqlstore` package provides a simple sequence-based interface to the tables being synchronized. It is used by the FPTree data structure as the database layer and doesn't do range fingerprinting by itself. `SyncedTable` and `SyncedTableSnapshot` provide methods that wrap the necessary SQL operations. The `sql/expr` package was added to facilitate SQL generation.
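As a rough illustration of the AST-based approach to SQL generation (a toy sketch only; the real `sql/expr` API may look quite different):

```go
package exprsketch

import "strings"

// Expr is a node of a tiny SQL expression AST.
type Expr interface{ SQL() string }

// Ident is a bare identifier or placeholder.
type Ident string

func (i Ident) SQL() string { return string(i) }

// Bin is a binary operation such as ">=" or "AND".
type Bin struct {
	Op   string
	L, R Expr
}

func (b Bin) SQL() string { return "(" + b.L.SQL() + " " + b.Op + " " + b.R.SQL() + ")" }

// SelectWhere renders a SELECT statement from AST parts instead of
// scattering string concatenation through the code.
func SelectWhere(cols []string, table string, where Expr) string {
	var sb strings.Builder
	sb.WriteString("SELECT ")
	sb.WriteString(strings.Join(cols, ", "))
	sb.WriteString(" FROM ")
	sb.WriteString(table)
	if where != nil {
		sb.WriteString(" WHERE ")
		sb.WriteString(where.SQL())
	}
	return sb.String()
}
```

With this sketch, `SelectWhere([]string{"id"}, "atxs", Bin{Op: ">=", L: Ident("id"), R: Ident("?")})` would yield `SELECT id FROM atxs WHERE (id >= ?)`; the table name `atxs` is just a placeholder.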
sync2/sqlstore/dbseq.go
```go
type dbIDKey struct {
	//nolint:unused
	id string
	//nolint:unused
	chunkSize int
}
```
are these linter annotations still needed?
Yes, the linter appears to be silly enough to mark fields which are never accessed directly (like `foo.bar`) as unused, even though it's often useful to have such fields in structs serving as map keys etc.
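A minimal repro of that situation (hypothetical names): the fields below are only ever written via composite literals and compared through map lookups, so some linters report them as unused even though they participate in key equality:

```go
package keysketch

type cacheKey struct {
	id        string // never read as k.id, only compared via map lookups
	chunkSize int
}

var cache = map[cacheKey][]byte{}

// lookup finds a cached chunk; the key fields are "used" only through
// the map's built-in key comparison.
func lookup(id string, chunkSize int) ([]byte, bool) {
	v, ok := cache[cacheKey{id: id, chunkSize: chunkSize}]
	return v, ok
}
```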
I'm not a big fan of structs as keys, and `nolint` indicates an issue (although not necessarily the correct one): is `chunkSize` really part of the key and not part of the value object? If I look up the same ID with a different `chunkSize`, should I expect to not find it in the cache, or should I find it and then evaluate the `chunkSize` to be the one I expect?
Also, if `chunkSize` can change dynamically, doesn't this lead to the same values (by ID) being stored multiple times in the cache with different `chunkSize`s?
After a bit more benchmarking, I've removed the LRU cache altogether. It was more useful with an earlier FPTree iteration that did not do node-aligned range splits, but now it just doesn't improve performance at all.
sync2/sqlstore/dbseq.go
```go
// LRU cache for ID chunks.
type lru = simplelru.LRU[dbIDKey, []rangesync.KeyBytes]

const lruCacheSize = 1024 * 1024
```
Might be good to have a comment here about how much memory this would translate into in total when the LRU cache is full.
It's somewhat hard to say exactly what the size of the cache will be, as it stores chunks of different sizes. I'll add a limit on the size of each cached chunk and make the cache size configurable.
Removed the LRU cache (see above)
```go
// if the chunk size was reduced due to a short chunk before wraparound, we need
// to extend it back
if cap(s.chunk) < s.chunkSize {
	s.chunk = make([]rangesync.KeyBytes, s.chunkSize)
}
```
I'm not sure this is actually needed. Unless you're passing the actual slice somehow, there's no real need to allocate again. You can just over-allocate the first time and use the `clear()` builtin over it to reuse that.
This is needed because the chunk size is dynamic. As the SQL iterator progresses, it loads larger and larger chunks by applying larger `LIMIT` values. This is due to the usage pattern of `FPTree`, which may request just a few items from some iterators and a lot of items from others.

The comment was somewhat misplaced, because it applies to the case where capacity is enough. We still need to reallocate the chunk if `chunkSize` grew. Allocating max-sized chunks from the start for each iterator created might be wasteful.
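A condensed sketch of that pattern (hypothetical names; not the actual dbseq.go code): each load doubles the `LIMIT` up to a maximum, reallocating the chunk only when it actually grew past the current capacity:

```go
package chunksketch

type seq struct {
	chunk        [][]byte // most recently loaded chunk of IDs
	chunkSize    int      // next LIMIT to use
	maxChunkSize int
}

// loadNext fetches the next chunk, growing the LIMIT geometrically so that
// iterators that need only a few items stay cheap while long scans amortize
// per-query overhead.
func (s *seq) loadNext(query func(limit int) ([][]byte, error)) error {
	if cap(s.chunk) < s.chunkSize {
		// chunkSize outgrew the buffer: reallocate.
		s.chunk = make([][]byte, s.chunkSize)
	} else {
		// capacity is sufficient: just extend the length back.
		s.chunk = s.chunk[:s.chunkSize]
	}
	rows, err := query(s.chunkSize) // e.g. SELECT ... ORDER BY id LIMIT ?
	if err != nil {
		return err
	}
	s.chunk = s.chunk[:copy(s.chunk, rows)]
	s.chunkSize = min(s.chunkSize*2, s.maxChunkSize)
	return nil
}
```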
sync2/sqlstore/dbseq.go
```go
s.chunkSize = min(s.chunkSize*2, s.maxChunkSize)
switch {
case err != nil || ierr != nil:
	return errors.Join(ierr, err)
```
Returning both errors doesn't make sense to me. I think `ierr` should take precedence, since the DB request was interrupted because of it and `err` isn't interesting in this case, i.e.:

```go
case ierr != nil:
	return ierr
case err != nil:
	return err
case n == 0:
	// unchanged
	...
```
Fixed, thanks.

It didn't add much to efficiency. Re-generating SQL statements every time is quick but increases GC pressure.
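A sketch of the trade-off being described (hypothetical names; not the actual code): since the chunk sizes grow geometrically, only a handful of distinct `LIMIT` values ever occur, so caching the generated statements by limit keeps allocations, and hence GC pressure, down:

```go
package stmtcache

import (
	"fmt"
	"sync"
)

// stmts caches generated SQL text keyed by LIMIT value; with geometrically
// growing chunk sizes the number of distinct keys stays small.
var stmts sync.Map

func selectWithLimit(limit int) string {
	if s, ok := stmts.Load(limit); ok {
		return s.(string)
	}
	s := fmt.Sprintf("SELECT id FROM ids WHERE id >= ? ORDER BY id LIMIT %d", limit)
	stmts.Store(limit, s)
	return s
}
```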
bors merge
Pull request successfully merged into develop. Build succeeded.
## Motivation
The database-aware sync implementation is rather complex and splitting
it in parts might help the review process.
## Description
The `sqlstore` package provides a simple sequence-based interface to the tables being synchronized. It is used by the FPTree data structure as the database layer, and doesn't do range fingerprinting by itself. `SyncedTable` and `SyncedTableSnapshot` provide methods that wrap the necessary SQL operations. The `sql/expr` package was added to facilitate SQL generation.
The `sql/expr` package is also used by Bloom filters (#6332), which are to be re-benchmarked and updated later. The idea is to use this simple AST-based SQL generator in other places in the code where we're generating SQL. The rqlite dependency which is introduced is also to be used to convert schema SQL to a "canonical" form during schema drift detection, and to filter out comments properly in the migration scripts.
#6358 needs to be merged before this one.