
Track buckets in DB instead of in-memory #609

Merged

Conversation

neelvirdy
Contributor

@neelvirdy neelvirdy commented Nov 28, 2022

Addresses #556 by removing the Buckets in-memory state from the ContentManager and replacing it with DB queries.

After #535, most of the staging zone state was removed. Staging zone size is now tracked incrementally as contents are added and removed. This allows readiness to be computed on the fly in constant time with a simple size >= MinDealContentSize check.

The ContentManager now only tracks which zones are consolidating in memory. If this state is lost in a restart, all zones will be considered not consolidating, causing them to be reattempted.
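As a rough illustration of the constant-time readiness check described above (a minimal sketch; the type and field names, and the threshold value, are assumptions for the example, not necessarily the identifiers used in the Estuary codebase):

```go
package main

import "fmt"

// Assumed threshold for the sketch; the real value comes from Estuary's deal configuration.
const MinDealContentSize = int64(3_500_000_000)

// StagingZone stands in for the aggregate content row whose Size column is
// updated incrementally as contents are added to or removed from the zone.
type StagingZone struct {
	ID   uint
	Size int64
}

// IsReady needs no scan over the zone's contents: because Size is kept up to
// date in the DB, readiness is a single comparison.
func (z *StagingZone) IsReady() bool {
	return z.Size >= MinDealContentSize
}

func main() {
	z := StagingZone{ID: 1, Size: 4_000_000_000}
	fmt.Println(z.IsReady()) // true: the zone can move on to dealmaking
}
```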


Recompute task validation:

Uploaded via dev:
[screenshot]

Checked out the feature branch and started Estuary:
[screenshot]

(The first screenshot also showed hashes.txt; it was accidentally cut off.)

Uploaded 3 GB more of files and removed 1 GB on the feature branch to push it over the threshold; it moved into dealmaking successfully with the correct size metadata.


Follow-ups:

  • Hide the recompute size task behind a startup flag so it doesn't run on every start; it should only ever need to run once (see the sketch after this list).
  • Readiness no longer needs to be checked on an interval - it's knowable in constant time as soon as content is added, so the zone can simply be added to a queue for the staging bucket worker to process.
  • The FE should remove its concept of readiness: since we no longer send any readiness metadata, it is always "No" with a blank reason in the FE after this change.
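A minimal sketch of the first follow-up, using Go's standard flag package for illustration; the flag name and the RecomputeZoneSizes helper are hypothetical, not actual Estuary options:

```go
package main

import (
	"flag"
	"log"
)

// RecomputeZoneSizes stands in for the migration task described in this PR:
// re-derive each staging zone's size from its contents and fix up the DB.
func RecomputeZoneSizes() error {
	// ... recompute and persist zone sizes ...
	return nil
}

func main() {
	recompute := flag.Bool("recompute-staging-zone-sizes", false,
		"run the one-time staging zone size recompute at startup")
	flag.Parse()

	if *recompute {
		if err := RecomputeZoneSizes(); err != nil {
			log.Fatalf("recomputing staging zone sizes: %v", err)
		}
	}
	// ... continue normal startup ...
}
```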

@neelvirdy neelvirdy requested a review from en0ma November 28, 2022 20:44
@neelvirdy neelvirdy marked this pull request as ready for review November 28, 2022 21:09
Contributor

@alvin-reyes alvin-reyes left a comment


I was thinking we could add a new buckets table, which would have a bucket UUID; every CID is then assigned to one of these buckets, which can either be a dedicated bucket (per user) or a global bucket.

The bucket is then processed like a batch job with different trigger parameters:
1 - size of the bucket: if it reaches a threshold (3.5 GB), it gets processed.
2 - time: we can schedule it every day as a cron job.
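To make the suggestion concrete, here is a rough GORM-style sketch of what such a buckets table and its triggers could look like; these types, fields, and the threshold constant are illustrative assumptions, not part of this PR or the existing schema:

```go
package model

import "time"

// Bucket groups CIDs for batch processing; a zero UserID could denote a
// global bucket, a non-zero one a dedicated per-user bucket.
type Bucket struct {
	ID        uint   `gorm:"primarykey"`
	UUID      string `gorm:"index"`
	UserID    uint
	Size      int64
	CreatedAt time.Time
}

// BucketContent assigns a CID to a bucket.
type BucketContent struct {
	ID       uint   `gorm:"primarykey"`
	BucketID uint   `gorm:"index"`
	CID      string
}

// Assumed ~3.5 GB size trigger from the comment above.
const bucketSizeThreshold = int64(3_500_000_000)

// ShouldProcess fires on either trigger: the size threshold is reached, or
// the bucket has aged past the daily cron interval.
func (b *Bucket) ShouldProcess(now time.Time) bool {
	return b.Size >= bucketSizeThreshold || now.Sub(b.CreatedAt) >= 24*time.Hour
}
```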

@neelvirdy
Contributor Author

I was thinking we could add a new buckets table, which would have a bucket UUID; every CID is then assigned to one of these buckets, which can either be a dedicated bucket (per user) or a global bucket.

The bucket is then processed like a batch job with different trigger parameters: 1 - size of the bucket: if it reaches a threshold (3.5 GB), it gets processed. 2 - time: we can schedule it every day as a cron job.

Agreed, we may need a global staging zone concept, and a dedicated zones table may end up being necessary. IMO the latter would also clean up some confusion for new contributors around the contents table. But for now, I want to keep this PR to the minimal changeset required to address the in-memory state issue.

@neelvirdy
Contributor Author

Currently writing up a task to recompute staging zone sizes at startup and update the DB if they don't match what is stored in the aggregate content's row. This is a migration task and is required to move to tracking size incrementally in the DB, since staging zones will already exist when this gets deployed but their sizes will not have been accounted for by the incremental tracking.
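A hedged sketch of that recompute task, assuming a gorm.io/gorm handle and a contents table in which child contents reference their zone via an aggregated_in column; the table, column, and field names are assumptions for illustration, not the exact Estuary schema:

```go
package main

import "gorm.io/gorm"

// recomputeZoneSizes re-derives each staging zone's size from its member
// contents and writes the total back to the aggregate content's row when it
// does not match what is stored.
func recomputeZoneSizes(db *gorm.DB) error {
	type zoneSize struct {
		ZoneID uint
		Total  int64
	}

	var sizes []zoneSize
	// Sum member content sizes grouped by the aggregate they belong to.
	if err := db.Table("contents").
		Select("aggregated_in AS zone_id, SUM(size) AS total").
		Where("aggregated_in > 0").
		Group("aggregated_in").
		Scan(&sizes).Error; err != nil {
		return err
	}

	// Fix up any aggregate rows whose stored size disagrees with the total.
	for _, s := range sizes {
		if err := db.Table("contents").
			Where("id = ? AND size <> ?", s.ZoneID, s.Total).
			Update("size", s.Total).Error; err != nil {
			return err
		}
	}
	return nil
}
```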

(Resolved review threads on contentmgr/gc.go, contentmgr/replication.go, and contentmgr/pinning.go)
@neelvirdy neelvirdy requested a review from en0ma November 30, 2022 15:49
@neelvirdy neelvirdy requested a review from en0ma November 30, 2022 17:01
(Resolved review threads on contentmgr/gc.go, contentmgr/replication.go, and pinning.go)
@neelvirdy
Contributor Author

@en0ma I've validated that I can successfully consolidate and aggregate buckets across an API node and one shuttle. Currently the API node is not selectable as a consolidation destination in the code, so all the consolidations I tested went onto the shuttle. I included some restarts in the tests, tried adding data between consolidation and aggregation, and confirmed it re-consolidated and then eventually aggregated.

Contributor

@en0ma en0ma left a comment


LGTM

@alvin-reyes alvin-reyes merged commit ed97da8 into application-research:dev Dec 5, 2022
@neelvirdy neelvirdy deleted the nvirdy/start-db-buckets branch December 5, 2022 16:54