Speedup bigtable block upload by factor of 8-10x #24534
Conversation
Generally looks pretty good; thanks for digging into this! A couple small questions, and I'd like to play around with it a bit...
this line is sus: https://github.com/solana-labs/solana/blob/master/ledger/src/bigtable_upload.rs#L173 (using a blocking receiver and passing it to an async context?) cc @mvines
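For context, the concern is that calling a blocking recv() on a tokio worker thread can park the executor. Below is a minimal sketch (not the PR's actual fix) of one way to keep the blocking call off the async workers, assuming a std mpsc receiver; the actual channel type in bigtable_upload.rs may differ:

use std::sync::mpsc;

// Receive one item without tying up a tokio worker thread: the blocking
// recv() runs on tokio's dedicated blocking pool instead.
async fn recv_async<T: Send + 'static>(rx: mpsc::Receiver<T>) -> (Option<T>, mpsc::Receiver<T>) {
    tokio::task::spawn_blocking(move || {
        let item = rx.recv().ok(); // blocks here, but not on an async worker
        (item, rx)                 // hand the receiver back for the next call
    })
    .await
    .expect("blocking recv task panicked")
}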
Codecov Report
@@            Coverage Diff            @@
##           master   #24534     +/-   ##
=========================================
- Coverage    82.1%    82.0%     -0.1%
=========================================
  Files         612      610        -2
  Lines      168534   168373      -161
=========================================
- Hits       138396   138152      -244
- Misses      30138    30221       +83
What kind of upload speeds are you seeing with this changeset?
Why did you change your mind about the blockstore threads?
i think i was seeing ~200mbit/s, but can't remember specifics. i can test it again by disabling bigtable uploads on our node, running for ~1-2 hours, stopping it, then using the ledger-tool upload command. lmk if that's something you're interested in! we're running on a 25gbps server so we have plenty of bandwidth left over. if we could have a threadpool that's constantly popping items off a queue and running, i think we could get that number way up (as opposed to running NUM_PARALLEL uploads and waiting for them all to complete before popping more off).
@t-nelson mentioned something about number of CPUs. I feel like that's probably better, but tbh don't have any super strong feelings about it. more is probably better tho, esp. if you're trying to catch up! during normal operation i'm seeing it keep up just fine, uploading 2-3 blocks at a time at the tip
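A hedged sketch of the threadpool-vs-batches difference described above, not the PR's actual code: upload_block is a made-up stand-in for the real bigtable write, and the streaming variant uses buffer_unordered to keep a fixed number of uploads in flight at all times instead of waiting on whole batches:

use futures::stream::{self, StreamExt};

// Stand-in for the real per-block bigtable upload.
async fn upload_block(slot: u64) -> Result<usize, String> {
    Ok(slot as usize)
}

// Batched: start NUM_PARALLEL uploads, wait for the slowest, then repeat.
async fn upload_batched(slots: Vec<u64>, num_parallel: usize) {
    for chunk in slots.chunks(num_parallel) {
        let batch: Vec<_> = chunk.iter().map(|&slot| upload_block(slot)).collect();
        futures::future::join_all(batch).await;
    }
}

// Streaming: keep num_parallel uploads in flight continuously, starting a
// new one as soon as any finishes.
async fn upload_streaming(slots: Vec<u64>, num_parallel: usize) {
    stream::iter(slots)
        .map(upload_block)
        .buffer_unordered(num_parallel)
        .collect::<Vec<_>>()
        .await;
}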
just wrote a script to check our bigtable highest slot against yours:
[slot-lag comparison output omitted; one run per build: git commit 00c5ec9 (num_cpus: 48), git commit 62a40f1, git commit 0d797e2 (master)]
in summary, seems like uploading blocks is network + cpu bound. haven't profiled, but guessing that compressing each item that gets uploaded to bigtable 3 different ways is expensive 😆
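To illustrate the cost being guessed at, here is a simplified, hypothetical version of the "compress several ways, keep the smallest" pattern. Only gzip and no-compression are shown (the real storage code may try other codecs), and compress_best is an illustrative name, not necessarily the crate's API:

use flate2::{write::GzEncoder, Compression};
use std::io::Write;

fn gzip(data: &[u8]) -> Vec<u8> {
    let mut encoder = GzEncoder::new(Vec::new(), Compression::default());
    encoder.write_all(data).expect("gzip write failed");
    encoder.finish().expect("gzip finish failed")
}

// Every candidate is fully computed even though only one is kept, which is
// where the extra CPU time per uploaded item goes.
fn compress_best(data: &[u8]) -> Vec<u8> {
    vec![data.to_vec(), gzip(data)]
        .into_iter()
        .min_by_key(|candidate| candidate.len())
        .unwrap()
}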
force-pushed from 72df590 to 92ce23b
@CriesofCarrots are you running this on your warehouse nodes now? looks like they're caught up :)
No, I'm not sure why we're caught up suddenly 🤷♀️ (although Joe might know)
all good! feel free to ping me here or discord if you need anything! just rebased!
Hey @buffalu , sorry for the delay here. I'm hoping to wrap up my changes in this area today/tomorrow. I played around with some of your changes on top, and it seems like the new spawns do make a huge difference. But reading the blocks from Blockstore was never a limiting factor on our warehouse nodes, so I think I'd like to first try a smaller changeset here without the multiple Blockstore threads. What do you think?
from what i remember i do think i was seeing blockstore reads take ~100ms per block, so it might be worth checking to make sure it's not going to be a limiting factor if someone wants to crank it up more (i.e. using ledger-tool / a custom number of cpus in the config).
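A rough sketch of the multiple-read-threads idea under discussion, with made-up names and a stubbed block fetch standing in for the real Blockstore call; it only illustrates the shape of the change, not the PR's code:

use std::sync::{mpsc, Arc, Mutex};
use std::thread;

// Several reader threads pull slots from a shared work queue, fetch the
// block (stubbed out here), and push it onto a channel the uploader drains.
fn spawn_readers(slots: Vec<u64>, num_threads: usize) -> mpsc::Receiver<(u64, Vec<u8>)> {
    let (block_sender, block_receiver) = mpsc::channel();
    let work_queue = Arc::new(Mutex::new(slots));

    for _ in 0..num_threads {
        let block_sender = block_sender.clone();
        let work_queue = Arc::clone(&work_queue);
        thread::spawn(move || loop {
            let maybe_slot = work_queue.lock().unwrap().pop();
            let slot = match maybe_slot {
                Some(slot) => slot,
                None => break, // queue drained; this reader is done
            };
            // Stand-in for the real blockstore read, which the comment above
            // pegs at roughly 100ms per block.
            let block_bytes = vec![0u8; 1024];
            if block_sender.send((slot, block_bytes)).is_err() {
                break; // uploader hung up
            }
        });
    }
    block_receiver
}

The uploader side can then drain block_receiver while the reads proceed in parallel, so it is no longer idle waiting on a single reader.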
@buffalu , I'm finally done mucking around in the bigtable_upload files. Are you still willing to rebase this?
yeah! gimme a few hours, will get to later after meetings 😢 |
@CriesofCarrots can you elaborate on what you mean by "It's never the bottleneck on our nodes, but just talked to a partner blocked by it."? are they running with the new tokio::spawn but a single blockstore thread, or is bigtable upload just slow in general?
They are running with vanilla v1.9 (don't recall which patch), and it was clear from the timings that the upload thread was sitting idle waiting for blockstore reads
force-pushed from 92ce23b to eb95861
ok should be good. caught a potential unwrap error + fixed that. lmk if anything else sticks out
I started reviewing this, but it looks like we lost your changes to use num_cpus as the basis for num_blocks_to_upload_in_parallel. Can you restore that? 🙏
yeah, assumed we wanted to use your config already. do you think it makes sense for num blockstore threads = num sending threads = num_blocks_to_upload_in_parallel?
Oh gotcha. Sorry if that was confusing. I just set up relative values.
Yes, that makes sense to me
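A small, hedged sketch of what tying all three knobs to one machine-derived value could look like; the names are illustrative and not the config fields actually used in the PR:

// num_cpus is a real crate; everything else here is illustrative.
fn upload_parallelism() -> (usize, usize, usize) {
    let cpus = num_cpus::get().max(1);
    // One value drives all three: blockstore reader threads, sending tasks,
    // and the number of blocks uploaded in parallel.
    let num_blocks_to_upload_in_parallel = cpus;
    let blockstore_read_threads = num_blocks_to_upload_in_parallel;
    let sending_tasks = num_blocks_to_upload_in_parallel;
    (blockstore_read_threads, sending_tasks, num_blocks_to_upload_in_parallel)
}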
f1bbebe
to
a29f94e
Compare
this prob doesn't make sense for this PR, but we also noticed that using separate LedgerStorage instances can massively speed things up too. we can get ~500-1k blocks per second reading with this change across 64 threads and 500-slot request sizes. cloning causes them to use the same channel and TCP connection; if you create a new instance for each thread, they'd each have their own connection + channel
Yeah, let's look at that separately
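For illustration only, a hedged sketch of the clone-one-instance vs. new-instance-per-task distinction mentioned above. LedgerStorage here is a local placeholder with made-up methods, not the real solana_storage_bigtable type or its constructor:

use std::sync::Arc;

// Placeholder standing in for the real LedgerStorage; in the real crate,
// constructing a fresh instance opens its own channel/TCP connection.
#[derive(Clone)]
struct LedgerStorage;

impl LedgerStorage {
    async fn connect() -> Self {
        LedgerStorage
    }
    async fn get_blocks(&self, _start_slot: u64, _limit: usize) -> Vec<u64> {
        Vec::new()
    }
}

// Sharing one clone: every task funnels requests through one connection.
async fn fetch_shared(storage: Arc<LedgerStorage>, ranges: Vec<(u64, usize)>) {
    let tasks: Vec<_> = ranges
        .into_iter()
        .map(|(start, limit)| {
            let storage = Arc::clone(&storage);
            tokio::spawn(async move { storage.get_blocks(start, limit).await })
        })
        .collect();
    futures::future::join_all(tasks).await;
}

// One instance per task: each task gets its own connection, which is the
// setup the ~500-1k blocks/s read rate above refers to.
async fn fetch_per_task(ranges: Vec<(u64, usize)>) {
    let tasks: Vec<_> = ranges
        .into_iter()
        .map(|(start, limit)| {
            tokio::spawn(async move {
                let storage = LedgerStorage::connect().await;
                storage.get_blocks(start, limit).await
            })
        })
        .collect();
    futures::future::join_all(tasks).await;
}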
storage-bigtable/src/lib.rs (outdated)
    let results = futures::future::join_all(tasks).await;
    let results: Vec<_> = results.into_iter().map(|r| r.unwrap()).collect();
Let's map the tokio Error to something, just to be safe. Can you add an Error variant?
Also, can we rewrite this whole block to only iterate once?
Something like:
let mut bytes_written = 0;
let mut maybe_first_err: Option<Error> = None;
for result in results {
    match result {
        Err(err) => {
            if maybe_first_err.is_none() {
                maybe_first_err = Some(Error::TokioError(err));
            }
        }
        Ok(Err(err)) => {
            if maybe_first_err.is_none() {
                maybe_first_err = Some(Error::BigTableError(err));
            }
        }
        Ok(Ok(bytes)) => {
            bytes_written += bytes;
        }
    }
}
if let Some(err) = maybe_first_err {
    return Err(err);
}
One additional request: when you pin
I have this running on two testnet warehouse nodes, and it's working great! I noticed a couple logging things (see comments).
Otherwise, just the Cargo.lock reverts, and I think that's it from me!
Added multiple blockstore read threads. Run the bigtable upload in tokio::spawn context. Run bigtable tx and tx-by-addr uploads in tokio::spawn context.
force-pushed from 5e2042f to 6a299d2
lgtm. just some nits
Thanks for all the iterations on this @buffalu !
Merging on red; downstream-anchor-projects has been removed on master (saving CI the load of rebasing this)
…25278) * Speedup bigtable block upload by factor of 8-10x (#24534)

Added multiple blockstore read threads. Run the bigtable upload in tokio::spawn context. Run bigtable tx and tx-by-addr uploads in tokio::spawn context.

(cherry picked from commit 6bcadc7)

# Conflicts:
#	Cargo.lock
#	programs/bpf/Cargo.lock
#	storage-bigtable/Cargo.toml

* Fix conflicts

Co-authored-by: buffalu <85544055+buffalu@users.noreply.github.com>
Co-authored-by: Tyera Eulberg <tyera@solana.com>
Problem
We were seeing our bigtable block upload running behind the tip by several thousand blocks. We noticed it was uploading only about one block every ~500ms.
Summary of Changes
Added multiple blockstore read threads.
Run the bigtable upload in tokio::spawn context.
Run bigtable tx and tx-by-addr uploads in tokio::spawn context.
We're seeing around 50-60ms per block upload now, so we should be able to easily catch up and maintain bigtable state at the tip.
I think there's more to squeeze out, but I'm curious what you guys think about this before I go down the next optimization rabbit hole.
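For readers skimming the summary, a hedged sketch of the overall shape of the change: every name below (put_block, put_tx_rows, put_tx_by_addr_rows, ConfirmedBlock) is a stand-in, not the actual code in bigtable_upload.rs; it only shows the tokio::spawn structure described above:

use futures::future::join_all;

// Stand-ins for the real block payload and bigtable put calls.
struct ConfirmedBlock;

async fn put_block(_slot: u64, _block: &ConfirmedBlock) -> usize {
    0
}
async fn put_tx_rows(_slot: u64) -> usize {
    0
}
async fn put_tx_by_addr_rows(_slot: u64) -> usize {
    0
}

// Upload one block: the tx and tx-by-addr writes run as their own spawned
// tasks instead of sequentially after the block write.
async fn upload_one(slot: u64, block: ConfirmedBlock) -> usize {
    let tx_task = tokio::spawn(put_tx_rows(slot));
    let tx_by_addr_task = tokio::spawn(put_tx_by_addr_rows(slot));
    let block_bytes = put_block(slot, &block).await;
    block_bytes + tx_task.await.unwrap() + tx_by_addr_task.await.unwrap()
}

// Upload a batch: each block gets its own tokio::spawn, so uploads overlap
// instead of completing roughly one every 500ms.
async fn upload_batch(blocks: Vec<(u64, ConfirmedBlock)>) -> usize {
    let tasks: Vec<_> = blocks
        .into_iter()
        .map(|(slot, block)| tokio::spawn(upload_one(slot, block)))
        .collect();
    join_all(tasks).await.into_iter().map(|r| r.unwrap()).sum()
}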