Implement delayed chunks application for Stateless Validation #9982

pugachAG · 2023-10-20T09:39:28Z

See the doc for more context.

This includes delaying the application of a block's chunks until the next block is processed or a new chunk needs to be produced.
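The idea can be illustrated with a toy model. This is a minimal sketch, not nearcore code: `ToyBlock`, `ToyChain`, and `apply_chunks_if_needed` are hypothetical names, and "applying" a chunk is reduced to summing numbers. It only shows the ordering: processing a block triggers application of the parent block's chunks, and the same step can be forced before producing a new chunk.

```rust
use std::collections::HashMap;

// Hypothetical, simplified stand-ins; these are NOT the nearcore types.
type BlockHash = u64;

#[derive(Debug, Clone, PartialEq)]
struct ToyBlock {
    hash: BlockHash,
    prev_hash: BlockHash,
    /// One payload per chunk; "applying" a chunk just adds it to a sum.
    chunks: Vec<u64>,
}

#[derive(Default)]
struct ToyChain {
    blocks: HashMap<BlockHash, ToyBlock>,
    /// Execution results, keyed by the block whose chunks were applied.
    results: HashMap<BlockHash, u64>,
}

impl ToyChain {
    /// Stores the block and applies the chunks of its parent, not its own.
    fn process_block(&mut self, block: ToyBlock) {
        let prev_hash = block.prev_hash;
        self.blocks.insert(block.hash, block);
        self.apply_chunks_if_needed(prev_hash);
    }

    /// Applies the chunks of `hash` unless that was already done.
    /// Would also be called before producing a new chunk on top of `hash`.
    fn apply_chunks_if_needed(&mut self, hash: BlockHash) {
        if self.results.contains_key(&hash) {
            return;
        }
        if let Some(block) = self.blocks.get(&hash) {
            let result: u64 = block.chunks.iter().sum();
            self.results.insert(hash, result);
        }
    }
}
```

In this model, results for a block only exist once its child has been processed or `apply_chunks_if_needed` has been called explicitly, which is why tests that read execution results right after processing a block need one more block.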

Longarithm · 2023-10-30T09:06:05Z

It seemed to me that I could achieve all three sub-goals from the Miro board for #9679 by applying the right modifications to `get_apply_chunk_job_new_chunk`. Unfortunately, all my attempts ended with a couple of integration tests failing for different reasons, including:

* `ChunkExtra` needing to be stored for the block whose chunk was executed
* misalignment with staking tx processing in the epoch manager

So I decided to go back to the previous non-async approach and iterate from it more slowly. A pair of fixes has already brought the number of nayduck failures down to "just" 60. I plan to reduce it further and then apply chunks asynchronously again.

For stateless validation, chunk execution will be delayed until the next block is processed: #9982. This impacts several tests that assume block processing includes chunk processing as well. For these tests, we need to produce and process one more block to get execution results like ChunkExtra and ExecutionOutcome.

To allow production of longer forks, I want to extend the client API with `produce_block_on`, which can produce a block not just on top of the head but on top of any existing block. As this block isn't immediately saved or processed, it doesn't even break any guarantees.

## Testing

Impacted tests should still pass. Later stateless validation PRs will rely on it.

---------

Co-authored-by: Longarithm <the.aleksandr.logunov@gmail.com>
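A rough sketch of what "produce a block on top of any existing block" means, using a toy client. Only the name `produce_block_on` comes from the description above. Its signature here, along with `ToyClient`, `ToyBlock`, and the integer hashes, is an assumption made for illustration and does not reflect the real test client.

```rust
use std::collections::HashMap;

// Hypothetical toy types; the real client and block types differ.
#[derive(Debug, Clone, PartialEq)]
struct ToyBlock {
    hash: u64,
    prev_hash: u64,
    height: u64,
}

struct ToyClient {
    blocks: HashMap<u64, ToyBlock>,
    head: u64,
    next_hash: u64,
}

impl ToyClient {
    fn new() -> Self {
        let genesis = ToyBlock { hash: 0, prev_hash: 0, height: 0 };
        ToyClient { blocks: HashMap::from([(0, genesis)]), head: 0, next_hash: 1 }
    }

    /// Builds a block on top of any known block.
    /// The block is returned but neither stored nor processed.
    fn produce_block_on(&mut self, prev_hash: u64) -> Option<ToyBlock> {
        let prev = self.blocks.get(&prev_hash)?;
        let block = ToyBlock { hash: self.next_hash, prev_hash, height: prev.height + 1 };
        self.next_hash += 1;
        Some(block)
    }

    /// The usual case: build on top of the current head.
    fn produce_block(&mut self) -> Option<ToyBlock> {
        self.produce_block_on(self.head)
    }

    /// Stores the block and moves the head if the block is strictly higher.
    fn process_block(&mut self, block: ToyBlock) {
        if block.height > self.blocks[&self.head].height {
            self.head = block.hash;
        }
        self.blocks.insert(block.hash, block);
    }
}
```

With such a method, a test can build a fork from an older block, and because the produced block is not stored until `process_block` is called, producing it leaves the client state untouched apart from the hash counter in this toy.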

Longarithm · 2023-11-10T15:30:36Z

Latest update: https://near.zulipchat.com/#narrow/stream/407237-pagoda.2Fcore.2Fstateless-validation/topic/delaying.20chunk.20execution/near/401389927

This is the next step for #9982. Here I introduce jobs which perform stateless validation of a newly received chunk by executing txs and receipts. Later they should be executed against the state witness, but for now I just lay the foundation by running these jobs against state data in the DB. All passing tests verify that old and new jobs generate the same result. The final switch will happen when stateful jobs are replaced with stateless ones.

### Details

This doesn't introduce any load on the stable version. On the nightly version there will be `num_shards` extra jobs which check that stateless validation results are consistent with stateful execution. But as we use nightly only for testing, it shouldn't mean much overhead.

I add more fields to the `ShardContext` structure to simplify the code. Some of them are needed to break early if there is resharding, and the logic is the same for both kinds of jobs.

`StorageDataSource::DbTrieOnly` is introduced to read data only from the trie in stateless jobs. This is annoying but still needed if there are a lot of missing chunks and the flat storage head has moved above the block at which the previous chunk was created. When the state witness is implemented, `Recorded` will be used instead.

## Testing

* Failure to update current_chunk_extra on the way leads to >20 tests failing in process_blocks, with errors like `assertion `left == right` failed: For stateless validation, chunk extras for block CMV88CBcnKoxa7eTnkG64psLoJzpW9JeAhFrZBVv6zDc and shard s3.v2 do not match...`
* If I update current_chunk_extra only once, `tests::client::resharding::test_latest_protocol_missing_chunks_high_missing_prob` fails, which was specifically introduced for that. Actually this helped me realize that `validate_chunk_with_chunk_extra` is still needed, but I will introduce it later.
* Nayduck: ~https://nayduck.near.org/#/run/3293 - +10 nightly tests failing, will take a look~ https://nayduck.near.org/#/run/3300

---------

Co-authored-by: Longarithm <the.aleksandr.logunov@gmail.com>
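The "run both jobs and compare" pattern described above could look roughly like the sketch below. Everything here is a simplified assumption: `ToyChunkExtra`, `apply_chunk`, `apply_and_cross_check`, and the `shadow_validation` flag are hypothetical, and the local `StorageDataSource` enum is a stand-in that borrows only the `DbTrieOnly` name from the description (the `Db` variant is assumed for illustration). In the toy both data sources yield the same result by construction; the point is only the consistency check.

```rust
// Hypothetical stand-in for the data source selector; `Db` is an assumed name.
#[derive(Debug, Clone, Copy, PartialEq)]
enum StorageDataSource {
    Db,
    DbTrieOnly,
}

// Hypothetical, heavily reduced execution result.
#[derive(Debug, Clone, PartialEq)]
struct ToyChunkExtra {
    state_root: u64,
    gas_used: u64,
}

/// Toy "execution": folds receipts into a state root. The data source is
/// ignored here because the toy has no flat storage or trie to choose from.
fn apply_chunk(prev_state_root: u64, receipts: &[u64], _source: StorageDataSource) -> ToyChunkExtra {
    let state_root = receipts
        .iter()
        .fold(prev_state_root, |root, r| root.wrapping_mul(31).wrapping_add(*r));
    ToyChunkExtra { state_root, gas_used: receipts.len() as u64 }
}

/// Runs the stateful job and, when `shadow_validation` is on (think: nightly),
/// an extra trie-only job whose result must match.
fn apply_and_cross_check(
    prev_state_root: u64,
    receipts: &[u64],
    shadow_validation: bool,
) -> Result<ToyChunkExtra, String> {
    let stateful = apply_chunk(prev_state_root, receipts, StorageDataSource::Db);
    if shadow_validation {
        let stateless = apply_chunk(prev_state_root, receipts, StorageDataSource::DbTrieOnly);
        if stateless != stateful {
            return Err(format!("chunk extras do not match: {:?} vs {:?}", stateless, stateful));
        }
    }
    Ok(stateful)
}
```

The extra job costs one more execution per shard only when the flag is set, which mirrors the claim that stable builds see no additional load.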

Longarithm · 2024-02-02T20:19:11Z

Quick update: we are no longer fully doing this in the scope of the SV mainnet release.
The main reason is that applying old chunks is more problematic than we thought.
We still want to do it, but the fix must include all of the following together:

* replacing the "apply old chunks" concept with "update validator accounts for empty chunk range"
* when a new chunk appears in a block, it should fix the set of incoming receipts to be executed, which will span from the previous chunk, inclusive, to the new chunk, exclusive. Currently we take the "previous" range
* to be able to apply chunks eagerly, it should be possible to reapply an old chunk every time a new chunk is missing, because the set of receipts to execute changes

IIRC this is valuable because we won't have to apply receipts just to perform a state sync, and the protocol logic becomes MUCH more consistent.
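The receipt range from the second requirement above can be sketched on a single linear chain. This is one reading of the comment, not the actual protocol logic: the function name `receipt_range_for_new_chunk` and the `has_new_chunk` slice (one flag per block height for a single shard) are hypothetical, and forks are ignored.

```rust
use std::ops::Range;

/// `has_new_chunk[h]` tells whether the block at height `h` includes a new
/// chunk for the shard (false means the chunk is missing at that height).
/// Returns the heights whose incoming receipts the new chunk at
/// `new_chunk_height` would execute: from the previous chunk, inclusive,
/// to the new chunk, exclusive.
fn receipt_range_for_new_chunk(
    has_new_chunk: &[bool],
    new_chunk_height: usize,
) -> Option<Range<usize>> {
    // There must actually be a new chunk at the requested height.
    if !*has_new_chunk.get(new_chunk_height)? {
        return None;
    }
    // Walk back to the closest earlier height that has a new chunk.
    let prev_chunk_height = (0..new_chunk_height).rev().find(|&h| has_new_chunk[h])?;
    Some(prev_chunk_height..new_chunk_height)
}
```

For chunk presence `[true, false, false, true, true]`, the chunk at height 3 would cover heights 0 through 2 and the chunk at height 4 would cover height 3 only. The range is fixed only once the new chunk appears, which is why eager application of an old chunk would have to be redone each time another chunk turns out to be missing.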

pugachAG added A-chain Area: Chain, client & related Near Core labels Oct 20, 2023

pugachAG assigned Longarithm Oct 20, 2023

This was referenced Oct 20, 2023

Implement simple state witness (state transitions with proof) alongside chunk production as well as validation logic #9628

Closed

🔷 Prototype stateless validation #9292

Closed

Longarithm mentioned this issue Nov 3, 2023

test: allow producing block on top of arbitrary prev block #10093

Merged

github-actions bot mentioned this issue Nov 17, 2023

Monthly issue metrics report #10208

Open

Longarithm mentioned this issue Nov 24, 2023

feat: stateless validation jobs in test mode #10248

Merged

walnut-the-cat mentioned this issue Nov 29, 2023

[ProjectTracking]: Stateless validation MVP near/near-one-project-tracking#5

Closed

23 tasks

This was referenced Feb 2, 2024

[ProjectTracking]: Stateless validation Mainnet Release near/near-one-project-tracking#46

Open

[stateless_validation] Solve 2x Latency Problem #10584

Open

tayfunelmas added the A-stateless-validation Area: stateless validation label Apr 1, 2024
