replay: do not start leader for a block we already have shreds for #2416

AshwinSekar · 2024-08-02T18:29:48Z

Problem

In certain scenarios where the first leader block is not produced but the second (or later) leader block is, we can end up producing that block a second time after resetting to a previous block.

Summary of Changes

When poh_recorder checks for leader slot, additionally check blockstore to see if shreds have already been inserted.
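A minimal sketch of the idea, not the code in this PR: `MockBlockstore`, `has_existing_shreds_for_slot`, and `should_start_leader` are hypothetical names chosen for illustration, and the real blockstore API may differ. It assumes the blockstore read happens only after the tick height check has passed, as the commit list describes.

```rust
use std::collections::HashSet;

type Slot = u64;

/// Hypothetical stand-in for the blockstore: only tracks which slots
/// already have at least one shred inserted.
struct MockBlockstore {
    slots_with_shreds: HashSet<Slot>,
}

impl MockBlockstore {
    /// Assumed helper name; returns true if any shred exists for `slot`.
    fn has_existing_shreds_for_slot(&self, slot: Slot) -> bool {
        self.slots_with_shreds.contains(&slot)
    }
}

/// Decide whether to start producing a block for `slot`.
/// The cheap tick height check comes first, the blockstore read second.
fn should_start_leader(
    blockstore: &MockBlockstore,
    slot: Slot,
    reached_leader_tick: bool,
) -> bool {
    if !reached_leader_tick {
        return false;
    }
    // Never start a block we already have shreds for.
    !blockstore.has_existing_shreds_for_slot(slot)
}

fn main() {
    // Leader window 8..=11: slot 8 was skipped, slot 9 was produced.
    let blockstore = MockBlockstore {
        slots_with_shreds: HashSet::from([9]),
    };

    // A check on only the first slot of the window finds no shreds...
    assert!(!blockstore.has_existing_shreds_for_slot(8));
    // ...but the per-slot check refuses to reproduce slot 9.
    assert!(!should_start_leader(&blockstore, 9, true));
    // Slot 10 has no shreds, so leading it is still allowed.
    assert!(should_start_leader(&blockstore, 10, true));
}
```

The example window (slots 8 to 11) is made up; it only illustrates why checking the first slot of the window alone would miss shreds in a later slot.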

core/src/replay_stage.rs

poh/src/poh_recorder.rs

AshwinSekar · 2024-08-02T21:43:33Z

I'll address leader_schedule_cache::next_leader_slot in a follow-up PR once we decide on a direction. It sounds like we want this blockstore check here regardless.

core/src/replay_stage.rs

ledger/src/blockstore.rs

mergify · 2024-08-08T02:46:57Z

Backports to the beta branch are to be avoided unless absolutely necessary for fixing bugs, security issues, and perf regressions. Changes intended for backport should be structured such that a minimum effective diff can be committed separately from any refactoring, plumbing, cleanup, etc that are not strictly necessary to achieve the goal. Any of the latter should go only into master and ride the normal stabilization schedule. Exceptions include CI/metrics changes, CLI improvements and documentation updates on a case by case basis.

…2416)
* replay: do not start leader for a block we already have shreds for
* pr feedback: comment, move existing check to blockstore fn
* move blockstore read after tick height check
* pr feedback: resuse blockstore fn in next_leader_slot
(cherry picked from commit 15dbe7f)
# Conflicts:
# poh/src/poh_recorder.rs

jstarry · 2024-08-08T08:32:18Z

poh/src/poh_recorder.rs

+            // as it was not part of the rooted fork. If this slot is not the first slot for this leader,
+            // and the first slot was previously ticked over, the check in `leader_schedule_cache::next_leader_slot`
+            // will not suffice, as it only checks if there are shreds for the first slot.


Interestingly enough, fn next_leader_slot returns (start_slot, last_slot), and last_slot is used to calculate leader_last_tick_height in PohRecorder, but we only use leader_last_tick_height inside would_be_leader. Maybe reached_leader_tick needs to use it as well?

AshwinSekar · 2024-08-08

It could be an option; it would allow us to remove the check here for when we tick into the next leader:

agave/core/src/replay_stage.rs
Lines 2073 to 2075 in c99095d

    // I guess I missed my slot
    if next_leader != *my_pubkey {
        return false;
However, I think if we use it in reached_leader_tick, we should recompute our next leader window. Otherwise, if we don't reset in time, we could end up missing our next leader window, which the current code prevents.
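To make the trade-off concrete, here is a rough sketch of what bounding the leader check by the end of the window, and then recomputing the window, could look like. Everything here is hypothetical: `LeaderWindow`, `WindowStatus`, `window_status`, and `recompute_window` are illustrative names, and the tick convention (a window spans ticks from `start_slot * ticks_per_slot` up to, but not including, `(last_slot + 1) * ticks_per_slot`) is a simplifying assumption rather than the exact arithmetic in PohRecorder.

```rust
type Slot = u64;

/// A leader window as the (start_slot, last_slot) pair from the discussion.
#[derive(Debug, Clone, Copy, PartialEq)]
struct LeaderWindow {
    start_slot: Slot,
    last_slot: Slot,
}

#[derive(Debug, PartialEq)]
enum WindowStatus {
    NotYet,
    InWindow,
    Missed,
}

/// Classify `tick_height` against the window, using both bounds.
fn window_status(tick_height: u64, w: LeaderWindow, ticks_per_slot: u64) -> WindowStatus {
    let first_tick = w.start_slot * ticks_per_slot;
    let last_tick = (w.last_slot + 1) * ticks_per_slot; // exclusive upper bound
    if tick_height < first_tick {
        WindowStatus::NotYet
    } else if tick_height < last_tick {
        WindowStatus::InWindow
    } else {
        WindowStatus::Missed
    }
}

/// Pick the first scheduled window that has not fully elapsed.
fn recompute_window(
    tick_height: u64,
    schedule: &[LeaderWindow],
    ticks_per_slot: u64,
) -> Option<LeaderWindow> {
    schedule
        .iter()
        .copied()
        .find(|w| window_status(tick_height, *w, ticks_per_slot) != WindowStatus::Missed)
}

fn main() {
    let ticks_per_slot = 64;
    let schedule = [
        LeaderWindow { start_slot: 8, last_slot: 11 },
        LeaderWindow { start_slot: 40, last_slot: 43 },
    ];
    let current = schedule[0];

    // Inside the window we may lead.
    assert_eq!(window_status(8 * 64, current, ticks_per_slot), WindowStatus::InWindow);

    // We ticked past slot 11 without a reset: the window is over.
    let tick_height = 12 * 64;
    assert_eq!(window_status(tick_height, current, ticks_per_slot), WindowStatus::Missed);

    // Without recomputing, we would keep waiting on a stale window and
    // could miss the next one; recomputing moves us to slots 40..=43.
    assert_eq!(recompute_window(tick_height, &schedule, ticks_per_slot), Some(schedule[1]));
}
```

The schedule values are invented for the example. The point is only that once the upper bound is enforced, a stale window has to be replaced explicitly, which is the recomputation step suggested above.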

…for (backport of #2416) (#2484)
* replay: do not start leader for a block we already have shreds for (#2416)
* replay: do not start leader for a block we already have shreds for
* pr feedback: comment, move existing check to blockstore fn
* move blockstore read after tick height check
* pr feedback: resuse blockstore fn in next_leader_slot
(cherry picked from commit 15dbe7f)
# Conflicts:
# poh/src/poh_recorder.rs
* fix conflicts
---------
Co-authored-by: Ashwin Sekar <ashwin@anza.xyz>
Co-authored-by: Ashwin Sekar <ashwin@solana.com>


carllin reviewed Aug 2, 2024

core/src/replay_stage.rs (outdated, resolved)

poh/src/poh_recorder.rs (outdated, resolved)

AshwinSekar requested review from bw-solana and steviez August 2, 2024 21:36

AshwinSekar requested a review from carllin August 5, 2024 15:42

AshwinSekar mentioned this pull request Aug 5, 2024

leader: do not perform blockstore check in next_leader_slot #2445

Closed

bw-solana reviewed Aug 5, 2024

core/src/replay_stage.rs (resolved)

carllin previously approved these changes Aug 5, 2024


AshwinSekar added 3 commits August 5, 2024 23:20

replay: do not start leader for a block we already have shreds for

0332355

pr feedback: comment, move existing check to blockstore fn

3f54bc6

move blockstore read after tick height check

3928df1

AshwinSekar dismissed carllin’s stale review via 3928df1 August 5, 2024 23:23

AshwinSekar force-pushed the start-leader-check-blockstore branch from 4da42a2 to 3928df1 Compare August 5, 2024 23:23

bw-solana reviewed Aug 6, 2024

ledger/src/blockstore.rs (resolved)

pr feedback: resuse blockstore fn in next_leader_slot

3693a12

AshwinSekar requested review from bw-solana and carllin August 6, 2024 20:48

carllin approved these changes Aug 6, 2024


AshwinSekar merged commit 15dbe7f into anza-xyz:master Aug 6, 2024
40 checks passed

AshwinSekar deleted the start-leader-check-blockstore branch August 6, 2024 21:37

AshwinSekar added the v2.0 Backport to v2.0 branch label Aug 8, 2024

mergify bot mentioned this pull request Aug 8, 2024

v2.0: replay: do not start leader for a block we already have shreds for (backport of #2416) #2484

Merged

jstarry self-requested a review August 8, 2024 06:05

jstarry reviewed Aug 8, 2024
