[Segment Replication] Checkpoint Replay on Replica Shard #3658
Conversation
server/src/main/java/org/opensearch/indices/replication/SegmentReplicationTargetService.java
// if we received a checkpoint during the copy event that is ahead of this,
// try and process it.
if (getLatestReceivedCheckpoint(replicaShard.shardId()).isAheadOf(replicaShard.getLatestReplicationCheckpoint())) {
    onNewCheckpoint(getLatestReceivedCheckpoint(replicaShard.shardId()), replicaShard);
}
I'm a little concerned that this recursive call could lead to a long-running thread and/or a stack overflow. As a follow-up to this PR, we should look into wrapping checkpoint processing in a Runnable so we can execute it asynchronously from the new-checkpoint notification.
cc @mch2
Any reason not to do this now, @Rishikesh1159 @kartg? We had a similar implementation in the feature branch: https://github.com/opensearch-project/OpenSearch/blob/feature/segment-replication/server/src/main/java/org/opensearch/index/shard/IndexShard.java#L3683
Ah, I didn't realize the POC had the threading in place. Yes, if we have prior art, we can pull it into this PR.
QQ: Do newer checkpoints supersede older checkpoints? What happens when replication is slow and backlogged with too many checkpoints to process?
Sure, I wrapped checkpoint processing in a Runnable in the new commit.
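For illustration, a minimal sketch of that wrapping (the threadPool field and the helper names are assumptions for the example, not the exact service code):

// Hypothetical sketch: process a received checkpoint off the transport thread
// that delivered the publish-checkpoint notification.
public void onNewCheckpoint(final ReplicationCheckpoint receivedCheckpoint, final IndexShard replicaShard) {
    updateLatestReceivedCheckpoint(receivedCheckpoint, replicaShard); // assumed helper, see below
    // Fork to the generic threadpool so a replay triggered from a completed
    // replication round does not recurse on the caller's stack.
    threadPool.generic().execute(() -> {
        if (replicaShard.shouldProcessCheckpoint(receivedCheckpoint)) {
            startReplication(receivedCheckpoint, replicaShard); // assumed helper
        }
    });
}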
@Bukhtawar we only store the single latest received checkpoint and replace it whenever a new checkpoint arrives; we don't store all received checkpoints on the replica shard. So even when replication is slow, we won't be backlogged with too many checkpoints, because we only ever process one checkpoint (the latest received one).
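As a sketch of that bookkeeping (field and method names here are illustrative; isAheadOf is the same comparison used in the diff above):

// One entry per shard: the newest checkpoint seen so far, replaced in place.
// Older checkpoints are superseded, so a slow replica never accumulates a
// backlog of checkpoints to process.
private final Map<ShardId, ReplicationCheckpoint> latestReceivedCheckpoint = ConcurrentCollections.newConcurrentMap();

private void updateLatestReceivedCheckpoint(ReplicationCheckpoint received, IndexShard replicaShard) {
    latestReceivedCheckpoint.merge(
        replicaShard.shardId(),
        received,
        (current, incoming) -> incoming.isAheadOf(current) ? incoming : current
    );
}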
Thanks, and I am assuming replication usually happens on the generic threadpool to start with?
@Rishikesh1159, @mch2 I would recommend doing explicit checks rather than simply forking, to guarantee we don't end up running on the same thread:
void execute(Runnable runnable, Thread originalThread) {
    if (originalThread == Thread.currentThread()) {
        // Still on the thread that delivered the notification: fork so the
        // replay cannot grow this thread's stack.
        fork(runnable);
    } else {
        // Already handed off to a different thread: safe to run inline.
        runnable.run();
    }
}
Self-resolved one of my earlier comments, and the others are nitpicks.
Thanks @Rishikesh1159, I left a comment on explicit thread checks before we fork.
@@ -177,6 +177,39 @@ public void testNewCheckpoint_validationPassesAndReplicationFails() throws IOException
        closeShard(indexShard, false);
    }

    public void testReplicationOnDone() throws IOException {
@Rishikesh1159: It would be good to have test cases for the failure scenario, for multiple checkpoints, and an IT exercising this code.
We have already covered the failure scenario in a previous PR. For multiple checkpoints, we are already passing two checkpoints in many of our tests, so I'm not sure a separate multiple-checkpoints test case is needed, but we can add one if necessary. An IT exercising this code would be good to have, but I think we can make that a separate PR.
…aining segment replication changes (#4243), covering:
- [Segment Replication] Checkpoint Replay on Replica Shard (#3658): add latest received checkpoint and checkpoint replay logic, along with tests.
- [Segment Replication] Wire up segment replication with peer recovery and add ITs (#3743): add a null check when computing max segment version (with segment replication enabled, Lucene may not set the SegmentInfos min segment version, leaving it null); update peer recovery to set the translogUUID of replicas to the UUID generated on the primary, so that in failover scenarios a promoted replica does not hit a uuid mismatch with the value stored in user data; wire up segment replication under the feature flag with initial integration tests; add a test that replicas use the primary translog UUID with segrep; update SegmentReplicationIT to assert previous commit points are valid and SegmentInfos can be built.
- Disable shard idle with segment replication (#4118): primary shards only push out new segments on refresh, and replicas only refresh on an externally provided SegmentInfos, so shard idle is disabled to keep search requests from blocking or hanging on refresh.
- Fix isAheadOf logic for ReplicationCheckpoint comparison (#4112).
- Fix waitUntil refresh policy for segrep enabled indices (#4137).
- Add IndexShard#getLatestReplicationCheckpoint behind the segrep enable feature flag (#4163), with tests moved to SegmentReplicationIndexShardTests and segrep-enabled index settings added in TargetServiceTests and SourceHandlerTests.
- [Segment Replication] Fix OngoingSegmentReplications to key by allocation ID instead of DiscoveryNode (#4182): fixes segrep with multiple shards per node by keying ongoing replications on allocation ID, and updates cancel methods to ensure state is properly cleared on shard cancel.
- [Bug] [Segment Replication] Update store metadata recovery diff logic to ignore missing files causing an exception on the replica during copy (#4185).
- [Segment Replication] Add PrimaryMode check before publishing a checkpoint and processing a received checkpoint (#4157): reject checkpoints if the shard is in PrimaryMode, with segrep-specific tests moved to SegmentReplicationIndexShardTests and the log level in IndexShard#shouldProcessCheckpoint changed to warn.
- [Segment Replication] Wait for documents to replicate to replica shards (#4236): assertBusy on doc count rather than a thread sleep to account for replica lag in the delete operations test.
- Segment Replication - Remove unnecessary call to markAllocationIdAsInSync (#4224): removes the call on the primary shard when replication events complete, since recovery manages this initial call.
- Segment Replication - Add additional unit tests for update & delete ops (#4237).
Description
This PR allows us to store the latest received checkpoint for a replica shard and replay that checkpoint after the previous checkpoint's replication process is done.
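Conceptually, the completion path looks like this (a simplified sketch; the actual listener wiring in SegmentReplicationTargetService may differ):

// Sketch: when a replication round finishes, check whether a newer checkpoint
// arrived while it was running and, if so, re-enter checkpoint processing.
ActionListener<Void> listener = ActionListener.wrap(ignored -> {
    ReplicationCheckpoint latest = getLatestReceivedCheckpoint(replicaShard.shardId());
    if (latest != null && latest.isAheadOf(replicaShard.getLatestReplicationCheckpoint())) {
        onNewCheckpoint(latest, replicaShard);
    }
}, e -> logger.warn("segment replication failure", e));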
This is part of the process of merging our feature branch - feature/segment-replication - back into main by re-PRing our changes from the feature branch.
The breakdown of the plan to merge to main is detailed here: #2355
For added context on segment replication - here's the design proposal #2229
Issues Resolved
#3558
Check List
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.