Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix snapshot repository registration failure in mixed OpenSearch version setups #16831

Merged
merged 1 commit into from
Dec 16, 2024

Conversation

ashking94
Copy link
Member

@ashking94 ashking94 commented Dec 11, 2024

Description

This PR addresses issue #16830, where snapshot repository registration fails in clusters with mixed OpenSearch versions, specifically when nodes running OS 2.17 are present alongside nodes running earlier 1.x or 2.x versions.

Problem:

  • Repository registration fails due to different handling of test paths in OS 2.17 compared to earlier versions.
  • This issue occurs during version upgrades, causing snapshot functionality to break.

Solution:

  • Implement version-aware logic for handling test paths.
  • Ensure backward compatibility with earlier OpenSearch versions.
  • Add checks to use the appropriate test path handling method based on the minimum node version in the cluster.

Changes:

  1. Modified code to include version checks.
  2. Updated test path handling logic to accommodate mixed version scenarios.

Testing:

  • Tested repository registration in clusters with:
    • OS 2.15 and OS 2.17 nodes
  • Verified successful repository registration in all scenarios.
  • Ensured existing functionality remains intact for homogeneous version clusters.

This fix ensures smooth operation of snapshot repositories during cluster upgrades and in environments where different OpenSearch versions coexist.

Related Issues

Resolves #16830

Check List

  • Functionality includes testing.
  • API changes companion pull request created, if applicable.
  • Public documentation issue/PR created, if applicable.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Copy link
Contributor

❌ Gradle check result for 2c427aa: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@ashking94 ashking94 changed the title Handle version during repository registration for prefix mode verific… Handle version during repository registration for prefix mode verification Dec 13, 2024
…ation

Signed-off-by: Ashish Singh <ssashish@amazon.com>
@ashking94 ashking94 force-pushed the repo-verification-fix branch from 2c427aa to 51a6e67 Compare December 13, 2024 16:12
@ashking94 ashking94 changed the title Handle version during repository registration for prefix mode verification Fix snapshot repository registration failure in mixed OpenSearch version setups Dec 13, 2024
@github-actions github-actions bot added bug Something isn't working Storage:Snapshots labels Dec 13, 2024
Copy link
Member Author

@ashking94 ashking94 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's difficult to add unit tests as the BlobStoreRepositoryTest.java is actually a single node spin up and testing. There are existing integration tests that invokes this flow. The code is partially tested in the ITs that spins up the remote store nodes.

Copy link
Contributor

❌ Gradle check result for 51a6e67: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@ashking94
Copy link
Member Author

❌ Gradle check result for 51a6e67: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Flaky test - #15831

Copy link
Contributor

❌ Gradle check result for 51a6e67: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Copy link
Contributor

❌ Gradle check result for 51a6e67: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@ashking94
Copy link
Member Author

❌ Gradle check result for 51a6e67: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Flaky test - #16015

Copy link
Contributor

✅ Gradle check result for 51a6e67: SUCCESS

Copy link

codecov bot commented Dec 14, 2024

Codecov Report

Attention: Patch coverage is 40.00000% with 3 lines in your changes missing coverage. Please review.

Project coverage is 72.15%. Comparing base (5ba909a) to head (51a6e67).
Report is 8 commits behind head on main.

Files with missing lines Patch % Lines
...ch/repositories/blobstore/BlobStoreRepository.java 33.33% 0 Missing and 2 partials ⚠️
...nsearch/cluster/service/ClusterApplierService.java 0.00% 0 Missing and 1 partial ⚠️
Additional details and impacted files
@@             Coverage Diff              @@
##               main   #16831      +/-   ##
============================================
+ Coverage     72.11%   72.15%   +0.04%     
- Complexity    65237    65271      +34     
============================================
  Files          5318     5318              
  Lines        304003   304015      +12     
  Branches      43992    43995       +3     
============================================
+ Hits         219228   219377     +149     
+ Misses        66874    66649     -225     
- Partials      17901    17989      +88     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

@ashking94
Copy link
Member Author

Codecov Report

Attention: Patch coverage is 40.00000% with 3 lines in your changes missing coverage. Please review.

Project coverage is 72.15%. Comparing base (5ba909a) to head (51a6e67).
Report is 7 commits behind head on main.

Files with missing lines Patch % Lines
...ch/repositories/blobstore/BlobStoreRepository.java 33.33% 0 Missing and 2 partials ⚠️
...nsearch/cluster/service/ClusterApplierService.java 0.00% 0 Missing and 1 partial ⚠️
Additional details and impacted files
☔ View full report in Codecov by Sentry. 📢 Have feedback on the report? Share it here.

Some of the changes needs mixed version tests for which there is an issue created.

@ashking94 ashking94 merged commit d37cc9b into opensearch-project:main Dec 16, 2024
66 of 71 checks passed
@gbbafna gbbafna added the backport 2.x Backport to 2.x branch label Dec 16, 2024
@opensearch-trigger-bot
Copy link
Contributor

The backport to 2.x failed:

The process '/usr/bin/git' failed with exit code 128

To backport manually, run these commands in your terminal:

# Navigate to the root of your repository
cd $(git rev-parse --show-toplevel)
# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add ../.worktrees/OpenSearch/backport-2.x 2.x
# Navigate to the new working tree
pushd ../.worktrees/OpenSearch/backport-2.x
# Create a new branch
git switch --create backport/backport-16831-to-2.x
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 d37cc9b32600faef17b3c898f8c86b0fd7640d72
# Push it to GitHub
git push --set-upstream origin backport/backport-16831-to-2.x
# Go back to the original working tree
popd
# Delete the working tree
git worktree remove ../.worktrees/OpenSearch/backport-2.x

Then, create a pull request where the base branch is 2.x and the compare/head branch is backport/backport-16831-to-2.x.

@ashking94
Copy link
Member Author

The backport to 2.x failed:

The process '/usr/bin/git' failed with exit code 128

To backport manually, run these commands in your terminal:

# Navigate to the root of your repository
cd $(git rev-parse --show-toplevel)
# Fetch latest updates from GitHub
git fetch
# Create a new working tree
git worktree add ../.worktrees/OpenSearch/backport-2.x 2.x
# Navigate to the new working tree
pushd ../.worktrees/OpenSearch/backport-2.x
# Create a new branch
git switch --create backport/backport-16831-to-2.x
# Cherry-pick the merged commit of this pull request and resolve the conflicts
git cherry-pick -x --mainline 1 d37cc9b32600faef17b3c898f8c86b0fd7640d72
# Push it to GitHub
git push --set-upstream origin backport/backport-16831-to-2.x
# Go back to the original working tree
popd
# Delete the working tree
git worktree remove ../.worktrees/OpenSearch/backport-2.x

Then, create a pull request where the base branch is 2.x and the compare/head branch is backport/backport-16831-to-2.x.

Manually backporting this PR.

ashking94 added a commit to ashking94/OpenSearch that referenced this pull request Dec 16, 2024
@ashking94
Copy link
Member Author

Created manual backport #16849

ashking94 added a commit to ashking94/OpenSearch that referenced this pull request Dec 16, 2024
ashking94 added a commit to ashking94/OpenSearch that referenced this pull request Dec 17, 2024
ashking94 added a commit to ashking94/OpenSearch that referenced this pull request Dec 17, 2024
ashking94 added a commit that referenced this pull request Dec 18, 2024
…ation (#16831) (#16849)

Signed-off-by: Ashish Singh <ssashish@amazon.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: ✅ Done
Development

Successfully merging this pull request may close these issues.

[BUG] Snapshot repository fails if minNode version is below OS 2.17 in mixed os version setup
3 participants