Add pre-upgrade check to test cluster routing allocation is enabled #39340

bizybot · 2019-02-25T06:36:54Z

When following the steps in the upgrade guide
https://www.elastic.co/guide/en/elastic-stack/6.6/upgrading-elastic-stack.html,
if we disable cluster shard allocation but fail to re-enable it after
upgrading the nodes and plugins, the next step, upgrading internal
indices, fails. Because we did not check the bulk response of the reindex
request, we deleted the old index on the assumption that the new index had
been created. This is fatal, as we cannot recover from this state.

This commit adds a pre-upgrade check that tests the cluster shard
allocation setting and fails the upgrade if allocation is disabled. If
there are search or bulk failures, we remove the read-only block and
fail the upgrade index request.
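
A minimal, self-contained sketch of the two behaviours described above; this is not the code in this PR. The setting key `cluster.routing.allocation.enable` and its default `all` are real Elasticsearch settings. The class `UpgradeCheckSketch`, the `IndexOps` interface, and both method names are hypothetical stand-ins. Treating only the value `none` as "disabled" is also an assumption.

```java
import java.util.Collections;
import java.util.List;
import java.util.Map;

public class UpgradeCheckSketch {
    // Real Elasticsearch setting key; every other name in this class is hypothetical.
    static final String ALLOCATION_ENABLE = "cluster.routing.allocation.enable";

    /** Fails fast when shard allocation is switched off (assumed here to mean "none"). */
    static void checkAllocationEnabled(Map<String, String> clusterSettings) {
        // An unset value falls back to the documented default, "all".
        String value = clusterSettings.getOrDefault(ALLOCATION_ENABLE, "all");
        if ("none".equalsIgnoreCase(value)) {
            throw new IllegalStateException(
                "pre-upgrade check failed: enable shard allocation via [" + ALLOCATION_ENABLE + "]");
        }
    }

    /** Hypothetical stand-in for the index operations a reindexer would perform. */
    interface IndexOps {
        void removeReadOnlyBlock(String index);
        void deleteIndex(String index);
    }

    /** Deletes the old index only when the reindex reported no search or bulk failures. */
    static void finishReindex(String oldIndex, List<String> searchFailures,
                              List<String> bulkFailures, IndexOps ops) {
        if (!searchFailures.isEmpty() || !bulkFailures.isEmpty()) {
            // Make the old index writable again and keep it, then fail the request.
            ops.removeReadOnlyBlock(oldIndex);
            throw new IllegalStateException("reindex of [" + oldIndex + "] failed; old index kept");
        }
        ops.deleteIndex(oldIndex);
    }

    public static void main(String[] args) {
        // No explicit setting: the default applies and the check passes.
        checkAllocationEnabled(Collections.emptyMap());
        try {
            checkAllocationEnabled(Collections.singletonMap(ALLOCATION_ENABLE, "none"));
        } catch (IllegalStateException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

The point of the ordering in `finishReindex` is that the old index is deleted only on the success path, so a failed reindex leaves the original data in place and writable.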

Closes #39339

elasticmachine · 2019-02-25T06:36:55Z

Pinging @elastic/es-core-features

albertzaharovits

LGTM! See if you can find someone from core-infra-features to also have a look.

albertzaharovits · 2019-03-04T20:49:40Z

x-pack/plugin/upgrade/src/main/java/org/elasticsearch/xpack/upgrade/InternalIndexReindexer.java

@@ -76,32 +80,61 @@ public void upgrade(TaskId task, String index, ClusterState clusterState, Action
    private void innerUpgrade(ParentTaskAssigningClient parentAwareClient, String index, ClusterState clusterState,
                              ActionListener<BulkByScrollResponse> listener) {
        String newIndex = index + "-" + version;
+        logger.trace("upgrading index {} to new index {}", index, newIndex);
        try {
            checkMasterAndDataNodeVersion(clusterState);


This is where I would've put the allocation check, but no need to amend the PR for this. The reason is to keep all the checks in a single place.

imotov

LGTM. Left one comment about adding a testing-only method.

imotov · 2019-03-05T15:46:28Z

x-pack/plugin/upgrade/src/main/java/org/elasticsearch/xpack/upgrade/IndexUpgradeCheck.java

@@ -106,4 +118,9 @@ public void upgrade(TaskId task, IndexMetaData indexMetaData, ClusterState state
                        ActionListener<BulkByScrollResponse> listener) {
        reindexer.upgrade(task, indexMetaData.getIndex().getName(), state, listener);
    }
+
+    // pkg scope for testing
+    InternalIndexReindexer getInternalIndexReindexer() {


That concerns me a bit. Could you add a comment explaining why this is necessary?

tvernum · 2019-03-07T02:12:40Z

@bizybot I think this should be backported to 5.6 as well. We recommend that users run the /_xpack/migration/upgrade API while they're on 5.6, so it is important to fix it there.

* elastic/master:
  Add pre-upgrade check to test cluster routing allocation is enabled (elastic#39340)
  Update logstash-management.json to use typeless template (elastic#38653)
  Small simplifications to mapping validation. (elastic#39777)
  Update distribution build instructions to reflect file names with OS/architecture classifiers. (elastic#39762)
  Give jspawnhelper execute permissions in bundled JDK (elastic#39787)
  Maintain step order for ILM trace logging (elastic#39522)
  [ML-DataFrame] fix wire serialization issues in data frame response objects (elastic#39790)
  fix index refresh in test within 20_mix_typeless_typeful (elastic#39198)
  Combine overriddenOps and skippedOps in translog (elastic#39771)

…39340) (#39815)

…39340) (#39816)

…39340) (#39817)

bizybot added the >bug, >upgrade, :Data Management/Indices APIs, v6.7.0, v8.0.0, and v7.2.0 labels Feb 25, 2019

bizybot requested review from imotov and albertzaharovits February 25, 2019 06:36

danielmitterdorfer removed the >upgrade label Feb 27, 2019

Yogesh Gaikwad added 3 commits February 27, 2019 22:30

Merge branch 'master' into fix-internal-index-reindexer

b373c17

revert unwanted change

c17c8a8

revert unwanted change

9f49851

albertzaharovits approved these changes Mar 4, 2019

imotov approved these changes Mar 5, 2019

bizybot added the v5.6.16 label Mar 8, 2019

bizybot merged commit bae7e71 into elastic:master Mar 8, 2019

jakelandis added v8.0.0-alpha1 and removed v8.0.0 labels Jul 26, 2021
