Use Elasticsearch readiness port #7847
buildkite test this -f p=gke,E2E_TAGS=es -m s=8.1.3,s=8.13.2
I chose to create a separate file mounted next to the existing script to facilitate upgrades. Specifically, I wanted to avoid a situation where, during the upgrade, we update the scripts ConfigMap, overwriting the old probe script with the new one and thereby breaking the not-yet-upgraded nodes in the cluster.
👍
LGTM!
LGTM
buildkite test this p=gke,s=8.1.3,t=TestVersionUpgradeToLatest8x

Edit: Oops, I didn't see that you've already done it in the long version via #7847 (comment).
"command": [
    "bash",
    "-c",
    "/mnt/elastic-internal/scripts/pre-stop-hook-script.sh"
]
pre 8.2.0
pkg/controller/elasticsearch/nodespec/__snapshots__/podspec_test.snap
"command": [
    "bash",
    "-c",
    "/mnt/elastic-internal/scripts/readiness-port-script.sh"
]
New script post 8.2.0
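The diff above only shows the probe invoking a mounted script. The new script's TCP check against the readiness port could be sketched roughly as follows; the function name, the default port, and the use of bash's /dev/tcp are illustrative assumptions, not taken from this PR:

```shell
#!/usr/bin/env bash
# Hypothetical sketch, NOT the actual ECK readiness-port-script.sh.
# Elasticsearch (8.2.0+) only accepts TCP connections on readiness.port
# once the node is ready and a member of the cluster, so a plain connect
# attempt doubles as a cluster-membership check.

check_readiness_port() {
  local port="${1:-8080}"  # 8080 is an assumed default, not from the PR
  # bash's /dev/tcp pseudo-device attempts a TCP connection; the exit
  # status reflects whether the connection succeeded. timeout(1) bounds
  # how long we wait for an unresponsive node.
  timeout 3 bash -c "exec 3<>/dev/tcp/127.0.0.1/${port}" 2>/dev/null
}
```

A probe script built on this would exit with the status of check_readiness_port, so the kubelet marks the Pod ready only once the node has (re)joined the cluster.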
buildkite test this p=gke,s=8.1.3,t=TestVersionUpgradeToLatest8x

buildkite test this -f p=kind,s=8.1.3 -m t=TestVersionUpgradeSingleToLatest8x,t=TestVersionUpgradeTwoNodesToLatest8x,t=TestExternalESStackMonitoring,t=TestForceUpgradePendingPodsInOneStatefulSet,t=TestKillSingleNodeReusePV,t=TestPodTemplateValidation,t=TestRedClusterCanBeModifiedByDisablingPredicate,t=TestStackConfigPolicy,t=TestVolumeRetention
There is an edge case with single-node clusters that I did not consider:
The problem does not present itself in the same way with multi-node clusters, where at least one node is available at all times, but the edge case applies to those as well if, for example, all nodes are deleted at the same time or due to some other external factor.
buildkite test this -f p=kind,E2E_TAGS=es -m s=8.1.3,s=8.13.2

buildkite test this -f p=kind,E2E_TAGS=es -m s=8.1.3,s=8.13.2
I am thinking about the different options to address the problem:
Note that all of the options only affect the internal service; the external service used by users and clients should still be based on the readiness of the Pods. @barkbay @thbkrkr, given that you have reviewed this PR, I would be curious about your thoughts.
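One mechanism by which such an internal service could keep routing traffic regardless of Pod readiness is the Kubernetes publishNotReadyAddresses field. The fragment below only illustrates that mechanism; the Service name and selector labels are assumptions, and this is not necessarily the option chosen in this PR:

```yaml
# Hypothetical internal Service fragment. With publishNotReadyAddresses
# set, endpoints are published even for Pods failing their readiness
# probe, so the operator can still reach nodes that have not yet
# (re)joined the cluster.
apiVersion: v1
kind: Service
metadata:
  name: my-cluster-es-internal-http
spec:
  publishNotReadyAddresses: true
  selector:
    elasticsearch.k8s.elastic.co/cluster-name: my-cluster
  ports:
    - name: https
      port: 9200
```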
Could we rely on the service unless there is no

buildkite test this -f p=kind,E2E_TAGS=es -m s=8.1.3,s=8.13.2
The test run from the comment above completed successfully in 2h instead of the usual 4h, which is suspicious. I need to take a closer look at whether this is a bug or whether the fact that we optimistically make connections to non-ready pods speeds up the tests this much.
buildkite test this -f p=gke,TESTS_MATCH=* -m s=8.1.3,s=8.13.2

buildkite test this -f p=gke,t=Test -m s=8.1.3,s=8.13.2
It is indeed faster.
LGTM
I tried to compare memory usage against a baseline from the last nightly run and against one of the longer runs triggered from this PR (#7847 (comment)). I did not spot any significant change, but the comparison leaves much to be desired.
Fixes #7841
As of 8.2.0, use the new readiness.port setting to enable a TCP readiness check that is sensitive to the cluster membership of the node. This should improve cluster availability during external disruptions, e.g. due to node upgrades, as the PDB will disallow further upgrades until the most recently upgraded node has rejoined the cluster.

I chose to create a separate file mounted next to the existing script to facilitate upgrades. Specifically, I wanted to avoid a situation where, during the upgrade, we update the scripts ConfigMap, overwriting the old probe script with the new one and thereby breaking the not-yet-upgraded nodes in the cluster.
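For context, on the Elasticsearch side the feature is a single configuration line; the port value below is an illustrative assumption, and the port actually chosen by the operator in this PR may differ:

```yaml
# elasticsearch.yml fragment (Elasticsearch 8.2.0+): open a TCP port
# that only accepts connections once the node is ready and part of the
# cluster. The port number here is an assumption, not from this PR.
readiness.port: 8080
```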
The approach taken here creates some technical debt in the form of the extra script hanging around when it is not needed. But I imagine we can drop it when we stop supporting ES < 8.2.0.
An alternative approach would be to integrate a version check into the existing script, which seemed to me more complicated to reason about.
Marked as a bug because a cluster might be unavailable if master nodes are deleted while the previously deleted ones are not yet back in the cluster, which violates our promise of interruption-free ES operations when running in HA mode with multiple master nodes.