Enhance health check endpoint to support serializable request #13399

ahrtr · 2021-10-07T07:15:03Z

Fix #13340

Please let me know whether the solution is accepted. If yes, then I can continue to add some unit test for V3 server.

server/etcdserver/api/etcdhttp/metrics.go

ahrtr · 2021-10-12T14:01:03Z

@serathius and others, any comments please?

server/etcdserver/api/etcdhttp/metrics.go

ahrtr · 2021-10-26T05:12:20Z

Could anyone take a look at this PR?

serathius · 2021-10-26T07:08:05Z

cc @hexfusion @ptabor

ahrtr · 2021-11-14T20:37:26Z

Rebased the PR to resolve the conflict.

hexfusion

@ahrtr thanks overall this is an improvement one small nit. Can you please also update the 3.6 changelog?

hexfusion · 2021-11-14T21:29:14Z

server/etcdserver/api/etcdhttp/metrics.go

@@ -65,7 +67,15 @@ func NewHealthHandler(lg *zap.Logger, hfunc func(excludedAlarms AlarmSet) Health
 			return
 		}
 		excludedAlarms := getExcludedAlarms(r)
-		h := hfunc(excludedAlarms)
+		// Kubernetes Probes (i.e. livenessProbe) use "/health" endpoint to make a decision whether to restart a specific container.


minor nit: Can we condense the docs a bit, this feels overly verbose. Perhaps it would be good for folks revisiting this PR to add the full docs to the PR description above. Perhaps something like below?

Passing the query parameter "serializable=true" ensures that the health of the local etcd is checked vs the health of the cluster. This is useful for probes attempting to validate the liveness of the etcd process vs readiness of the cluster to serve requests.

@hexfusion Thanks for the comment. Fixed.

I also squashed all the commits.

ahrtr · 2021-11-14T22:06:34Z

@ahrtr thanks overall this is an improvement one small nit. Can you please also update the 3.6 changelog?

Thanks & Done!

hexfusion

thank you

spzala

lgtm
Thanks @ahrtr

pacoxu · 2022-02-17T03:34:16Z

See kubernetes/kubeadm#2567 (comment).

Could this be cherry-picked to etcd 3.5 and release a patch? Or Kubernetes has to wait for etcd 3.6 for this liveness enhancement.

ahrtr · 2022-02-17T03:53:31Z

See kubernetes/kubeadm#2567 (comment).

Could this be cherry-picked to etcd 3.5 and release a patch? Or Kubernetes has to wait for etcd 3.6 for this liveness enhancement.

Sure, let me backport the fix to 3.5.

Refs: * kubernetes/kubernetes#110072 * etcd-io/etcd#13399 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>

serathius reviewed Oct 7, 2021

View reviewed changes

server/etcdserver/api/etcdhttp/metrics.go Outdated Show resolved Hide resolved

ahrtr force-pushed the serializable_health_check branch 2 times, most recently from d9cd065 to 554d319 Compare October 8, 2021 00:42

serathius reviewed Oct 13, 2021

View reviewed changes

server/etcdserver/api/etcdhttp/metrics.go Outdated Show resolved Hide resolved

serathius reviewed Oct 13, 2021

View reviewed changes

server/etcdserver/api/etcdhttp/metrics.go Outdated Show resolved Hide resolved

serathius approved these changes Oct 13, 2021

View reviewed changes

ahrtr force-pushed the serializable_health_check branch from 058cfdd to 754cce1 Compare November 14, 2021 20:33

hexfusion approved these changes Nov 14, 2021

View reviewed changes

enhance health check endpoint to support serializable request

09ff051

ahrtr force-pushed the serializable_health_check branch from 754cce1 to 09ff051 Compare November 14, 2021 21:59

hexfusion approved these changes Nov 15, 2021

View reviewed changes

spzala approved these changes Nov 15, 2021

View reviewed changes

spzala merged commit d357f9b into etcd-io:main Nov 15, 2021

ishan16696 mentioned this pull request Dec 22, 2021

[Feature] Liveness Probe on multi-node etcd gardener/etcd-druid#280

Open

ahrtr mentioned this pull request Feb 17, 2022

[3.5] enhance health check endpoint to support serializable request #13706

Merged

This was referenced May 16, 2022

Backport pull/13525 to 3.5 #14048

Closed

[3.5] etcd server shouldn't wait for the ready notification infinitely on startup #14064

Closed

brandond added a commit to brandond/rke2 that referenced this pull request Jun 16, 2022

Use serializable health checks for etcd probes

6390b75

Refs: * kubernetes/kubernetes#110072 * etcd-io/etcd#13399 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>

brandond mentioned this pull request Jun 16, 2022

Use serializable health checks for etcd probes rancher/rke2#3073

Merged

brandond added a commit to brandond/rke2 that referenced this pull request Jun 16, 2022

Use serializable health checks for etcd probes

d312a30

Refs: * kubernetes/kubernetes#110072 * etcd-io/etcd#13399 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>

brandond added a commit to brandond/rke2 that referenced this pull request Jun 16, 2022

Use serializable health checks for etcd probes

f0e241c

Refs: * kubernetes/kubernetes#110072 * etcd-io/etcd#13399 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>

This was referenced Jun 16, 2022

[release-1.23] Use serializable health checks for etcd probes rancher/rke2#3074

Merged

[release-1.22] Use serializable health checks for etcd probes rancher/rke2#3075

Merged

brandond mentioned this pull request Jun 16, 2022

Use serializable health checks for etcd probes rancher/rke2#3076

Closed

brandond added a commit to rancher/rke2 that referenced this pull request Jun 16, 2022

Use serializable health checks for etcd probes

d2854f0

Refs: * kubernetes/kubernetes#110072 * etcd-io/etcd#13399 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>

brandond added a commit to rancher/rke2 that referenced this pull request Jun 16, 2022

Use serializable health checks for etcd probes

e075504

Refs: * kubernetes/kubernetes#110072 * etcd-io/etcd#13399 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>

brandond added a commit to rancher/rke2 that referenced this pull request Jun 16, 2022

Use serializable health checks for etcd probes

6c8d170

Refs: * kubernetes/kubernetes#110072 * etcd-io/etcd#13399 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>

bsctl mentioned this pull request Aug 9, 2022

Improve the etcd liveness probe in helm chart clastix/kamaji#110

Closed

ishan16696 mentioned this pull request Oct 6, 2022

[Upgrade] Move the etcd from v3.4.26 to v3.4.34 to v3.5.x gardener/etcd-druid#445

Open

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enhance health check endpoint to support serializable request #13399

Enhance health check endpoint to support serializable request #13399

ahrtr commented Oct 7, 2021

ahrtr commented Oct 12, 2021

ahrtr commented Oct 26, 2021

serathius commented Oct 26, 2021

ahrtr commented Nov 14, 2021

hexfusion left a comment

hexfusion Nov 14, 2021

ahrtr Nov 14, 2021

ahrtr commented Nov 14, 2021

hexfusion left a comment

spzala left a comment

pacoxu commented Feb 17, 2022

ahrtr commented Feb 17, 2022

Enhance health check endpoint to support serializable request #13399

Enhance health check endpoint to support serializable request #13399

Conversation

ahrtr commented Oct 7, 2021

ahrtr commented Oct 12, 2021

ahrtr commented Oct 26, 2021

serathius commented Oct 26, 2021

ahrtr commented Nov 14, 2021

hexfusion left a comment

Choose a reason for hiding this comment

hexfusion Nov 14, 2021

Choose a reason for hiding this comment

ahrtr Nov 14, 2021

Choose a reason for hiding this comment

ahrtr commented Nov 14, 2021

hexfusion left a comment

Choose a reason for hiding this comment

spzala left a comment

Choose a reason for hiding this comment

pacoxu commented Feb 17, 2022

ahrtr commented Feb 17, 2022