Specify health-check for each service #97

Closed
hbagdi opened this issue Feb 15, 2020 · 23 comments
Labels
kind/feature Categorizes issue or PR as related to a new feature. kind/user-story Categorizes an issue as capturing a user story lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release.

Comments

@hbagdi
Contributor

hbagdi commented Feb 15, 2020

What would you like to be added:

As a Service owner, when I'm exposing my service to other users/services outside the k8s cluster (or even inside), I want to define active health-checking behavior for my service.

Why is this needed:

If an instance of a service goes unhealthy, the proxy can skip sending requests to that specific instance and instead route traffic to the other instances. This is also useful during rolling upgrades, where a health-check endpoint can start failing so that the instance stops receiving new connections before the pod is replaced.

/kind feature
/kind user-story

@hbagdi hbagdi added the kind/feature Categorizes issue or PR as related to a new feature. label Feb 15, 2020
@k8s-ci-robot k8s-ci-robot added the kind/user-story Categorizes an issue as capturing a user story label Feb 15, 2020
@hbagdi
Contributor Author

hbagdi commented Feb 15, 2020

The new API we design should take care of two things:

  1. There can be multiple routes pointing to the same service. There should be a way of de-duplicating this data or having a priority around which configuration to use in such a case.
  2. Users should be asked to explicitly define health-check behavior via this API, and we should refrain from re-using readiness/liveness probes.

This was also discussed during the Ingress v1beta1 -> v1 transition but was punted because of the scope of the change required.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 15, 2020
@hbagdi
Contributor Author

hbagdi commented May 15, 2020

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 15, 2020
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Aug 13, 2020
@fejta-bot

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

@k8s-ci-robot k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Sep 12, 2020
@hbagdi
Contributor Author

hbagdi commented Sep 12, 2020

/remove-lifecycle rotten

@k8s-ci-robot k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Sep 12, 2020
@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Dec 12, 2020
@hbagdi
Contributor Author

hbagdi commented Dec 14, 2020

/remove-lifecycle stale
/lifecycle frozen

@hbagdi
Contributor Author

hbagdi commented Jan 12, 2021

/remove-lifecycle stale
/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jan 12, 2021
@robscott robscott added this to the v0.3.0 milestone Mar 24, 2021
@stevesloka
Contributor

Would it make sense to apply these to BackendPolicies? That would allow them to be specified once, since the backendRef could be used in multiple routes. But since BackendPolicies are namespaced, it might not fit all use cases.

Also, BackendPolicy already has a tls option to specify certs for the backend, so it is a place for higher-level customization that other users might not require if they are just using serviceName in an HTTPRoute (for example), which makes it a good spot for this addition.

We also need to think about the protocol of the service: an HTTP backend needs, say, a path, whereas that would be invalid for a TCP backend. We could possibly look at the type of object the policy is referenced from to understand which fields are valid, meaning that if it is referenced from a TCPRoute the path parameter would not apply (see the TCP variant sketched after the example below).

This idea would add a healthCheck to the backendRefs array with the following values:

  • path: Path in the upstream service to validate against (optional)
  • intervalSeconds: Number of seconds to wait between health checks (optional)
  • timeoutSeconds: Number of seconds to wait for a response before giving up (optional)
  • unhealthyThresholdCount: Number of times the health checks need to fail to become unhealthy (optional)
  • healthyThresholdCount: Number of times the health checks need to succeed to become healthy (optional)

Note: these are in seconds; we could switch to something more generic and allow users to specify milliseconds, etc.

Example yaml:

kind: BackendPolicy
apiVersion: networking.x-k8s.io/v1alpha1
metadata:
  name: policy1
spec: 
  backendRefs:
  - group: core
    kind: Service
    name: svc1
    port: 80
  healthCheck:
    path: /healthz
    intervalSeconds: 5
    timeoutSeconds: 2
    unhealthyThresholdCount: 3
    healthyThresholdCount: 5
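
Building on the protocol point above, a purely hypothetical variant for a TCP backend might omit path and rely on a plain TCP connect check. The healthCheck field and its sub-fields follow the proposal in this comment and are not part of any agreed spec; the service name and port are illustrative:

kind: BackendPolicy
apiVersion: networking.x-k8s.io/v1alpha1
metadata:
  name: policy2
spec:
  backendRefs:
  - group: core
    kind: Service
    name: tcp-svc            # hypothetical TCP backend (e.g. a database)
    port: 5432
  healthCheck:               # assumed field; no path, so a plain TCP connect check
    intervalSeconds: 5
    timeoutSeconds: 2
    unhealthyThresholdCount: 3
    healthyThresholdCount: 5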

@howardjohn
Contributor

One concern with backend policies is that it's not explicit when we should check it. This could be solvable by a comment in the spec, but we need to consider it.

For example, if I have 10 Gateways, which ones should health check policy1? Is it all of them, any of them with routes to policy1, etc.? If all Gateways have routes to it, we will now get 10x the health check load.

The reason I care is that we will most likely have lots of Gateways, so I want to make sure the spec is clear here.

@stevesloka
Contributor

Yeah, having multiple Gateways makes it tricky; I'd expect each Gateway to do its own health checks against the resource.

@hbagdi
Contributor Author

hbagdi commented Apr 12, 2021

Interesting. I think "who" does the health-checking is up to the implementation. Whether you use a health-checking service, a proxy, or some more complicated distributed mechanism is up to you and should be transparent to the end user. What matters more here is what happens when health-checks are failing and how different implementations react to that.

As I think more about this, BackendPolicy feels like a resource for defining configuration of different types, not behavior. Making behavior consistent in this area seems infeasible.

@howardjohn
Contributor

Another question, which may be obvious but isn't to me: why do we need this at the gateway level instead of using pod readinessProbes?

@hbagdi
Contributor Author

hbagdi commented Apr 12, 2021

why do we need this at the gateway level instead of using pod readinessProbes?

  • The network view is sometimes different from the kubelet view.
  • Pods can be ready while health-checks for traffic use a different probe; service readiness as seen from the gateway can go in and out of rotation more frequently than pod readiness probes.
  • I think pod probes are not strictly network-specific; users can configure a shell command/script as well (a minimal contrast example follows this list).
  • The feedback loop of (a) a pod failing a check, (b) the kubelet noticing it and updating the API server, (c) core Kubernetes controller(s) noticing that and updating endpoints accordingly, (d) the controller for Gateway/Ingress noticing the change, and (e) the proxy hot path finally reflecting the change can sometimes be slower than desired.
  • A service-level health-check is sometimes a higher-order concept that is not specific to one pod but spans pods of different deployments.
  • A health-check defined this way can be seen as a way to standardize health-checking in a large environment (lots of services owned by lots of teams).
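
For contrast, here is a minimal sketch of a pod readinessProbe that runs a shell command; this is the kind of probe a gateway data plane cannot reuse as a network health check. The pod name, image, and script are hypothetical, while the probe fields are the standard core/v1 Pod API:

apiVersion: v1
kind: Pod
metadata:
  name: example-app              # hypothetical pod
spec:
  containers:
  - name: app
    image: example/app:latest    # hypothetical image
    readinessProbe:
      exec:                      # runs inside the container, invisible to a proxy
        command: ["/bin/sh", "-c", "/opt/app/check-deps.sh"]
      periodSeconds: 10
      failureThreshold: 3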

@robscott
Member

To follow up on this, I've written up a quick doc to try to summarize the portability of health check config across implementations. There's a spreadsheet that covers some of the most commonly supported features. I likely got at least some things wrong here, so please correct me if I did.

At a high level, most implementations support HTTP health checks with the option to specify Path, Timeout, and Interval. The next most supported fields are Hostname and Healthy/Unhealthy Thresholds.
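
As a purely illustrative sketch, a healthCheck restricted to that widely supported subset might look like the following. The field names simply follow the proposal earlier in this thread (plus an assumed hostname field) and are not an agreed spec; where this fragment attaches (BackendPolicy or otherwise) is still open:

healthCheck:
  path: /healthz
  hostname: svc1.example.com    # hypothetical Host header for the check
  timeoutSeconds: 2
  intervalSeconds: 10
  healthyThresholdCount: 2
  unhealthyThresholdCount: 3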

@costinm

costinm commented Apr 14, 2021

This seems to match what the Pod readiness probe supports. Any reason not to just use the exact same thing (as spec and API), and default to the HTTP readiness probe in the pod, if it exists?

I assume that, like all K8s services, gateways will be required to respect the K8s readiness semantics as well, i.e. if the kubelet (and the associated EndpointSlice/Endpoints) marks a pod not ready, that takes priority and no traffic will be sent to it.

The network view may be different from the kubelet view, but that doesn't mean users need to maintain 2 different endpoints.

I am a bit concerned about the scalability of such a system: distributed health checks in a mesh with multiple networks, security, etc. are pretty difficult. If we are worried about the feedback loop from K8s, we should also worry about the load and feedback loop of the health check system if it is mandated by the Gateway API.
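
To make the probe-reuse suggestion above concrete, a hypothetical BackendPolicy could embed a field shaped like the core/v1 Probe. The healthCheck field itself is an assumption for illustration; only its sub-fields (httpGet, periodSeconds, timeoutSeconds, failureThreshold, successThreshold) come from the existing Pod probe API:

kind: BackendPolicy
apiVersion: networking.x-k8s.io/v1alpha1
metadata:
  name: policy1
spec:
  backendRefs:
  - group: core
    kind: Service
    name: svc1
    port: 80
  healthCheck:               # assumed field reusing the core/v1 Probe shape
    httpGet:
      path: /healthz
      port: 8080
    periodSeconds: 10
    timeoutSeconds: 2
    failureThreshold: 3
    successThreshold: 1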

@robscott robscott added the priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. label Apr 14, 2021
@hbagdi
Contributor Author

hbagdi commented Apr 21, 2021

I assume that, like all K8s services, gateways will be required to respect the K8s readiness semantics as well, i.e. if the kubelet (and the associated EndpointSlice/Endpoints) marks a pod not ready, that takes priority and no traffic will be sent to it.

This actually is a great point and requires some more discussion, probably in an issue of its own. Some Ingress controllers route traffic to the VIP of the k8s Service, while others route traffic directly to the endpoints. Which behavior do we expect from implementations of the Gateway API? cc @mark-church @bowei @robscott @danehans

The network view may be different from the kubelet view, but that doesn't mean users need to maintain 2 different endpoints.

Users can maintain the same endpoint if that's what they want. We are not asking the users to maintain a different endpoint. Is that acceptable @costinm ?

I am a bit concerned about the scalability of such a system: distributed health checks in a mesh with multiple networks, security, etc. are pretty difficult. If we are worried about the feedback loop from K8s, we should also worry about the load and feedback loop of the health check system if it is mandated by the Gateway API.

I think this point is similar to John's point above.
Costin, if the API is not prescriptive about how the health-checks are performed, is that okay?

@robscott robscott removed this from the v0.3.0 milestone Apr 23, 2021
@youngnick
Contributor

I think the thing that we would probably need to be a little prescriptive about is that the Gateway (however it's implemented) should be doing its own active checking if health checks are specified.

That way we can be clear about how these checks are distinct in behavior from adding a readiness check to a Pod and having its Endpoints drop out of visibility that way.

@howardjohn
Contributor

I don't think it's clear what "the Gateway" is, given the API is implementation-agnostic. Does it mean all instances of the gateway (pods, for in-cluster workloads) must send requests to the pods to determine if they're healthy? Does it mean one of them can and then share the information with the others? Can I spin up a distinct workload that does the health checks and reports them back? Stretching this really far, can that distinct workload be the kubelet, with "reporting it back" meaning "put it in the pod readiness status"?

@youngnick
Contributor

That's a great point. I was thinking of it more in terms of saying something like "if you specify a health check here, it must be done by some mechanism other than a readiness check". You may use a readiness check as well (to filter endpoints or something), but asking here implies having the Gateway controller do something more active than that. Note that this isn't taking a stance on active health-checks vs. passive ones, just that it's something that's not the kubelet.

I do think it's fair to talk about "the Gateway" as a thing since, at its heart, a Gateway describes some piece of network infrastructure that translates traffic that doesn't know about cluster networking internals into something that does. (I've been working on this as a Gateway definition, but I don't think it's all the way there.)

@hbagdi
Contributor Author

hbagdi commented Aug 24, 2021

While searching for a solution, we were unable to come up with a portable way to support this feature.
As a compromise, we came up with an easy and portable way to support such implementation-specific features in a consistent manner. The solution is documented in #713.

If there are more portable ways to tackle this, we would love to hear about them.

For now, I'll close this issue.
/close

@k8s-ci-robot
Contributor

@hbagdi: Closing this issue.

In response to this:

While searching for a solution, we were unable to come up with a portable way to support this feature.
As a compromise, we came up with an easy and portable way to support such implementation-specific features in a consistent manner. The solution is documented in #713.

If there are more portable ways to tackle this, we would love to hear about them.

For now, I'll close this issue.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
